際際滷

際際滷Share a Scribd company logo
AIDR	Tutorial	
Muhammad	Imran	
Research	Scien1st	
Qatar	Compu1ng	Research	Ins1tute,	HBKU	
Doha,	Qatar	
h"p://aidr.qcri.org/
Outline	
≒ Data	collec2on	in	AIDR	
≒ Data	classi鍖ca2on	in	AIDR	
≒ Data	view/download	in	AIDR
Data	Collec2on	in	AIDR	
≒ Twi:er	data	collec2on	strategies	that	AIDR	supports	
 By	keywords	
 By	geographical	regions	
≒ Strict:	coordinates	strictly	inside	geo	boundaries	
≒ Approximate:	tweets	from	a	place	that	overlaps	with	the	geo	
boundaries.	
 By	following	Twi:er	users	
 By	keywords	+	regions	
≒ Tweets	that	match	any	of	the	keywords	and	within	the	geo	
boundaries.
Data	Collec2on	Using	Keywords	
≒ Keywords	limit	=	400	
≒ One	keyword	could	a	single	word	like	
Su鍖olk	or	a	phrase	Su鍖olk	accident	
≒ 1	keyword/phrase	cannot	be	more	than	60	
bytes	(1	char	=	1	byte)	
≒ Generic	keywords	collect	irrelevant	tweets	
≒ Speci鍖c	keywords	most	likely	collect	relevant	
tweets
Keywords	Examples
Loca2on-based	Collec2on	
≒ Bounding	boxes	do	not	act	as	鍖lters	for	other	鍖lter	
parameters.	For	example	:	
keyword=twi:er&loca2ons=-122.75,36.8,-121.75,37.8	
	would	match	any	tweets	containing	the	term	Twi:er	(even	
	non-geo	tweets)	OR	coming	from	the	San	Francisco	area.
Following	Twi:er	Users	
For	each	user	speci鍖ed,	the	tool	will	collect:	
≒ Tweets	created	by	the	user.	
≒ Tweets	which	are	retweeted	by	the	user.	
≒ Replies	to	any	Tweet	created	by	the	user.	
≒ Retweets	of	any	Tweet	created	by	the	user.	
≒ Manual	replies,	created	without	pressing	a	reply	bu:on	(e.g.	
@twi:erapi	I	agree).	
The	tool	will	not	contain:	
≒ Tweets	men2oning	the	user	(e.g.	Hello	@twi:erapi!).	
≒ Manual	Retweets	created	without	pressing	a	Retweet	bu:on	(e.g.	
RT	@twi:erapi	The	API	is	great).	
≒ Tweets	by	protected	users.	
Use	comma-separated	list	of	TwiFer	user	id	(hFp://geFwiFerid.com/)
AIDR Tutorial (Artificial Intelligence for Disaster Response)
Classi鍖er	UI
Detailed	Informa2on	of	Classi鍖ers
Data	Classi鍖ca2on	in	AIDR	
≒ De鍖ne	classi鍖ers	(name,	descrip2on)	
De鍖ne	labels	(name,	descrip2on)	
Having	a	miscellaneous	category	will	be	helpful	
≒ Wait	around	15-20	minutes	(for	fast	
collec2ons)	and	30-40	minutes	(for	slow	
collec2on)	
≒ Start	tagging
Classi鍖er	Genera2on	
≒ Check	the	classi鍖er	status	(UI)	
 First	classi鍖er/model	will	be	up	ager	50	labeled	
tweets,	ideally	equally	distributed	among	labels	
 If	no	model	appears	ager	50	tags,	keep	tagging	
≒ Human-tagged	items	(the	more	the	be:er)	
≒ 40	more	needed	to	re-train	(next	classi鍖er	target)	
≒ Machine-tagged	items	(keep	an	eye	on	
misclassi鍖ca2ons)	
≒ Quality	(ideally	should	be	90	<	AUC	!=	100)

More Related Content

What's hot (6)

PPT
Huri Search 2008 Huridocs
huridocs
PPTX
Managing errata and retractions with CrossMark
Crossref
PPT
PoolParty SKOS and Linked Data
Andreas Blumauer
PPT
A Privacy Preference Ontology (PPO) for Linked Data
Owen Sacco
PDF
New Initiatives - Geoffrey Bilder - London LIVE 2017
Crossref
PPTX
Session 02 - Object Identification - Part 1
SiddharthSelenium
Huri Search 2008 Huridocs
huridocs
Managing errata and retractions with CrossMark
Crossref
PoolParty SKOS and Linked Data
Andreas Blumauer
A Privacy Preference Ontology (PPO) for Linked Data
Owen Sacco
New Initiatives - Geoffrey Bilder - London LIVE 2017
Crossref
Session 02 - Object Identification - Part 1
SiddharthSelenium

More from Muhammad Imran (16)

PPTX
Processing Social Media Messages in Mass Emergency: A Survey
Muhammad Imran
PPTX
Damage Assessment from Social Media Imagery Data During Disasters
Muhammad Imran
PPTX
Image4Act: Online Social Media Image Processing for Disaster Response
Muhammad Imran
PDF
Real-Time Processing of Social Media Content for Social Good
Muhammad Imran
PPTX
A Robust Framework for Classifying Evolving Document Streams in an Expert-Mac...
Muhammad Imran
PPTX
Summarizing Situational Tweets in Crisis Scenario
Muhammad Imran
PDF
The Role of Social Media and Artificial Intelligence for Disaster Response
Muhammad Imran
PPTX
Introduction to Machine Learning: An Application to Disaster Response
Muhammad Imran
PPTX
Artificial Intelligence for Disaster Response
Muhammad Imran
PDF
A Real-time Heuristic-based Unsupervised Method for Name Disambiguation in Di...
Muhammad Imran
PDF
Coordinating Human and Machine Intelligence to Classify Microblog Communica0o...
Muhammad Imran
PPTX
Tweet4act: Using Incident-Specific Profiles for Classifying Crisis-Related Me...
Muhammad Imran
PPTX
Extracting Information Nuggets from Disaster-Related Messages in Social Media
Muhammad Imran
PPTX
Domain Specific Mashups
Muhammad Imran
PPTX
Reseval Mashup Platform Talk at SECO
Muhammad Imran
PPTX
ResEval: Resource-oriented Research Impact Evaluation platform
Muhammad Imran
Processing Social Media Messages in Mass Emergency: A Survey
Muhammad Imran
Damage Assessment from Social Media Imagery Data During Disasters
Muhammad Imran
Image4Act: Online Social Media Image Processing for Disaster Response
Muhammad Imran
Real-Time Processing of Social Media Content for Social Good
Muhammad Imran
A Robust Framework for Classifying Evolving Document Streams in an Expert-Mac...
Muhammad Imran
Summarizing Situational Tweets in Crisis Scenario
Muhammad Imran
The Role of Social Media and Artificial Intelligence for Disaster Response
Muhammad Imran
Introduction to Machine Learning: An Application to Disaster Response
Muhammad Imran
Artificial Intelligence for Disaster Response
Muhammad Imran
A Real-time Heuristic-based Unsupervised Method for Name Disambiguation in Di...
Muhammad Imran
Coordinating Human and Machine Intelligence to Classify Microblog Communica0o...
Muhammad Imran
Tweet4act: Using Incident-Specific Profiles for Classifying Crisis-Related Me...
Muhammad Imran
Extracting Information Nuggets from Disaster-Related Messages in Social Media
Muhammad Imran
Domain Specific Mashups
Muhammad Imran
Reseval Mashup Platform Talk at SECO
Muhammad Imran
ResEval: Resource-oriented Research Impact Evaluation platform
Muhammad Imran
Ad

Recently uploaded (20)

PPTX
UserCon Belgium: Honey, VMware increased my bill
stijn40
PPTX
Enabling the Digital Artisan keynote at ICOCI 2025
Alan Dix
PDF
Redefining Work in the Age of AI - What to expect? How to prepare? Why it mat...
Malinda Kapuruge
PDF
Database Benchmarking for Performance Masterclass: Session 2 - Data Modeling ...
ScyllaDB
PDF
The Growing Value and Application of FME & GenAI
Safe Software
PDF
Scaling i.MX Applications Processors Native Edge AI with Discrete AI Accele...
Edge AI and Vision Alliance
PPTX
Curietech AI in action - Accelerate MuleSoft development
shyamraj55
PDF
Optimizing the trajectory of a wheel loader working in short loading cycles
Reno Filla
PPTX
Paycifi - Programmable Trust_Breakfast_PPTXT
FinTech Belgium
PDF
2025_06_18 - OpenMetadata Community Meeting.pdf
OpenMetadata
PDF
Cracking the Code - Unveiling Synergies Between Open Source Security and AI.pdf
Priyanka Aash
PDF
MPU+: A Transformative Solution for Next-Gen AI at the Edge, a Presentation...
Edge AI and Vision Alliance
PDF
Quantum AI Discoveries: Fractal Patterns Consciousness and Cyclical Universes
Saikat Basu
PDF
Enhancing Environmental Monitoring with Real-Time Data Integration: Leveragin...
Safe Software
PPTX
叶Wondershare Filmora Crack 14.0.7 + Key Download 2025
sebastian aliya
PPTX
Smarter Governance with AI: What Every Board Needs to Know
OnBoard
PDF
Why aren't you using FME Flow's CPU Time?
Safe Software
PPTX
New ThousandEyes Product Innovations: Cisco Live June 2025
ThousandEyes
PDF
ArcGIS Utility Network Migration - The Hunter Water Story
Safe Software
PDF
FME as an Orchestration Tool with Principles From Data Gravity
Safe Software
UserCon Belgium: Honey, VMware increased my bill
stijn40
Enabling the Digital Artisan keynote at ICOCI 2025
Alan Dix
Redefining Work in the Age of AI - What to expect? How to prepare? Why it mat...
Malinda Kapuruge
Database Benchmarking for Performance Masterclass: Session 2 - Data Modeling ...
ScyllaDB
The Growing Value and Application of FME & GenAI
Safe Software
Scaling i.MX Applications Processors Native Edge AI with Discrete AI Accele...
Edge AI and Vision Alliance
Curietech AI in action - Accelerate MuleSoft development
shyamraj55
Optimizing the trajectory of a wheel loader working in short loading cycles
Reno Filla
Paycifi - Programmable Trust_Breakfast_PPTXT
FinTech Belgium
2025_06_18 - OpenMetadata Community Meeting.pdf
OpenMetadata
Cracking the Code - Unveiling Synergies Between Open Source Security and AI.pdf
Priyanka Aash
MPU+: A Transformative Solution for Next-Gen AI at the Edge, a Presentation...
Edge AI and Vision Alliance
Quantum AI Discoveries: Fractal Patterns Consciousness and Cyclical Universes
Saikat Basu
Enhancing Environmental Monitoring with Real-Time Data Integration: Leveragin...
Safe Software
叶Wondershare Filmora Crack 14.0.7 + Key Download 2025
sebastian aliya
Smarter Governance with AI: What Every Board Needs to Know
OnBoard
Why aren't you using FME Flow's CPU Time?
Safe Software
New ThousandEyes Product Innovations: Cisco Live June 2025
ThousandEyes
ArcGIS Utility Network Migration - The Hunter Water Story
Safe Software
FME as an Orchestration Tool with Principles From Data Gravity
Safe Software
Ad

AIDR Tutorial (Artificial Intelligence for Disaster Response)