ݺߣ

ݺߣShare a Scribd company logo
Fashion 10000
An Enriched Dataset of
Fashion and Clothing
Presentation: Michael Riegler, Klagenfurt University & TU Delft
Babak Loni, TU Delft
Lei Yen Cheung, TU Delft
Alessandro Bozzon, TU Delft
Luke Gottlieb, ICSI
Martha Larson, TU Delft
Table of Content
? Introduction
? Dataset Collection
? Dataset Annotation
C Statistics
? Applications of Dataset
? Conclusion
The Dataset
? Social Images
? At least 10000 fashion-
related images
? Social metadata
? Creative Common images
? Annotated with different
labels
The Collection
Wikipedia
470 Fashion
Categories
Flickr
- Query only CC
attribution images
- Query should also
appear in tags
- Top relevant images
32K Images
262 Categories
Flickr Fashion
10000
+ MTurk Annotations
+ Metadata
Metadata
? Collected in xml and csv format
C Title, description, owner, Tags, Location,
geo-parameters
? Additional metadata: Info, Geos, Context,
Tags, Notes, Favorites, Urls, Comments
General Statistics
Pairs fashion item, photo 32,398
Number of distinct fashion categories 262
Max/avg/min nr of photos per fashion item 200/ 122.95 / 10
Number of photos with geo annotations 7,933
Total number of comments 58,578
Max/avg/min nr of comments per photo 575 / 7.35/ 1
Total number of tags, photo pairs 460,907
Total number of distinct tags 56,275
Max/avg/min nr of tags per photo 136/ 15.15/ 1
Total number of notes, photo pairs 5,892
Max/avg/min nr of notes per photo 195/ 5.31/ 1
Total number of favorites 37,131
Max/avg/min nr of favorites per photo 20/ 3.61/ 1
Total number of contexts 110,505
Max/avg/min nr of contexts per photo 206/ 3.93/ 1
General Statistics
Pairs fashion item, photo 32,398
Number of distinct fashion categories 262
Max/avg/min nr of photos per fashion item 200/ 122.95 / 10
Number of photos with geo annotations 7,933
Total number of comments 58,578
Max/avg/min nr of comments per photo 575 / 7.35/ 1
Total number of tags, photo pairs 460,907
Total number of distinct tags 56,275
Max/avg/min nr of tags per photo 136/ 15.15/ 1
Total number of notes, photo pairs 5,892
Max/avg/min nr of notes per photo 195/ 5.31/ 1
Total number of favorites 37,131
Max/avg/min nr of favorites per photo 20/ 3.61/ 1
Total number of contexts 110,505
Max/avg/min nr of contexts per photo 206/ 3.93/ 1
General Statistics
Pairs fashion item, photo 32,398
Number of distinct fashion categories 262
Max/avg/min nr of photos per fashion item 200/ 122.95 / 10
Number of photos with geo annotations 7,933
Total number of comments 58,578
Max/avg/min nr of comments per photo 575 / 7.35/ 1
Total number of tags, photo pairs 460,907
Total number of distinct tags 56,275
Max/avg/min nr of tags per photo 136/ 15.15/ 1
Total number of notes, photo pairs 5,892
Max/avg/min nr of notes per photo 195/ 5.31/ 1
Total number of favorites 37,131
Max/avg/min nr of favorites per photo 20/ 3.61/ 1
Total number of contexts 110,505
Max/avg/min nr of contexts per photo 206/ 3.93/ 1
General Statistics
Pairs fashion item, photo 32,398
Number of distinct fashion categories 262
Max/avg/min nr of photos per fashion item 200/ 122.95 / 10
Number of photos with geo annotations7,933
Total number of comments 58,578
Max/avg/min nr of comments per photo 575 / 7.35/ 1
Total number of tags, photo pairs 460,907
Total number of distinct tags 56,275
Max/avg/min nr of tags per photo 136/ 15.15/ 1
Total number of notes, photo pairs 5,892
Max/avg/min nr of notes per photo 195/ 5.31/ 1
Total number of favorites 37,131
Max/avg/min nr of favorites per photo 20/ 3.61/ 1
Total number of contexts 110,505
Max/avg/min nr of contexts per photo 206/ 3.93/ 1
Dataset
Annotation
? Some images might
not be relevant to
fashion and clothing
? The ground truth
differentiates relevant
from non-relevant
Dataset
Annotation
? We used AMT to create ground
truth for the images
? The fashion category is described
with a definition from Wikipedia
? 6 questions to create 6 labels for
each of the images
? We also ask about familiarity of
workers with the fashion category
HIT Design
HIT Design
HIT Questions (Labels)
Question Possible values
Q1) Fashion / Clothing Related yes C no - notsure
Q2) Specialty clothing item (image
Category)
yes C no - notsure
Q3) Number of people nopeople C onepeople - manypeople
Q4) Professional model or not? yes C no C notapp (not applicable)
Q5) Person wearing fashion? yes C no C noperson C notapp (not
applicable)
Q6) Formal / Informal formalmen - formalwomen -
informalmen informalwomen C other
(cross-dressing or multiple persons) C
notapp (not applicatble)
Annotation Statistics
Total number of assignments 24,457
% of rejected assignments 4 %
Total number of unique workers 1470
Avg. number of assignment by each worker 17
Avg. Completion time 127 sec
Avg. familiarity of workers with fashion items 5.8 (range 1-7)
Question 1 2 3 4 5 6
Kappa Value 0.66 0.65 0.85 0.51 0.38 0.48
Dataset Statistics
? Using the generated ground truth the
statistics about the images were calculated
Number of fashion related images 18,487
Number of images with many people 7,417
Number of images with one person 9,771
Number of images with no person 13,179
Number of images with intention of showing fashion 9,096
Number of professional fashion images 2,814
Applications of the
Dataset
? Developing social media content analysis
C Game with a purpose (domino game)
? Basis for the brave new task in MediaEval
multimedia benchmarking initiative
? Use case for the proof of intentional framing
Conclusion
? Fashion dataset
? Six different labels
? AMT generated ground
truth
? Can be used in various
research areas
? Evaluated in the MediaEval
Benchmark
Michael Riegler
m.a.riegler@tudelft.nl
Thank you!

More Related Content

Viewers also liked (8)

Womens fashion vietnam
Womens fashion vietnamWomens fashion vietnam
Womens fashion vietnam
alicia_obbi
?
Anupama: The marketing plan for Android App
Anupama: The marketing plan for Android AppAnupama: The marketing plan for Android App
Anupama: The marketing plan for Android App
Shivansh Surana
?
Top 10 brands in Bangladesh
Top 10 brands in Bangladesh Top 10 brands in Bangladesh
Top 10 brands in Bangladesh
Meherab Uddin Shakal
?
Inframe Production Profile
Inframe Production ProfileInframe Production Profile
Inframe Production Profile
InframeP
?
Study on yellow ( clothing ) beximco
Study on yellow ( clothing ) beximcoStudy on yellow ( clothing ) beximco
Study on yellow ( clothing ) beximco
Tamimur Islam
?
Brand study report: Aarong
Brand study report: AarongBrand study report: Aarong
Brand study report: Aarong
MD. SAJJADUL KARIM BHUIYAN
?
10 Common Mistakes that people make when marketing their Fashion Brand
10 Common Mistakes that people make when marketing their Fashion Brand10 Common Mistakes that people make when marketing their Fashion Brand
10 Common Mistakes that people make when marketing their Fashion Brand
Ullash Tiwari
?
RZˤ `MϤ?Ҏt룭ڣR㏊|
RZˤ `MϤ?Ҏt룭ڣR㏊|RZˤ `MϤ?Ҏt룭ڣR㏊|
RZˤ `MϤ?Ҏt룭ڣR㏊|
Koichi Hamada
?
Womens fashion vietnam
Womens fashion vietnamWomens fashion vietnam
Womens fashion vietnam
alicia_obbi
?
Anupama: The marketing plan for Android App
Anupama: The marketing plan for Android AppAnupama: The marketing plan for Android App
Anupama: The marketing plan for Android App
Shivansh Surana
?
Inframe Production Profile
Inframe Production ProfileInframe Production Profile
Inframe Production Profile
InframeP
?
Study on yellow ( clothing ) beximco
Study on yellow ( clothing ) beximcoStudy on yellow ( clothing ) beximco
Study on yellow ( clothing ) beximco
Tamimur Islam
?
10 Common Mistakes that people make when marketing their Fashion Brand
10 Common Mistakes that people make when marketing their Fashion Brand10 Common Mistakes that people make when marketing their Fashion Brand
10 Common Mistakes that people make when marketing their Fashion Brand
Ullash Tiwari
?
RZˤ `MϤ?Ҏt룭ڣR㏊|
RZˤ `MϤ?Ҏt룭ڣR㏊|RZˤ `MϤ?Ҏt룭ڣR㏊|
RZˤ `MϤ?Ҏt룭ڣR㏊|
Koichi Hamada
?

Similar to Fashion 10000: An Enriched Dataset of Fashion and Clothing (20)

A Year of Data Science at Metail
A Year of Data Science at MetailA Year of Data Science at Metail
A Year of Data Science at Metail
Matt McDonnell
?
Presentation U Commerce Dec 08
Presentation U Commerce Dec 08Presentation U Commerce Dec 08
Presentation U Commerce Dec 08
jsymonds
?
Nidod job mergers_v1.0
Nidod job mergers_v1.0Nidod job mergers_v1.0
Nidod job mergers_v1.0
Jim "Brodie" Brazell
?
UEMS EACCME 2013 Scoopit cme/cpd
UEMS EACCME 2013 Scoopit cme/cpdUEMS EACCME 2013 Scoopit cme/cpd
UEMS EACCME 2013 Scoopit cme/cpd
OKCC/C3O and CREM/universit Lorraine
?
2015 engineers' content and online marketing preferences webcast
2015 engineers' content and online marketing preferences webcast2015 engineers' content and online marketing preferences webcast
2015 engineers' content and online marketing preferences webcast
ENGINEERING.com
?
Viral news: How to predict news sharing based on article characteristics
Viral news: How to predict news sharing based on article characteristicsViral news: How to predict news sharing based on article characteristics
Viral news: How to predict news sharing based on article characteristics
Department of Communication Science, University of Amsterdam
?
Actc2012 Scoopit Cytomics
Actc2012 Scoopit CytomicsActc2012 Scoopit Cytomics
Actc2012 Scoopit Cytomics
OKCC/C3O and CREM/universit Lorraine
?
World CTCUS2012 Scoopit Cytomics
World CTCUS2012 Scoopit CytomicsWorld CTCUS2012 Scoopit Cytomics
World CTCUS2012 Scoopit Cytomics
OKCC/C3O and CREM/universit Lorraine
?
Reading quiz2
Reading quiz2Reading quiz2
Reading quiz2
Ashley Hungerford
?
Fundamentals Of Data Mining 2010
Fundamentals Of Data Mining 2010Fundamentals Of Data Mining 2010
Fundamentals Of Data Mining 2010
Jim Stafford
?
Fundamentals Of Data Mining 2010
Fundamentals Of Data Mining 2010Fundamentals Of Data Mining 2010
Fundamentals Of Data Mining 2010
Jim Stafford
?
Fundamentals Of Data Mining 2010
Fundamentals Of Data Mining 2010Fundamentals Of Data Mining 2010
Fundamentals Of Data Mining 2010
Jim Stafford
?
Fundamentals Of Data Mining 2010
Fundamentals Of Data Mining 2010Fundamentals Of Data Mining 2010
Fundamentals Of Data Mining 2010
Jim Stafford
?
A Brief (and Practical) Introduction to Information Architecture
A Brief (and Practical) Introduction to Information ArchitectureA Brief (and Practical) Introduction to Information Architecture
A Brief (and Practical) Introduction to Information Architecture
Louis Rosenfeld
?
Documentary Entry Nhd
Documentary Entry NhdDocumentary Entry Nhd
Documentary Entry Nhd
Houston ISD
?
Site search analytics workshop presentation
Site search analytics workshop presentationSite search analytics workshop presentation
Site search analytics workshop presentation
Louis Rosenfeld
?
Mis 535 Education Specialist-snaptutorial.com
Mis 535 Education Specialist-snaptutorial.comMis 535 Education Specialist-snaptutorial.com
Mis 535 Education Specialist-snaptutorial.com
robertledwes14
?
The Firma Model
The Firma ModelThe Firma Model
The Firma Model
Jason Hobbs
?
The Firma Model
The Firma ModelThe Firma Model
The Firma Model
Firma
?
Mi0040 technology management
Mi0040   technology managementMi0040   technology management
Mi0040 technology management
smumbahelp
?
A Year of Data Science at Metail
A Year of Data Science at MetailA Year of Data Science at Metail
A Year of Data Science at Metail
Matt McDonnell
?
Presentation U Commerce Dec 08
Presentation U Commerce Dec 08Presentation U Commerce Dec 08
Presentation U Commerce Dec 08
jsymonds
?
2015 engineers' content and online marketing preferences webcast
2015 engineers' content and online marketing preferences webcast2015 engineers' content and online marketing preferences webcast
2015 engineers' content and online marketing preferences webcast
ENGINEERING.com
?
Fundamentals Of Data Mining 2010
Fundamentals Of Data Mining 2010Fundamentals Of Data Mining 2010
Fundamentals Of Data Mining 2010
Jim Stafford
?
Fundamentals Of Data Mining 2010
Fundamentals Of Data Mining 2010Fundamentals Of Data Mining 2010
Fundamentals Of Data Mining 2010
Jim Stafford
?
Fundamentals Of Data Mining 2010
Fundamentals Of Data Mining 2010Fundamentals Of Data Mining 2010
Fundamentals Of Data Mining 2010
Jim Stafford
?
Fundamentals Of Data Mining 2010
Fundamentals Of Data Mining 2010Fundamentals Of Data Mining 2010
Fundamentals Of Data Mining 2010
Jim Stafford
?
A Brief (and Practical) Introduction to Information Architecture
A Brief (and Practical) Introduction to Information ArchitectureA Brief (and Practical) Introduction to Information Architecture
A Brief (and Practical) Introduction to Information Architecture
Louis Rosenfeld
?
Documentary Entry Nhd
Documentary Entry NhdDocumentary Entry Nhd
Documentary Entry Nhd
Houston ISD
?
Site search analytics workshop presentation
Site search analytics workshop presentationSite search analytics workshop presentation
Site search analytics workshop presentation
Louis Rosenfeld
?
Mis 535 Education Specialist-snaptutorial.com
Mis 535 Education Specialist-snaptutorial.comMis 535 Education Specialist-snaptutorial.com
Mis 535 Education Specialist-snaptutorial.com
robertledwes14
?
The Firma Model
The Firma ModelThe Firma Model
The Firma Model
Firma
?
Mi0040 technology management
Mi0040   technology managementMi0040   technology management
Mi0040 technology management
smumbahelp
?

Recently uploaded (20)

ABA_in_plant_abiotic_stress_mitigation1.ppt
ABA_in_plant_abiotic_stress_mitigation1.pptABA_in_plant_abiotic_stress_mitigation1.ppt
ABA_in_plant_abiotic_stress_mitigation1.ppt
laxmichoudhary77657
?
Seymour Benzer's experiment and complementation test
Seymour Benzer's experiment and complementation testSeymour Benzer's experiment and complementation test
Seymour Benzer's experiment and complementation test
AkankshaSindhiya
?
GALILEO'S OBSERVATION ni Karlo Mariano.pptx
GALILEO'S OBSERVATION ni Karlo Mariano.pptxGALILEO'S OBSERVATION ni Karlo Mariano.pptx
GALILEO'S OBSERVATION ni Karlo Mariano.pptx
ejrguillermo
?
Leafcurl viral disease presentation.pptx
Leafcurl viral disease presentation.pptxLeafcurl viral disease presentation.pptx
Leafcurl viral disease presentation.pptx
Mir Ali M
?
Propagation of electromagnetic waves in free space.pptx
Propagation of electromagnetic waves in free space.pptxPropagation of electromagnetic waves in free space.pptx
Propagation of electromagnetic waves in free space.pptx
kanmanivarsha
?
(Journal Club) - Integration of multiple lineage measurements from the same c...
(Journal Club) - Integration of multiple lineage measurements from the same c...(Journal Club) - Integration of multiple lineage measurements from the same c...
(Journal Club) - Integration of multiple lineage measurements from the same c...
David Podorefsky, PhD
?
(Journal Club) - Understanding tumor ecosystems by single-cell sequencing: pr...
(Journal Club) - Understanding tumor ecosystems by single-cell sequencing: pr...(Journal Club) - Understanding tumor ecosystems by single-cell sequencing: pr...
(Journal Club) - Understanding tumor ecosystems by single-cell sequencing: pr...
David Podorefsky, PhD
?
Detection of ferrihydrite in Martian red dust records ancient cold and wet co...
Detection of ferrihydrite in Martian red dust records ancient cold and wet co...Detection of ferrihydrite in Martian red dust records ancient cold and wet co...
Detection of ferrihydrite in Martian red dust records ancient cold and wet co...
Srgio Sacani
?
Cell Structure & Function | Cambridge IGCSE Biology
Cell Structure & Function | Cambridge IGCSE BiologyCell Structure & Function | Cambridge IGCSE Biology
Cell Structure & Function | Cambridge IGCSE Biology
Blessing Ndazie
?
History of atomic layer deposition (ALD) in a nutshell
History of atomic layer deposition (ALD) in a nutshellHistory of atomic layer deposition (ALD) in a nutshell
History of atomic layer deposition (ALD) in a nutshell
Riikka Puurunen
?
(Journal Club) Folding DNA to create nanoscale shapes and patterns
(Journal Club) Folding DNA to create nanoscale shapes and patterns(Journal Club) Folding DNA to create nanoscale shapes and patterns
(Journal Club) Folding DNA to create nanoscale shapes and patterns
David Podorefsky, PhD
?
Science project class 10 Covid-19 Tuberculosis
Science project class 10 Covid-19 TuberculosisScience project class 10 Covid-19 Tuberculosis
Science project class 10 Covid-19 Tuberculosis
VEENAKSHI PATHAK
?
Anti inflammatory agents from marine bacteria-17.pptx
Anti inflammatory agents from marine bacteria-17.pptxAnti inflammatory agents from marine bacteria-17.pptx
Anti inflammatory agents from marine bacteria-17.pptx
AhmedAbouelwafa7
?
(Journal Club) Focused ultrasound excites neurons via mechanosensitive calciu...
(Journal Club) Focused ultrasound excites neurons via mechanosensitive calciu...(Journal Club) Focused ultrasound excites neurons via mechanosensitive calciu...
(Journal Club) Focused ultrasound excites neurons via mechanosensitive calciu...
David Podorefsky, PhD
?
QUIZ 1 in SCIENCE GRADE 10 QUARTER 3 BIOLOGY
QUIZ 1 in SCIENCE GRADE 10 QUARTER 3 BIOLOGYQUIZ 1 in SCIENCE GRADE 10 QUARTER 3 BIOLOGY
QUIZ 1 in SCIENCE GRADE 10 QUARTER 3 BIOLOGY
tbalagbis5
?
The Sense Organs: Structure and Function of the Eye and Skin | IGCSE Biology
The Sense Organs: Structure and Function of the Eye and Skin | IGCSE BiologyThe Sense Organs: Structure and Function of the Eye and Skin | IGCSE Biology
The Sense Organs: Structure and Function of the Eye and Skin | IGCSE Biology
Blessing Ndazie
?
SCIENCE 7 Q4 1 Classifying Geological Faults.pptx
SCIENCE 7 Q4 1 Classifying Geological Faults.pptxSCIENCE 7 Q4 1 Classifying Geological Faults.pptx
SCIENCE 7 Q4 1 Classifying Geological Faults.pptx
ROLANARIBATO3
?
Drugs and Their Effects | Cambridge IGCSE Biology
Drugs and Their Effects | Cambridge IGCSE BiologyDrugs and Their Effects | Cambridge IGCSE Biology
Drugs and Their Effects | Cambridge IGCSE Biology
Blessing Ndazie
?
Direct Gene Transfer Techniques for Developing Transgenic Plants
Direct Gene Transfer Techniques for Developing Transgenic PlantsDirect Gene Transfer Techniques for Developing Transgenic Plants
Direct Gene Transfer Techniques for Developing Transgenic Plants
Kuldeep Gauliya
?
2025-03-03-Data-related-Ethics Issues in Technologies for Professional Learni...
2025-03-03-Data-related-Ethics Issues in Technologies for Professional Learni...2025-03-03-Data-related-Ethics Issues in Technologies for Professional Learni...
2025-03-03-Data-related-Ethics Issues in Technologies for Professional Learni...
Graz University of Technology & Know-Center
?
ABA_in_plant_abiotic_stress_mitigation1.ppt
ABA_in_plant_abiotic_stress_mitigation1.pptABA_in_plant_abiotic_stress_mitigation1.ppt
ABA_in_plant_abiotic_stress_mitigation1.ppt
laxmichoudhary77657
?
Seymour Benzer's experiment and complementation test
Seymour Benzer's experiment and complementation testSeymour Benzer's experiment and complementation test
Seymour Benzer's experiment and complementation test
AkankshaSindhiya
?
GALILEO'S OBSERVATION ni Karlo Mariano.pptx
GALILEO'S OBSERVATION ni Karlo Mariano.pptxGALILEO'S OBSERVATION ni Karlo Mariano.pptx
GALILEO'S OBSERVATION ni Karlo Mariano.pptx
ejrguillermo
?
Leafcurl viral disease presentation.pptx
Leafcurl viral disease presentation.pptxLeafcurl viral disease presentation.pptx
Leafcurl viral disease presentation.pptx
Mir Ali M
?
Propagation of electromagnetic waves in free space.pptx
Propagation of electromagnetic waves in free space.pptxPropagation of electromagnetic waves in free space.pptx
Propagation of electromagnetic waves in free space.pptx
kanmanivarsha
?
(Journal Club) - Integration of multiple lineage measurements from the same c...
(Journal Club) - Integration of multiple lineage measurements from the same c...(Journal Club) - Integration of multiple lineage measurements from the same c...
(Journal Club) - Integration of multiple lineage measurements from the same c...
David Podorefsky, PhD
?
(Journal Club) - Understanding tumor ecosystems by single-cell sequencing: pr...
(Journal Club) - Understanding tumor ecosystems by single-cell sequencing: pr...(Journal Club) - Understanding tumor ecosystems by single-cell sequencing: pr...
(Journal Club) - Understanding tumor ecosystems by single-cell sequencing: pr...
David Podorefsky, PhD
?
Detection of ferrihydrite in Martian red dust records ancient cold and wet co...
Detection of ferrihydrite in Martian red dust records ancient cold and wet co...Detection of ferrihydrite in Martian red dust records ancient cold and wet co...
Detection of ferrihydrite in Martian red dust records ancient cold and wet co...
Srgio Sacani
?
Cell Structure & Function | Cambridge IGCSE Biology
Cell Structure & Function | Cambridge IGCSE BiologyCell Structure & Function | Cambridge IGCSE Biology
Cell Structure & Function | Cambridge IGCSE Biology
Blessing Ndazie
?
History of atomic layer deposition (ALD) in a nutshell
History of atomic layer deposition (ALD) in a nutshellHistory of atomic layer deposition (ALD) in a nutshell
History of atomic layer deposition (ALD) in a nutshell
Riikka Puurunen
?
(Journal Club) Folding DNA to create nanoscale shapes and patterns
(Journal Club) Folding DNA to create nanoscale shapes and patterns(Journal Club) Folding DNA to create nanoscale shapes and patterns
(Journal Club) Folding DNA to create nanoscale shapes and patterns
David Podorefsky, PhD
?
Science project class 10 Covid-19 Tuberculosis
Science project class 10 Covid-19 TuberculosisScience project class 10 Covid-19 Tuberculosis
Science project class 10 Covid-19 Tuberculosis
VEENAKSHI PATHAK
?
Anti inflammatory agents from marine bacteria-17.pptx
Anti inflammatory agents from marine bacteria-17.pptxAnti inflammatory agents from marine bacteria-17.pptx
Anti inflammatory agents from marine bacteria-17.pptx
AhmedAbouelwafa7
?
(Journal Club) Focused ultrasound excites neurons via mechanosensitive calciu...
(Journal Club) Focused ultrasound excites neurons via mechanosensitive calciu...(Journal Club) Focused ultrasound excites neurons via mechanosensitive calciu...
(Journal Club) Focused ultrasound excites neurons via mechanosensitive calciu...
David Podorefsky, PhD
?
QUIZ 1 in SCIENCE GRADE 10 QUARTER 3 BIOLOGY
QUIZ 1 in SCIENCE GRADE 10 QUARTER 3 BIOLOGYQUIZ 1 in SCIENCE GRADE 10 QUARTER 3 BIOLOGY
QUIZ 1 in SCIENCE GRADE 10 QUARTER 3 BIOLOGY
tbalagbis5
?
The Sense Organs: Structure and Function of the Eye and Skin | IGCSE Biology
The Sense Organs: Structure and Function of the Eye and Skin | IGCSE BiologyThe Sense Organs: Structure and Function of the Eye and Skin | IGCSE Biology
The Sense Organs: Structure and Function of the Eye and Skin | IGCSE Biology
Blessing Ndazie
?
SCIENCE 7 Q4 1 Classifying Geological Faults.pptx
SCIENCE 7 Q4 1 Classifying Geological Faults.pptxSCIENCE 7 Q4 1 Classifying Geological Faults.pptx
SCIENCE 7 Q4 1 Classifying Geological Faults.pptx
ROLANARIBATO3
?
Drugs and Their Effects | Cambridge IGCSE Biology
Drugs and Their Effects | Cambridge IGCSE BiologyDrugs and Their Effects | Cambridge IGCSE Biology
Drugs and Their Effects | Cambridge IGCSE Biology
Blessing Ndazie
?
Direct Gene Transfer Techniques for Developing Transgenic Plants
Direct Gene Transfer Techniques for Developing Transgenic PlantsDirect Gene Transfer Techniques for Developing Transgenic Plants
Direct Gene Transfer Techniques for Developing Transgenic Plants
Kuldeep Gauliya
?

Fashion 10000: An Enriched Dataset of Fashion and Clothing

  • 1. Fashion 10000 An Enriched Dataset of Fashion and Clothing Presentation: Michael Riegler, Klagenfurt University & TU Delft Babak Loni, TU Delft Lei Yen Cheung, TU Delft Alessandro Bozzon, TU Delft Luke Gottlieb, ICSI Martha Larson, TU Delft
  • 2. Table of Content ? Introduction ? Dataset Collection ? Dataset Annotation C Statistics ? Applications of Dataset ? Conclusion
  • 3. The Dataset ? Social Images ? At least 10000 fashion- related images ? Social metadata ? Creative Common images ? Annotated with different labels
  • 4. The Collection Wikipedia 470 Fashion Categories Flickr - Query only CC attribution images - Query should also appear in tags - Top relevant images 32K Images 262 Categories Flickr Fashion 10000 + MTurk Annotations + Metadata
  • 5. Metadata ? Collected in xml and csv format C Title, description, owner, Tags, Location, geo-parameters ? Additional metadata: Info, Geos, Context, Tags, Notes, Favorites, Urls, Comments
  • 6. General Statistics Pairs fashion item, photo 32,398 Number of distinct fashion categories 262 Max/avg/min nr of photos per fashion item 200/ 122.95 / 10 Number of photos with geo annotations 7,933 Total number of comments 58,578 Max/avg/min nr of comments per photo 575 / 7.35/ 1 Total number of tags, photo pairs 460,907 Total number of distinct tags 56,275 Max/avg/min nr of tags per photo 136/ 15.15/ 1 Total number of notes, photo pairs 5,892 Max/avg/min nr of notes per photo 195/ 5.31/ 1 Total number of favorites 37,131 Max/avg/min nr of favorites per photo 20/ 3.61/ 1 Total number of contexts 110,505 Max/avg/min nr of contexts per photo 206/ 3.93/ 1
  • 7. General Statistics Pairs fashion item, photo 32,398 Number of distinct fashion categories 262 Max/avg/min nr of photos per fashion item 200/ 122.95 / 10 Number of photos with geo annotations 7,933 Total number of comments 58,578 Max/avg/min nr of comments per photo 575 / 7.35/ 1 Total number of tags, photo pairs 460,907 Total number of distinct tags 56,275 Max/avg/min nr of tags per photo 136/ 15.15/ 1 Total number of notes, photo pairs 5,892 Max/avg/min nr of notes per photo 195/ 5.31/ 1 Total number of favorites 37,131 Max/avg/min nr of favorites per photo 20/ 3.61/ 1 Total number of contexts 110,505 Max/avg/min nr of contexts per photo 206/ 3.93/ 1
  • 8. General Statistics Pairs fashion item, photo 32,398 Number of distinct fashion categories 262 Max/avg/min nr of photos per fashion item 200/ 122.95 / 10 Number of photos with geo annotations 7,933 Total number of comments 58,578 Max/avg/min nr of comments per photo 575 / 7.35/ 1 Total number of tags, photo pairs 460,907 Total number of distinct tags 56,275 Max/avg/min nr of tags per photo 136/ 15.15/ 1 Total number of notes, photo pairs 5,892 Max/avg/min nr of notes per photo 195/ 5.31/ 1 Total number of favorites 37,131 Max/avg/min nr of favorites per photo 20/ 3.61/ 1 Total number of contexts 110,505 Max/avg/min nr of contexts per photo 206/ 3.93/ 1
  • 9. General Statistics Pairs fashion item, photo 32,398 Number of distinct fashion categories 262 Max/avg/min nr of photos per fashion item 200/ 122.95 / 10 Number of photos with geo annotations7,933 Total number of comments 58,578 Max/avg/min nr of comments per photo 575 / 7.35/ 1 Total number of tags, photo pairs 460,907 Total number of distinct tags 56,275 Max/avg/min nr of tags per photo 136/ 15.15/ 1 Total number of notes, photo pairs 5,892 Max/avg/min nr of notes per photo 195/ 5.31/ 1 Total number of favorites 37,131 Max/avg/min nr of favorites per photo 20/ 3.61/ 1 Total number of contexts 110,505 Max/avg/min nr of contexts per photo 206/ 3.93/ 1
  • 10. Dataset Annotation ? Some images might not be relevant to fashion and clothing ? The ground truth differentiates relevant from non-relevant
  • 11. Dataset Annotation ? We used AMT to create ground truth for the images ? The fashion category is described with a definition from Wikipedia ? 6 questions to create 6 labels for each of the images ? We also ask about familiarity of workers with the fashion category
  • 14. HIT Questions (Labels) Question Possible values Q1) Fashion / Clothing Related yes C no - notsure Q2) Specialty clothing item (image Category) yes C no - notsure Q3) Number of people nopeople C onepeople - manypeople Q4) Professional model or not? yes C no C notapp (not applicable) Q5) Person wearing fashion? yes C no C noperson C notapp (not applicable) Q6) Formal / Informal formalmen - formalwomen - informalmen informalwomen C other (cross-dressing or multiple persons) C notapp (not applicatble)
  • 15. Annotation Statistics Total number of assignments 24,457 % of rejected assignments 4 % Total number of unique workers 1470 Avg. number of assignment by each worker 17 Avg. Completion time 127 sec Avg. familiarity of workers with fashion items 5.8 (range 1-7) Question 1 2 3 4 5 6 Kappa Value 0.66 0.65 0.85 0.51 0.38 0.48
  • 16. Dataset Statistics ? Using the generated ground truth the statistics about the images were calculated Number of fashion related images 18,487 Number of images with many people 7,417 Number of images with one person 9,771 Number of images with no person 13,179 Number of images with intention of showing fashion 9,096 Number of professional fashion images 2,814
  • 17. Applications of the Dataset ? Developing social media content analysis C Game with a purpose (domino game) ? Basis for the brave new task in MediaEval multimedia benchmarking initiative ? Use case for the proof of intentional framing
  • 18. Conclusion ? Fashion dataset ? Six different labels ? AMT generated ground truth ? Can be used in various research areas ? Evaluated in the MediaEval Benchmark

Editor's Notes

  • #4: Social Images: From user-generated fashion categories to user-generated images Should include at least 10000 fashion-related images Should include social metadata Only Creative Common Attribution images Images should be annotated with different labels
  • #6: Collected in xml and csv format General information about images are available in xml format Title, description, owner,.. Tags Location, geo-parameters Additional metadata were collected using different Flickr APIs and converted to csv format Info, Geos, Comments, Context, Tags, Notes, Favorites, Urls
  • #11: Some images might not be relevant to fashion and clothing The ground truth for the images differentiates relevant from non-relevant images Depending on the usage of dataset different type of annotations might be necessary
  • #12: We used AMT to create ground truth for the images In each HIT we ask workers to annotate 4 images from one category The fashion category is described to workers with a definition from Wikipedia In total we ask 6 questions to create 6 labels for each of the images We also ask about familiarity of workers with the Fashion Category The range of value can be from 1 (unfamiliar) to 7 (familiar) Different options are explained with visual popups In addition to the csv file generated by AMT, we created another csv to represent ground truth Each of the images are annotated by 3 workers The ground truth are generated by majority voting
  • #16: Kappa statistics have been used to calculate the agreement among annotators For each of the six questions in the HIT, the agreement among three workers was calculated separately General statistics about the workers and the hit