ݺߣ

ݺߣShare a Scribd company logo
Vector?Spaces?for?
Information?Extraction
Behrang?Q.?Zadeh
behrangatoffice@gmail.com
Knowledge?Discovery?Unit?@?Insight?Centre?@

National?University?of?Ireland,?Galway
Insight?Workshop?on?Latent?Space?Methods?C Dublin,?UCD,?2014
Vector?Spaces?in?Information?Extraction

Entities?to?be?Extracted?or?compared

Contexts?that?are?used?for?comparison

? Vector?spaces?in?IE?are:
? a representation?framework?for?the
Distributional?Hypothesis*;?
? Sparse;
? Large?(order?of?millions?by?millions);
? Changing?Dynamically;

*not?exclusively?
Vector?Spaces?in?Information?Extraction

Entities?to?be?Extracted?or?compared

Contexts?that?are?used?for?comparison

? In?classic?methods?the dimension?
of?VSM growths?as data?growth.?

? Dimension?Reduction?techniques?
based?on?Matrix?Factorization?
may?not?be?applied:
?

Iterative?methods?are?still?of the?
complexity?of?O(n2)?
Vector?Spaces?in?Information?Extraction
? Random?Projection?is?one?solution:

? Estimate?a?VSM by?a?random?projection?matrix?that made?
of?a?set?of?randomly?created?vectors.
? i.e.?based on?the?Johnson\Lindenstrauss lemma
? verified by?the?results?reported?in?(Hecht\Nielsen,?1994)

*?The?above?figure?is?copyrighted?by?Alex?Clemmer (http://nullspace.io/)?
Vector?Spaces?in?Information?Extraction

? Random?Projection?\ Application?
Example
? Extraction?of?Technology?Terms?(term?classification)
? Data?Size:?only 10,000 publication

? Contexts: words?and?their?position in?the?
neighbourhood?of?terms

? Original?Dimension:?
? approximately?

5 million

? Reducing?the?dimension?to?2000 using?

Random?Projection

Behrangs research?evolves?around?classification?and?finding?the?optimal?contexts?in?random?vector?spaces?for??the?extraction?of?technology?terms?and?their?relation.?If?you?are?interested?please?email?him?at?

behrangatoffice@gmail.com
Ad

Recommended

Poster: ICPR 2008
Poster: ICPR 2008
Mahfuzul Haque
?
Targeting accurate object extraction from an image a comprehensive study of ...
Targeting accurate object extraction from an image a comprehensive study of ...
LogicMindtech Nologies
?
ާڧڧ ֧. ѧ֧ާѧڧܧ ҧݧڧ էѧߧߧ: ֧ߧ٧, ߧ֧֧ۧ, ҧѧ֧ۧӧܧڧ ӧ...
ާڧڧ ֧. ѧ֧ާѧڧܧ ҧݧڧ էѧߧߧ: ֧ߧ٧, ߧ֧֧ۧ, ҧѧ֧ۧӧܧڧ ӧ...
Yandex
?
Coordinated and adaptive information collecting in target tracking wireless s...
Coordinated and adaptive information collecting in target tracking wireless s...
LogicMindtech Nologies
?
Connections b/w active learning and model extraction
Connections b/w active learning and model extraction
Anmol Dwivedi
?
Feature Subset Selection for High Dimensional Data Using Clustering Techniques
Feature Subset Selection for High Dimensional Data Using Clustering Techniques
IRJET Journal
?
Information Access to Medical Image Data: from Big Data to Semantics - Academ...
Information Access to Medical Image Data: from Big Data to Semantics - Academ...
Institute of Information Systems (HES-SO)
?
Abstract kataoka
Abstract kataoka
harmonylab
?
Tailoring Temporal Description Logics for Reasoning over Temporal Conceptual ...
Tailoring Temporal Description Logics for Reasoning over Temporal Conceptual ...
net2-project
?
Random Manhattan Indexing
Random Manhattan Indexing
net2-project
?
Extracting Information for Context-aware Meeting Preparation
Extracting Information for Context-aware Meeting Preparation
net2-project
?
Description logic
Description logic
balamurugan.k Kalibalamurugan
?
Reasoning in Description Logics
Reasoning in Description Logics
R A Akerkar
?
ꥯ`پ𲹰ʹ
ꥯ`پ𲹰ʹ
Recruit Technologies
?
Ontology
Ontology
Sudarsun Santhiappan
?
ꥯ`ʽȻIʏܽ
ꥯ`ʽȻIʏܽ
Recruit Technologies
?
T ?ǘ컯ǩ`컯뼼g?
T ?ǘ컯ǩ`컯뼼g?
Yuya Unno
?
Deep Learning for Search
Deep Learning for Search
Bhaskar Mitra
?
A Simple Introduction to Neural Information Retrieval
A Simple Introduction to Neural Information Retrieval
Bhaskar Mitra
?
Vectorization - Georgia Tech - CSE6242 - March 2015
Vectorization - Georgia Tech - CSE6242 - March 2015
Josh Patterson
?
IE: Named Entity Recognition (NER)
IE: Named Entity Recognition (NER)
Marina Santini
?
Multimodal Searching and Semantic Spaces: ...or how to find images of Dalmati...
Multimodal Searching and Semantic Spaces: ...or how to find images of Dalmati...
Jonathon Hare
?
lecture14-distributed-reprennnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnsentations.pptx
lecture14-distributed-reprennnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnsentations.pptx
RAtna29
?
Basic review on topic modeling
Basic review on topic modeling
Hiroyuki Kuromiya
?
Vector space model12345678910111213.pptx
Vector space model12345678910111213.pptx
someyamohsen2
?
Intro to Vectorization Concepts - GaTech cse6242
Intro to Vectorization Concepts - GaTech cse6242
Josh Patterson
?
Vsm lsi
Vsm lsi
Ryan Wang
?
Data Mining Essentials in Knowledge discovery in databases (KDD).
Data Mining Essentials in Knowledge discovery in databases (KDD).
senthilkumarm93
?
L6.pptxsdv dfbdfjftj hgjythgfvfhjyggunghb fghtffn
L6.pptxsdv dfbdfjftj hgjythgfvfhjyggunghb fghtffn
RwanEnan
?
Borders of Decidability in Verification of Data-Centric Dynamic Systems
Borders of Decidability in Verification of Data-Centric Dynamic Systems
net2-project
?

More Related Content

Viewers also liked (9)

Tailoring Temporal Description Logics for Reasoning over Temporal Conceptual ...
Tailoring Temporal Description Logics for Reasoning over Temporal Conceptual ...
net2-project
?
Random Manhattan Indexing
Random Manhattan Indexing
net2-project
?
Extracting Information for Context-aware Meeting Preparation
Extracting Information for Context-aware Meeting Preparation
net2-project
?
Description logic
Description logic
balamurugan.k Kalibalamurugan
?
Reasoning in Description Logics
Reasoning in Description Logics
R A Akerkar
?
ꥯ`پ𲹰ʹ
ꥯ`پ𲹰ʹ
Recruit Technologies
?
Ontology
Ontology
Sudarsun Santhiappan
?
ꥯ`ʽȻIʏܽ
ꥯ`ʽȻIʏܽ
Recruit Technologies
?
T ?ǘ컯ǩ`컯뼼g?
T ?ǘ컯ǩ`컯뼼g?
Yuya Unno
?
Tailoring Temporal Description Logics for Reasoning over Temporal Conceptual ...
Tailoring Temporal Description Logics for Reasoning over Temporal Conceptual ...
net2-project
?
Random Manhattan Indexing
Random Manhattan Indexing
net2-project
?
Extracting Information for Context-aware Meeting Preparation
Extracting Information for Context-aware Meeting Preparation
net2-project
?
Reasoning in Description Logics
Reasoning in Description Logics
R A Akerkar
?
T ?ǘ컯ǩ`컯뼼g?
T ?ǘ컯ǩ`컯뼼g?
Yuya Unno
?

Similar to Vector spaces for information extraction - Random Projection Example (12)

Deep Learning for Search
Deep Learning for Search
Bhaskar Mitra
?
A Simple Introduction to Neural Information Retrieval
A Simple Introduction to Neural Information Retrieval
Bhaskar Mitra
?
Vectorization - Georgia Tech - CSE6242 - March 2015
Vectorization - Georgia Tech - CSE6242 - March 2015
Josh Patterson
?
IE: Named Entity Recognition (NER)
IE: Named Entity Recognition (NER)
Marina Santini
?
Multimodal Searching and Semantic Spaces: ...or how to find images of Dalmati...
Multimodal Searching and Semantic Spaces: ...or how to find images of Dalmati...
Jonathon Hare
?
lecture14-distributed-reprennnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnsentations.pptx
lecture14-distributed-reprennnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnsentations.pptx
RAtna29
?
Basic review on topic modeling
Basic review on topic modeling
Hiroyuki Kuromiya
?
Vector space model12345678910111213.pptx
Vector space model12345678910111213.pptx
someyamohsen2
?
Intro to Vectorization Concepts - GaTech cse6242
Intro to Vectorization Concepts - GaTech cse6242
Josh Patterson
?
Vsm lsi
Vsm lsi
Ryan Wang
?
Data Mining Essentials in Knowledge discovery in databases (KDD).
Data Mining Essentials in Knowledge discovery in databases (KDD).
senthilkumarm93
?
L6.pptxsdv dfbdfjftj hgjythgfvfhjyggunghb fghtffn
L6.pptxsdv dfbdfjftj hgjythgfvfhjyggunghb fghtffn
RwanEnan
?
Deep Learning for Search
Deep Learning for Search
Bhaskar Mitra
?
A Simple Introduction to Neural Information Retrieval
A Simple Introduction to Neural Information Retrieval
Bhaskar Mitra
?
Vectorization - Georgia Tech - CSE6242 - March 2015
Vectorization - Georgia Tech - CSE6242 - March 2015
Josh Patterson
?
IE: Named Entity Recognition (NER)
IE: Named Entity Recognition (NER)
Marina Santini
?
Multimodal Searching and Semantic Spaces: ...or how to find images of Dalmati...
Multimodal Searching and Semantic Spaces: ...or how to find images of Dalmati...
Jonathon Hare
?
lecture14-distributed-reprennnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnsentations.pptx
lecture14-distributed-reprennnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnsentations.pptx
RAtna29
?
Vector space model12345678910111213.pptx
Vector space model12345678910111213.pptx
someyamohsen2
?
Intro to Vectorization Concepts - GaTech cse6242
Intro to Vectorization Concepts - GaTech cse6242
Josh Patterson
?
Data Mining Essentials in Knowledge discovery in databases (KDD).
Data Mining Essentials in Knowledge discovery in databases (KDD).
senthilkumarm93
?
L6.pptxsdv dfbdfjftj hgjythgfvfhjyggunghb fghtffn
L6.pptxsdv dfbdfjftj hgjythgfvfhjyggunghb fghtffn
RwanEnan
?
Ad

More from net2-project (14)

Borders of Decidability in Verification of Data-Centric Dynamic Systems
Borders of Decidability in Verification of Data-Centric Dynamic Systems
net2-project
?
Exchanging OWL 2 QL Knowledge Bases
Exchanging OWL 2 QL Knowledge Bases
net2-project
?
Federation and Navigation in SPARQL 1.1
Federation and Navigation in SPARQL 1.1
net2-project
?
Mining Semi-structured Data: Understanding Web-tables C Building a Taxonomy f...
Mining Semi-structured Data: Understanding Web-tables C Building a Taxonomy f...
net2-project
?
Extending DBpedia (LOD) using WikiTables
Extending DBpedia (LOD) using WikiTables
net2-project
?
Managing Social Communities
Managing Social Communities
net2-project
?
Data Exchange over RDF
Data Exchange over RDF
net2-project
?
Exchanging more than Complete Data
Exchanging more than Complete Data
net2-project
?
Exchanging More than Complete Data
Exchanging More than Complete Data
net2-project
?
Exchanging More than Complete Data
Exchanging More than Complete Data
net2-project
?
Answer-set programming
Answer-set programming
net2-project
?
Evolving web, evolving search
Evolving web, evolving search
net2-project
?
XSPARQL Tutorial
XSPARQL Tutorial
net2-project
?
SPARQL1.1 Tutorial, given in UChile by Axel Polleres (DERI)
SPARQL1.1 Tutorial, given in UChile by Axel Polleres (DERI)
net2-project
?
Borders of Decidability in Verification of Data-Centric Dynamic Systems
Borders of Decidability in Verification of Data-Centric Dynamic Systems
net2-project
?
Exchanging OWL 2 QL Knowledge Bases
Exchanging OWL 2 QL Knowledge Bases
net2-project
?
Federation and Navigation in SPARQL 1.1
Federation and Navigation in SPARQL 1.1
net2-project
?
Mining Semi-structured Data: Understanding Web-tables C Building a Taxonomy f...
Mining Semi-structured Data: Understanding Web-tables C Building a Taxonomy f...
net2-project
?
Extending DBpedia (LOD) using WikiTables
Extending DBpedia (LOD) using WikiTables
net2-project
?
Managing Social Communities
Managing Social Communities
net2-project
?
Data Exchange over RDF
Data Exchange over RDF
net2-project
?
Exchanging more than Complete Data
Exchanging more than Complete Data
net2-project
?
Exchanging More than Complete Data
Exchanging More than Complete Data
net2-project
?
Exchanging More than Complete Data
Exchanging More than Complete Data
net2-project
?
Answer-set programming
Answer-set programming
net2-project
?
Evolving web, evolving search
Evolving web, evolving search
net2-project
?
SPARQL1.1 Tutorial, given in UChile by Axel Polleres (DERI)
SPARQL1.1 Tutorial, given in UChile by Axel Polleres (DERI)
net2-project
?
Ad

Recently uploaded (20)

Intellectual Property Right (Jurisprudence).pptx
Intellectual Property Right (Jurisprudence).pptx
Vishal Chanalia
?
OBSESSIVE COMPULSIVE DISORDER.pptx IN 5TH SEMESTER B.SC NURSING, 2ND YEAR GNM...
OBSESSIVE COMPULSIVE DISORDER.pptx IN 5TH SEMESTER B.SC NURSING, 2ND YEAR GNM...
parmarjuli1412
?
Hurricane Helene Application Documents Checklists
Hurricane Helene Application Documents Checklists
Mebane Rash
?
Paper 108 | Thoreaus Influence on Gandhi: The Evolution of Civil Disobedience
Paper 108 | Thoreaus Influence on Gandhi: The Evolution of Civil Disobedience
Rajdeep Bavaliya
?
Birnagar High School Platinum Jubilee Quiz.pptx
Birnagar High School Platinum Jubilee Quiz.pptx
Sourav Kr Podder
?
LDMMIA Shop & Student News Summer Solstice 25
LDMMIA Shop & Student News Summer Solstice 25
LDM & Mia eStudios
?
INDUCTIVE EFFECT slide for first prof pharamacy students
INDUCTIVE EFFECT slide for first prof pharamacy students
SHABNAM FAIZ
?
2025 June Year 9 Presentation: Subject selection.pptx
2025 June Year 9 Presentation: Subject selection.pptx
mansk2
?
Photo chemistry Power Point Presentation
Photo chemistry Power Point Presentation
mprpgcwa2024
?
Pests of Maize: An comprehensive overview.pptx
Pests of Maize: An comprehensive overview.pptx
Arshad Shaikh
?
Aprendendo Arquitetura Framework Salesforce - Dia 02
Aprendendo Arquitetura Framework Salesforce - Dia 02
Mauricio Alexandre Silva
?
THE PSYCHOANALYTIC OF THE BLACK CAT BY EDGAR ALLAN POE (1).pdf
THE PSYCHOANALYTIC OF THE BLACK CAT BY EDGAR ALLAN POE (1).pdf
nabilahk908
?
How to Customize Quotation Layouts in Odoo 18
How to Customize Quotation Layouts in Odoo 18
Celine George
?
English 3 Quarter 1_LEwithLAS_Week 1.pdf
English 3 Quarter 1_LEwithLAS_Week 1.pdf
DeAsisAlyanajaneH
?
SCHIZOPHRENIA OTHER PSYCHOTIC DISORDER LIKE Persistent delusion/Capgras syndr...
SCHIZOPHRENIA OTHER PSYCHOTIC DISORDER LIKE Persistent delusion/Capgras syndr...
parmarjuli1412
?
How to use search fetch method in Odoo 18
How to use search fetch method in Odoo 18
Celine George
?
YSPH VMOC Special Report - Measles Outbreak Southwest US 6-14-2025.pptx
YSPH VMOC Special Report - Measles Outbreak Southwest US 6-14-2025.pptx
Yale School of Public Health - The Virtual Medical Operations Center (VMOC)
?
GREAT QUIZ EXCHANGE 2025 - GENERAL QUIZ.pptx
GREAT QUIZ EXCHANGE 2025 - GENERAL QUIZ.pptx
Ronisha Das
?
This is why students from these 44 institutions have not received National Se...
This is why students from these 44 institutions have not received National Se...
Kweku Zurek
?
ECONOMICS, DISASTER MANAGEMENT, ROAD SAFETY - STUDY MATERIAL [10TH]
ECONOMICS, DISASTER MANAGEMENT, ROAD SAFETY - STUDY MATERIAL [10TH]
SHERAZ AHMAD LONE
?
Intellectual Property Right (Jurisprudence).pptx
Intellectual Property Right (Jurisprudence).pptx
Vishal Chanalia
?
OBSESSIVE COMPULSIVE DISORDER.pptx IN 5TH SEMESTER B.SC NURSING, 2ND YEAR GNM...
OBSESSIVE COMPULSIVE DISORDER.pptx IN 5TH SEMESTER B.SC NURSING, 2ND YEAR GNM...
parmarjuli1412
?
Hurricane Helene Application Documents Checklists
Hurricane Helene Application Documents Checklists
Mebane Rash
?
Paper 108 | Thoreaus Influence on Gandhi: The Evolution of Civil Disobedience
Paper 108 | Thoreaus Influence on Gandhi: The Evolution of Civil Disobedience
Rajdeep Bavaliya
?
Birnagar High School Platinum Jubilee Quiz.pptx
Birnagar High School Platinum Jubilee Quiz.pptx
Sourav Kr Podder
?
LDMMIA Shop & Student News Summer Solstice 25
LDMMIA Shop & Student News Summer Solstice 25
LDM & Mia eStudios
?
INDUCTIVE EFFECT slide for first prof pharamacy students
INDUCTIVE EFFECT slide for first prof pharamacy students
SHABNAM FAIZ
?
2025 June Year 9 Presentation: Subject selection.pptx
2025 June Year 9 Presentation: Subject selection.pptx
mansk2
?
Photo chemistry Power Point Presentation
Photo chemistry Power Point Presentation
mprpgcwa2024
?
Pests of Maize: An comprehensive overview.pptx
Pests of Maize: An comprehensive overview.pptx
Arshad Shaikh
?
Aprendendo Arquitetura Framework Salesforce - Dia 02
Aprendendo Arquitetura Framework Salesforce - Dia 02
Mauricio Alexandre Silva
?
THE PSYCHOANALYTIC OF THE BLACK CAT BY EDGAR ALLAN POE (1).pdf
THE PSYCHOANALYTIC OF THE BLACK CAT BY EDGAR ALLAN POE (1).pdf
nabilahk908
?
How to Customize Quotation Layouts in Odoo 18
How to Customize Quotation Layouts in Odoo 18
Celine George
?
English 3 Quarter 1_LEwithLAS_Week 1.pdf
English 3 Quarter 1_LEwithLAS_Week 1.pdf
DeAsisAlyanajaneH
?
SCHIZOPHRENIA OTHER PSYCHOTIC DISORDER LIKE Persistent delusion/Capgras syndr...
SCHIZOPHRENIA OTHER PSYCHOTIC DISORDER LIKE Persistent delusion/Capgras syndr...
parmarjuli1412
?
How to use search fetch method in Odoo 18
How to use search fetch method in Odoo 18
Celine George
?
GREAT QUIZ EXCHANGE 2025 - GENERAL QUIZ.pptx
GREAT QUIZ EXCHANGE 2025 - GENERAL QUIZ.pptx
Ronisha Das
?
This is why students from these 44 institutions have not received National Se...
This is why students from these 44 institutions have not received National Se...
Kweku Zurek
?
ECONOMICS, DISASTER MANAGEMENT, ROAD SAFETY - STUDY MATERIAL [10TH]
ECONOMICS, DISASTER MANAGEMENT, ROAD SAFETY - STUDY MATERIAL [10TH]
SHERAZ AHMAD LONE
?

Vector spaces for information extraction - Random Projection Example