際際滷

際際滷Share a Scribd company logo
Solving Logical Puzzles with
Natural Language
Processing
Pycon-India 2015
by
Ashutosh Trivedi
Founder, bracketPy
www.bracketpy.com
Pycon-India-2015
The NLP Story
I know Natural Language I know Processing
Pycon-India-2015Pycon-India-2015
To understand natural language
programatically
But how ?
Part of Speech
Vocabulary
n-gram
Word style
Noun detection
The NLP Story..
Pycon-India-2015
The NLP Story..
Hey, I am a sentence.. can you
process me ?
current word
previous word
next word
current word n-gram
POS tag
surrounding POS tag sequence
word shape (all Caps?)
surrounding word shape
presence of word in left/right window
Information
Pycon-India-2015
What to Do With the Information?
Lets learn a model
F1 ) current word
F2 ) previous word
F3) next word
F4) current word n-gram
F5) POS tag
F6) surrounding POS tag sequence
F7) word shape (all Caps?)
F8) surrounding word shape
F9) presence of word in left/right window
Features -
Pycon-India-2015
What to Do With the Information?
W1F1 + W2F2 + .+ WnFnsentence 1 positive
W1F1 + W2F2 + .+ WnFnsentence 2 negative
W1F1 + W2F2 + .+ WnFnsentence 3 positive
W1F1 + W2F2 + .+ WnFnsentence x negative
.
.
.
Pycon-India-2015
Is that it ?
Data features
Through other processes
Machine Learning
learn weights
(optimise weights)
Is it scalable ?
Pycon-India-2015
So what is wrong ?
We are talking about AI
Pycon-India-2015
How do we represent a word?
Index in vocab..
30th word [0,0,.1,0,0,0,0]
30
45th word [0,0,.0,0,0,0,1,0]
45
How would a processor know that good at index 30 is
synonym for nice at index 45
Pycon-India-2015
Wordnets..
Problems
Manual labour
Scalability
New words
Nouns
Pycon-India-2015
Wordnets..
Ipython Demo
Pycon-India-2015
So how do we remember words.. ?
Our human hooks 
associated word
person
context
taste
smell
time
visual
feelings
Pycon-India-2015
how to represent words programmatically ?
You, as an individual is average of 5 people you spend time
with everyday.
You shall know a word by the company it keeps
-J.R.Firth 1957
Pycon-India-2015
How to make neighbours represent the word ?
One of the most successful ideas of statistical NLP
Co-occurrence matrix
Capture both syntactical and semantical information
Pycon-India-2015
Window based co-occurrence matrix
Example corpus:
1) I like deep learning.
2) I like NLP.
3) I enjoy 鍖ying.
Pycon-India-2015
Problem with simple co-occurrence matrix
Increase size with vocabulary
very high dimensional Requires a lot of storage
subsequent classi鍖cation model has sparsity issues
Pycon-India-2015
Reduce the dimension
store information in few dimension - 25 to 1000
dense vectors
less space
Pycon-India-2015
Singular Value Decomposition (SVD)
Pycon-India-2015
SVD
Ipython Demo
Pycon-India-2015
Problem with SVD
Computational cost scales quadratically for N x M matrix
Bad for million of words or document
Hard to incorporate new words
Pycon-India-2015
Word2Vec
Directly learn low dimensional vectors
Instead of capturing co-occurrence counts directly, Predict
surrounding words of every word
Glove: Global Vectors for Word
Representation by Pennington et al. (2014)
Faster and can easily incorporate a new sentence/ document
or add a word to the vocabulary
Pycon-India-2015
Word2Vec
The Skip-gram Model
objective of the Skip-gram model is to 鍖nd word
representations by the surrounding words in a sentence
or a document.
W1, W2, W3, . . . , Wt  sentence/document
Maximise the log probability of any context word given the
current centre word.
Pycon-India-2015
Word2Vec
The Skip-gram Model
For a window of T=3
w1,w2,w3,Wc,w4,w5,w6
Maximise the probability of (w1,w2,w3, w4,w5,w6) for given
word Wc
Pycon-India-2015
Word2Vec
Unsupervised method
We are just optimising the probability of words with respect
to its neighbours
creating a low dimensional space (probabilistic)
Pycon-India-2015
Lower Dimensions
Dimension of Similarity
Dimension of sentiment ?
Dimension of POS ?
Dimension of all word having 5 vowels
It can be anything .
word embeddings
Pycon-India-2015
Dimension of similarity
Analogies testing dimensions of similarity can be solved quite
well just by doing vector subtraction in the embedding space
Syntactically.
X  X  X  X  X  X
Syntactical - Singular, Plural
Pycon-India-2015
Dimension of similarity
Semantical
Similarly for verb and adjective morphological forms
Semantically (Semeval 2012 task 2)
Xh  X$  Xh  X$
X  X  X  Xゐn
Pycon-India-2015
Dimension of similarity
Test for linear relationships, examined by Mikolov et al.
a:b :: c: ?
man: woman :: king : ?
man
king
woman
queen
1
0.75
0.5
0.25
0
0.25 0.5 0.75 1
+ king [0.5, 0.2]
- man [0.25, 0.5]
+ woman [0.6, 1 ]
-
queen [0.85, 0.7]
Pycon-India-2015
Word2Vec
Ipython Demo
Pycon-India-2015
Pycon-India-2015
Pycon India , Ashutosh Trivedi
Superlatives
Pycon-India-2015
Company - CEOs
Pycon-India-2015
Reference
Deep Learning for Natural Language Processing CS224d
Stanfor.edu (http://cs224d.stanford.edu/) 
Mikolov, G. C. T., K. Chen, and J. Dean. "word2vec (2013). 
Pennington, Je鍖rey, Richard Socher, and Christopher D. Manning.
"Glove: Global vectors for word representation." Proceedings of the Empiricial
Methods in
Natural Language Processing (EMNLP 2014) 12 (2014): 1532-1543. 
gensim : Topic Modeling for Humans (https://radimrehurek.com/gensim/) 
Pycon-India-2015
Ad

Recommended

Ai lecture 07(unit03)
Ai lecture 07(unit03)
vikas dhakane
Interpreter Design Pattern real-time examples
Interpreter Design Pattern real-time examples
RAKESH P
AI Lesson 09
AI Lesson 09
Assistant Professor
Spark algorithms
Spark algorithms
Ashutosh Trivedi
GraphX and Pregel - Apache Spark
GraphX and Pregel - Apache Spark
Ashutosh Trivedi
Natural Language Processing
Natural Language Processing
punedevscom
Bijaya Zenchenko - An Embedding is Worth 1000 Words - Start Using Word Embedd...
Bijaya Zenchenko - An Embedding is Worth 1000 Words - Start Using Word Embedd...
Rehgan Avon
word vector embeddings in natural languag processing
word vector embeddings in natural languag processing
ReetShinde
Lda2vec text by the bay 2016 with notes
Lda2vec text by the bay 2016 with notes
Christopher Moody
L6.pptxsdv dfbdfjftj hgjythgfvfhjyggunghb fghtffn
L6.pptxsdv dfbdfjftj hgjythgfvfhjyggunghb fghtffn
RwanEnan
Engineering Intelligent NLP Applications Using Deep Learning Part 1
Engineering Intelligent NLP Applications Using Deep Learning Part 1
Saurabh Kaushik
A Panorama of Natural Language Processing
A Panorama of Natural Language Processing
Ted Xiao
Neural Text Embeddings for Information Retrieval (WSDM 2017)
Neural Text Embeddings for Information Retrieval (WSDM 2017)
Bhaskar Mitra
Tom叩邸 Mikolov - Distributed Representations for NLP
Tom叩邸 Mikolov - Distributed Representations for NLP
Machine Learning Prague
Word embedding
Word embedding
ShivaniChoudhary74
Word embeddings
Word embeddings
Shruti kar
Contemporary Models of Natural Language Processing
Contemporary Models of Natural Language Processing
Katerina Vylomova
wordembedding.pptx
wordembedding.pptx
JOBANPREETSINGH62
Pycon ke word vectors
Pycon ke word vectors
Osebe Sammi
Designing, Visualizing and Understanding Deep Neural Networks
Designing, Visualizing and Understanding Deep Neural Networks
connectbeubax
Word2vector
Word2vector
Ashis Chanda
Word 2 vector
Word 2 vector
Ashis Kumar Chanda
Tutorial on word2vec
Tutorial on word2vec
Leiden University
Word Embeddings - Introduction
Word Embeddings - Introduction
Christian Perone
Ekaterina vylomova-what-do-neural models-know-about-language-p1
Ekaterina vylomova-what-do-neural models-know-about-language-p1
Katerina Vylomova
Embedding for fun fumarola Meetup Milano DLI luglio
Embedding for fun fumarola Meetup Milano DLI luglio
Deep Learning Italia
Word2Vec
Word2Vec
hyunyoung Lee
DL-CO2 -Session 3 Learning Vectorial Representations of Words.pptx
DL-CO2 -Session 3 Learning Vectorial Representations of Words.pptx
Kv Sagar
Viral>Wondershare Filmora 14.5.18.12900 Crack Free Download
Viral>Wondershare Filmora 14.5.18.12900 Crack Free Download
Puppy jhon
Reducing Conflicts and Increasing Safety Along the Cycling Networks of East-F...
Reducing Conflicts and Increasing Safety Along the Cycling Networks of East-F...
Safe Software

More Related Content

Similar to Ashutosh pycon (20)

Lda2vec text by the bay 2016 with notes
Lda2vec text by the bay 2016 with notes
Christopher Moody
L6.pptxsdv dfbdfjftj hgjythgfvfhjyggunghb fghtffn
L6.pptxsdv dfbdfjftj hgjythgfvfhjyggunghb fghtffn
RwanEnan
Engineering Intelligent NLP Applications Using Deep Learning Part 1
Engineering Intelligent NLP Applications Using Deep Learning Part 1
Saurabh Kaushik
A Panorama of Natural Language Processing
A Panorama of Natural Language Processing
Ted Xiao
Neural Text Embeddings for Information Retrieval (WSDM 2017)
Neural Text Embeddings for Information Retrieval (WSDM 2017)
Bhaskar Mitra
Tom叩邸 Mikolov - Distributed Representations for NLP
Tom叩邸 Mikolov - Distributed Representations for NLP
Machine Learning Prague
Word embedding
Word embedding
ShivaniChoudhary74
Word embeddings
Word embeddings
Shruti kar
Contemporary Models of Natural Language Processing
Contemporary Models of Natural Language Processing
Katerina Vylomova
wordembedding.pptx
wordembedding.pptx
JOBANPREETSINGH62
Pycon ke word vectors
Pycon ke word vectors
Osebe Sammi
Designing, Visualizing and Understanding Deep Neural Networks
Designing, Visualizing and Understanding Deep Neural Networks
connectbeubax
Word2vector
Word2vector
Ashis Chanda
Word 2 vector
Word 2 vector
Ashis Kumar Chanda
Tutorial on word2vec
Tutorial on word2vec
Leiden University
Word Embeddings - Introduction
Word Embeddings - Introduction
Christian Perone
Ekaterina vylomova-what-do-neural models-know-about-language-p1
Ekaterina vylomova-what-do-neural models-know-about-language-p1
Katerina Vylomova
Embedding for fun fumarola Meetup Milano DLI luglio
Embedding for fun fumarola Meetup Milano DLI luglio
Deep Learning Italia
Word2Vec
Word2Vec
hyunyoung Lee
DL-CO2 -Session 3 Learning Vectorial Representations of Words.pptx
DL-CO2 -Session 3 Learning Vectorial Representations of Words.pptx
Kv Sagar
Lda2vec text by the bay 2016 with notes
Lda2vec text by the bay 2016 with notes
Christopher Moody
L6.pptxsdv dfbdfjftj hgjythgfvfhjyggunghb fghtffn
L6.pptxsdv dfbdfjftj hgjythgfvfhjyggunghb fghtffn
RwanEnan
Engineering Intelligent NLP Applications Using Deep Learning Part 1
Engineering Intelligent NLP Applications Using Deep Learning Part 1
Saurabh Kaushik
A Panorama of Natural Language Processing
A Panorama of Natural Language Processing
Ted Xiao
Neural Text Embeddings for Information Retrieval (WSDM 2017)
Neural Text Embeddings for Information Retrieval (WSDM 2017)
Bhaskar Mitra
Tom叩邸 Mikolov - Distributed Representations for NLP
Tom叩邸 Mikolov - Distributed Representations for NLP
Machine Learning Prague
Word embeddings
Word embeddings
Shruti kar
Contemporary Models of Natural Language Processing
Contemporary Models of Natural Language Processing
Katerina Vylomova
Pycon ke word vectors
Pycon ke word vectors
Osebe Sammi
Designing, Visualizing and Understanding Deep Neural Networks
Designing, Visualizing and Understanding Deep Neural Networks
connectbeubax
Word Embeddings - Introduction
Word Embeddings - Introduction
Christian Perone
Ekaterina vylomova-what-do-neural models-know-about-language-p1
Ekaterina vylomova-what-do-neural models-know-about-language-p1
Katerina Vylomova
Embedding for fun fumarola Meetup Milano DLI luglio
Embedding for fun fumarola Meetup Milano DLI luglio
Deep Learning Italia
DL-CO2 -Session 3 Learning Vectorial Representations of Words.pptx
DL-CO2 -Session 3 Learning Vectorial Representations of Words.pptx
Kv Sagar

Recently uploaded (20)

Viral>Wondershare Filmora 14.5.18.12900 Crack Free Download
Viral>Wondershare Filmora 14.5.18.12900 Crack Free Download
Puppy jhon
Reducing Conflicts and Increasing Safety Along the Cycling Networks of East-F...
Reducing Conflicts and Increasing Safety Along the Cycling Networks of East-F...
Safe Software
Providing an OGC API Processes REST Interface for FME Flow
Providing an OGC API Processes REST Interface for FME Flow
Safe Software
SAP Modernization Strategies for a Successful S/4HANA Journey.pdf
SAP Modernization Strategies for a Successful S/4HANA Journey.pdf
Precisely
War_And_Cyber_3_Years_Of_Struggle_And_Lessons_For_Global_Security.pdf
War_And_Cyber_3_Years_Of_Struggle_And_Lessons_For_Global_Security.pdf
biswajitbanerjee38
Kubernetes Security Act Now Before Its Too Late
Kubernetes Security Act Now Before Its Too Late
Michael Furman
Floods in Valencia: Two FME-Powered Stories of Data Resilience
Floods in Valencia: Two FME-Powered Stories of Data Resilience
Safe Software
Crypto Super 500 - 14th Report - June2025.pdf
Crypto Super 500 - 14th Report - June2025.pdf
Stephen Perrenod
The State of Web3 Industry- Industry Report
The State of Web3 Industry- Industry Report
Liveplex
vertical-cnc-processing-centers-drillteq-v-200-en.pdf
vertical-cnc-processing-centers-drillteq-v-200-en.pdf
AmirStern2
FME for Good: Integrating Multiple Data Sources with APIs to Support Local Ch...
FME for Good: Integrating Multiple Data Sources with APIs to Support Local Ch...
Safe Software
From Enterprise to Makers: Driving Vision AI Innovation at the Extreme Edge,...
From Enterprise to Makers: Driving Vision AI Innovation at the Extreme Edge,...
Edge AI and Vision Alliance
FIDO Seminar: Perspectives on Passkeys & Consumer Adoption.pptx
FIDO Seminar: Perspectives on Passkeys & Consumer Adoption.pptx
FIDO Alliance
FME for Distribution & Transmission Integrity Management Program (DIMP & TIMP)
FME for Distribution & Transmission Integrity Management Program (DIMP & TIMP)
Safe Software
Down the Rabbit Hole Solving 5 Training Roadblocks
Down the Rabbit Hole Solving 5 Training Roadblocks
Rustici Software
FIDO Seminar: Authentication for a Billion Consumers - Amazon.pptx
FIDO Seminar: Authentication for a Billion Consumers - Amazon.pptx
FIDO Alliance
TrustArc Webinar - 2025 Global Privacy Survey
TrustArc Webinar - 2025 Global Privacy Survey
TrustArc
Can We Use Rust to Develop Extensions for PostgreSQL? (POSETTE: An Event for ...
Can We Use Rust to Develop Extensions for PostgreSQL? (POSETTE: An Event for ...
NTT DATA Technology & Innovation
FIDO Seminar: Evolving Landscape of Post-Quantum Cryptography.pptx
FIDO Seminar: Evolving Landscape of Post-Quantum Cryptography.pptx
FIDO Alliance
AI VIDEO MAGAZINE - June 2025 - r/aivideo
AI VIDEO MAGAZINE - June 2025 - r/aivideo
1pcity Studios, Inc
Viral>Wondershare Filmora 14.5.18.12900 Crack Free Download
Viral>Wondershare Filmora 14.5.18.12900 Crack Free Download
Puppy jhon
Reducing Conflicts and Increasing Safety Along the Cycling Networks of East-F...
Reducing Conflicts and Increasing Safety Along the Cycling Networks of East-F...
Safe Software
Providing an OGC API Processes REST Interface for FME Flow
Providing an OGC API Processes REST Interface for FME Flow
Safe Software
SAP Modernization Strategies for a Successful S/4HANA Journey.pdf
SAP Modernization Strategies for a Successful S/4HANA Journey.pdf
Precisely
War_And_Cyber_3_Years_Of_Struggle_And_Lessons_For_Global_Security.pdf
War_And_Cyber_3_Years_Of_Struggle_And_Lessons_For_Global_Security.pdf
biswajitbanerjee38
Kubernetes Security Act Now Before Its Too Late
Kubernetes Security Act Now Before Its Too Late
Michael Furman
Floods in Valencia: Two FME-Powered Stories of Data Resilience
Floods in Valencia: Two FME-Powered Stories of Data Resilience
Safe Software
Crypto Super 500 - 14th Report - June2025.pdf
Crypto Super 500 - 14th Report - June2025.pdf
Stephen Perrenod
The State of Web3 Industry- Industry Report
The State of Web3 Industry- Industry Report
Liveplex
vertical-cnc-processing-centers-drillteq-v-200-en.pdf
vertical-cnc-processing-centers-drillteq-v-200-en.pdf
AmirStern2
FME for Good: Integrating Multiple Data Sources with APIs to Support Local Ch...
FME for Good: Integrating Multiple Data Sources with APIs to Support Local Ch...
Safe Software
From Enterprise to Makers: Driving Vision AI Innovation at the Extreme Edge,...
From Enterprise to Makers: Driving Vision AI Innovation at the Extreme Edge,...
Edge AI and Vision Alliance
FIDO Seminar: Perspectives on Passkeys & Consumer Adoption.pptx
FIDO Seminar: Perspectives on Passkeys & Consumer Adoption.pptx
FIDO Alliance
FME for Distribution & Transmission Integrity Management Program (DIMP & TIMP)
FME for Distribution & Transmission Integrity Management Program (DIMP & TIMP)
Safe Software
Down the Rabbit Hole Solving 5 Training Roadblocks
Down the Rabbit Hole Solving 5 Training Roadblocks
Rustici Software
FIDO Seminar: Authentication for a Billion Consumers - Amazon.pptx
FIDO Seminar: Authentication for a Billion Consumers - Amazon.pptx
FIDO Alliance
TrustArc Webinar - 2025 Global Privacy Survey
TrustArc Webinar - 2025 Global Privacy Survey
TrustArc
Can We Use Rust to Develop Extensions for PostgreSQL? (POSETTE: An Event for ...
Can We Use Rust to Develop Extensions for PostgreSQL? (POSETTE: An Event for ...
NTT DATA Technology & Innovation
FIDO Seminar: Evolving Landscape of Post-Quantum Cryptography.pptx
FIDO Seminar: Evolving Landscape of Post-Quantum Cryptography.pptx
FIDO Alliance
AI VIDEO MAGAZINE - June 2025 - r/aivideo
AI VIDEO MAGAZINE - June 2025 - r/aivideo
1pcity Studios, Inc
Ad

Ashutosh pycon

  • 1. Solving Logical Puzzles with Natural Language Processing Pycon-India 2015 by Ashutosh Trivedi Founder, bracketPy www.bracketpy.com Pycon-India-2015
  • 2. The NLP Story I know Natural Language I know Processing Pycon-India-2015Pycon-India-2015
  • 3. To understand natural language programatically But how ? Part of Speech Vocabulary n-gram Word style Noun detection The NLP Story.. Pycon-India-2015
  • 4. The NLP Story.. Hey, I am a sentence.. can you process me ? current word previous word next word current word n-gram POS tag surrounding POS tag sequence word shape (all Caps?) surrounding word shape presence of word in left/right window Information Pycon-India-2015
  • 5. What to Do With the Information? Lets learn a model F1 ) current word F2 ) previous word F3) next word F4) current word n-gram F5) POS tag F6) surrounding POS tag sequence F7) word shape (all Caps?) F8) surrounding word shape F9) presence of word in left/right window Features - Pycon-India-2015
  • 6. What to Do With the Information? W1F1 + W2F2 + .+ WnFnsentence 1 positive W1F1 + W2F2 + .+ WnFnsentence 2 negative W1F1 + W2F2 + .+ WnFnsentence 3 positive W1F1 + W2F2 + .+ WnFnsentence x negative . . . Pycon-India-2015
  • 7. Is that it ? Data features Through other processes Machine Learning learn weights (optimise weights) Is it scalable ? Pycon-India-2015
  • 8. So what is wrong ? We are talking about AI Pycon-India-2015
  • 9. How do we represent a word? Index in vocab.. 30th word [0,0,.1,0,0,0,0] 30 45th word [0,0,.0,0,0,0,1,0] 45 How would a processor know that good at index 30 is synonym for nice at index 45 Pycon-India-2015
  • 12. So how do we remember words.. ? Our human hooks associated word person context taste smell time visual feelings Pycon-India-2015
  • 13. how to represent words programmatically ? You, as an individual is average of 5 people you spend time with everyday. You shall know a word by the company it keeps -J.R.Firth 1957 Pycon-India-2015
  • 14. How to make neighbours represent the word ? One of the most successful ideas of statistical NLP Co-occurrence matrix Capture both syntactical and semantical information Pycon-India-2015
  • 15. Window based co-occurrence matrix Example corpus: 1) I like deep learning. 2) I like NLP. 3) I enjoy 鍖ying. Pycon-India-2015
  • 16. Problem with simple co-occurrence matrix Increase size with vocabulary very high dimensional Requires a lot of storage subsequent classi鍖cation model has sparsity issues Pycon-India-2015
  • 17. Reduce the dimension store information in few dimension - 25 to 1000 dense vectors less space Pycon-India-2015
  • 18. Singular Value Decomposition (SVD) Pycon-India-2015
  • 20. Problem with SVD Computational cost scales quadratically for N x M matrix Bad for million of words or document Hard to incorporate new words Pycon-India-2015
  • 21. Word2Vec Directly learn low dimensional vectors Instead of capturing co-occurrence counts directly, Predict surrounding words of every word Glove: Global Vectors for Word Representation by Pennington et al. (2014) Faster and can easily incorporate a new sentence/ document or add a word to the vocabulary Pycon-India-2015
  • 22. Word2Vec The Skip-gram Model objective of the Skip-gram model is to 鍖nd word representations by the surrounding words in a sentence or a document. W1, W2, W3, . . . , Wt sentence/document Maximise the log probability of any context word given the current centre word. Pycon-India-2015
  • 23. Word2Vec The Skip-gram Model For a window of T=3 w1,w2,w3,Wc,w4,w5,w6 Maximise the probability of (w1,w2,w3, w4,w5,w6) for given word Wc Pycon-India-2015
  • 24. Word2Vec Unsupervised method We are just optimising the probability of words with respect to its neighbours creating a low dimensional space (probabilistic) Pycon-India-2015
  • 25. Lower Dimensions Dimension of Similarity Dimension of sentiment ? Dimension of POS ? Dimension of all word having 5 vowels It can be anything . word embeddings Pycon-India-2015
  • 26. Dimension of similarity Analogies testing dimensions of similarity can be solved quite well just by doing vector subtraction in the embedding space Syntactically. X X X X X X Syntactical - Singular, Plural Pycon-India-2015
  • 27. Dimension of similarity Semantical Similarly for verb and adjective morphological forms Semantically (Semeval 2012 task 2) Xh X$ Xh X$ X X X Xゐn Pycon-India-2015
  • 28. Dimension of similarity Test for linear relationships, examined by Mikolov et al. a:b :: c: ? man: woman :: king : ? man king woman queen 1 0.75 0.5 0.25 0 0.25 0.5 0.75 1 + king [0.5, 0.2] - man [0.25, 0.5] + woman [0.6, 1 ] - queen [0.85, 0.7] Pycon-India-2015
  • 31. Pycon India , Ashutosh Trivedi Superlatives Pycon-India-2015
  • 33. Reference Deep Learning for Natural Language Processing CS224d Stanfor.edu (http://cs224d.stanford.edu/) Mikolov, G. C. T., K. Chen, and J. Dean. "word2vec (2013). Pennington, Je鍖rey, Richard Socher, and Christopher D. Manning. "Glove: Global vectors for word representation." Proceedings of the Empiricial Methods in Natural Language Processing (EMNLP 2014) 12 (2014): 1532-1543. gensim : Topic Modeling for Humans (https://radimrehurek.com/gensim/) Pycon-India-2015