This document provides examples of various natural language processing (NLP) tasks and techniques, including part-of-speech tagging, named entity recognition, parsing, machine translation, and sentiment analysis. It shows the output of performing these NLP tasks on short text snippets. It also discusses the relative difficulty of different NLP problems, and provides some examples of NLP applications and tools.
6. Coreference resolution
Question answering (QA)
Part-of-speech (POS) tagging
Word sense disambiguation (WSD)
Paraphrase
Named entity recognition (NER)
Parsing
Summarization
Information extraction (IE)
Machine translation (MT)
Dialog
Sentiment analysis
mostly solved
making good progress
still really hard
Spam detection (Classification)
Let¡¯s go to Agra!
Buy V1AGRA ¡
?
?
Colorless green ideas sleep furiously.
ADJ ADJ NOUN VERB ADV
Einstein met with UN officials in Princeton
PERSON ORG LOC
You¡¯re invited to our dinner
party, Friday May 27 at 8:30
Party
May 27
add
Best roast chicken in San Francisco!
The waiter ignored us for 20 minutes.
Carter told Mubarak he shouldn¡¯t run again.
I need new batteries for my mouse.
The 13th Shanghai International Film Festival¡
µÚ13½ìÉϺ£¹ú¼ÊµçÓ°½Ú¿ªÄ»¡
The Dow Jones is up
Housing prices rose
Economy is
good
Q. How effective is ibuprofen in reducing
fever in patients with acute febrile illness?
I can see Alcatraz from the window!
XYZ acquired ABC yesterday
ABC has been taken over by XYZ
Where is Citizen Kane playing in SF?
Castro Theatre at 7:30. Do
you want a ticket?
The S&P500 jumped
Source: Dan Jurafsky
7. non-standard English
Great job @justinbieber! Were
SOO PROUD of what youve
accomplished! U taught us 2
#neversaynever & you yourself
should never give up either?
segmentation issues idioms
dark horse
get cold feet
lose face
throw in the towel
neologisms
unfriend
Retweet
bromance
tricky entity names
Where is A Bug¡¯s Life playing ¡
Let It Be was recorded ¡
¡ a mutation on the for gene ¡
the New York-New Haven Railroad
the New York-New Haven Railroad
Source: Dan Jurafsky (modified)
sarcasm
A: I love Justin Bieber. Do you
like him to?
B:Yeah. Sure. I absolutely love
him.
15. Tokenize Clean Stem Filter
Then a hurricane came, and devastation reigned
then a hurricane came and devastation reigned
then a hurricane came and devastation reigned
then a hurricane came and devastation reigned
43. ? Na?ve Bayes predicts 9 of the 12 papers
as written by Madison.
? K-NN predicts only 4 of the 12 papers
as written by Madison
? Why? How stable are these results??
Code Lines: 371-373