Developing User-friendly and Customizable Text AnalyzerL室g親僥寄僥 徭隼冱囂I尖冩梢片
?
Yuki Miyanishi and Kazuhide Yamamoto. Developing User-friendly and Customizable Text Analyzer. The International Conference on Practical Linguistics of Japanese (ICPLJ8), pp.172-173 (2014.3)
Developing User-friendly and Customizable Text AnalyzerL室g親僥寄僥 徭隼冱囂I尖冩梢片
?
Yuki Miyanishi and Kazuhide Yamamoto. Developing User-friendly and Customizable Text Analyzer. The International Conference on Practical Linguistics of Japanese (ICPLJ8), pp.172-173 (2014.3)
12. 箭猟
12
The ship may have sunk but the movie didn't!!! Director, James Cameron, from 'The Terminator' did
it again with this amazing picture. One of my favorite scenes is 'The Dinner table' scene, in which
Rose's family and friends meet Jack after he saves her. Rose has a look on her face that every
woman should have when you meet 'THE ONE'...I hope I have that look when I am in the room with
my future husband.<br /><br />Jack and Rose have a connection that is 'MOVIE STUFF' but it's good
movie stuff. We have the greedy mom and all her elite stuck up associates who live off of their
husbands wealth. Rose almost commits suicide but the Gilbert Grape star rescues her. I really liked
the hanging over the boat scene. It was a good risk.<br /><br />The movie is long but it's fantastic!!!
Good story, good flow, good actors!!! Go see it twice if you want, Its worth it!!!
竃灸
Andrew L. Maas, Raymond E. Daly, Peter T. Pham, Dan Huang, Andrew Y. Ng, and Christopher Potts. (2011).
Learning Word Vectors for Sentiment Analysis.
The 49th Annual Meeting of the Association for Computational Linguistics (ACL 2011).
18. 箭猟
1. This movie is terrible. It¨s a waste of time.
2. This movie was good and made me happy. Had a very good time.
3. This movie is just boring.
1. [movie, terrible, waste, time]
2. [movie, good, made, happy, good, time]
3. [movie, boring]
18
念I尖
19. 竃F指方 (bag of words)
? テキストごとの光gZの竃F指方をそのまま蒙翮燭箸垢
19
boring good happy made movie terrible time waste
Text 1 0 0 0 0 1 1 1 1
Text 2 0 2 1 1 1 0 1 0
Text 3 1 0 0 0 1 0 0 0
20. 竃Fl業(TF: Term Frequency)
? テキストごとの光gZの竃F指方を畠gZ方で護った、
蒙翮燭箸垢
? ???,? =
テキスト ?におけるgZ ?の竃F指方
テキスト ?の畠gZ方
20
boring good happy made movie terrible time waste
Text 1 0.00 0.00 0.00 0.00 0.25 0.25 0.25 0.25
Text 2 0.00 0.33 0.17 0.17 0.17 0.00 0.17 0.00
Text 3 0.50 0.00 0.00 0.00 0.50 0.00 0.00 0.00
21. TF-IDF (IDF: Inverse Document Frequency)
? TFにし、ほぼすべてのテキストに竃Fするような
仝レア業の詰い々gZの嶷みを和げる
? ???? = log
畠テキスト方
gZ ?が竃Fするテキスト方
? ??????,? = ???,? 〜 ????
21
boring good happy made movie terrible time waste
IDF 0.48 0.48 0.48 0.48 0.00 0.48 0.30 0.48
Text 1 0.00 0.00 0.00 0.00 0.00 0.12 0.08 0.12
Text 2 0.00 0.16 0.08 0.08 0.00 0.00 0.05 0.00
Text 3 0.24 0.00 0.00 0.00 0.00 0.00 0.00 0.00
23. word2vec
? Distributed Representation of
Words and Phrases and their
Compositionality
(Mikolov et al., 2013)
? 慌軟する┰くにFれるg
Zを僥させる
? ベクトルどうしのvS來もベ
クトルで燕せる
? Paris C France + Japan = ?
23
24. その麿のgZ托めzみモデル
? GloVe: Global Vectors for Word Representation
? Pennington et al., 2014
? BERT: Pre-training of Deep Bidirectional Transformers for Language
Understanding
? Devlin, et al. 2018
24