User Tools

Site Tools


it:training_of_german_word_embedding_for_nlp

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
it:training_of_german_word_embedding_for_nlp [2019/05/30 12:01]
pmay [Step by step]
it:training_of_german_word_embedding_for_nlp [2019/05/30 12:25] (current)
pmay [Step by step]
Line 7: Line 7:
     - ''​dewiki-20190520-pages-articles.xml.bz2''​ at the moment     - ''​dewiki-20190520-pages-articles.xml.bz2''​ at the moment
   - Use wikiextractor:​ https://​github.com/​attardi/​wikiextractor   - Use wikiextractor:​ https://​github.com/​attardi/​wikiextractor
-    - ''<​nowiki>​python WikiExtractor.py -o data/​we_output --processes 8 -b 100M data/​dewiki-20190520-pages-articles.xml.bz2</​nowiki>''​+    - ''<​nowiki>​python WikiExtractor.py -o data/​we_output --processes 8 data/​dewiki-20190520-pages-articles.xml.bz2</​nowiki>''​
  
 ===== WikiExtractor.py ===== ===== WikiExtractor.py =====
it/training_of_german_word_embedding_for_nlp.txt · Last modified: 2019/05/30 12:25 by pmay