hyperopia

Experiments in working with Wikipedia offline using database dumps.

setup (45+gb)

Download - enwiki-latest-pages-articles-multistream-index.txt.bz2 - enwiki-latest-pages-articles-multistream.xml.bz2 - enwiki-latest-categorylinks.sql.gz

from http://dumps.wikimedia.org/enwiki/latest/

import the enwiki category database into mysql:

% zcat enwiki-latest-categorylinks.sql.gz | mysql -u USER -p DBNAME

copy mysqlpassword.example.py to mysqlpassword.py and fill in your db info

make indices % python2 mk-indices.py /path/to/enwiki-latest-pages-articles-multistream.xml.bz2

serve % python2 serve.py /path/to/enwiki-latest-pages-articles-multistream.xml.bz2

git clone git://git.numm.org/hyperopia

snapshot: hyperopia.zip

files