- https://gist.github.com/HonzaKral/35451c6322a78d4de8f5 to load stack overflow data. Clone into load_stack subdirectory within this repo.
- Dba StackExchange dump from http://www.clearbits.net/creators/146-stack-exchange-data-dump - the relevant files (Posts.xml, Users.xml and Comments.xml) should be found in the data subdirectory in this repo.
- elasticat (https://gist.github.com/karmi/5670573) for pretty-printing es results
- kibana 3 with dashboard loaded from kibana-dash.json
Run:
BULK_FILE=$PWD/bulk.json ES_INDEX=dba-stack load_stack/load_stack.sh $PWD/data
From this repo. This should generate a bulk.json that can be re-used during the presentation (second run of this script won't have to generate it from the xml).
At this moment it's good idea to get kibana up and running and load the dashboard, verify that it's sane and displays correct data.
Clear all your indices:
curl -X DELETE http://localhost:9200
You should be set to go, enjoy!