Project · 2025
OSRS RAG
Retrieval-augmented Q&A system over the Old School RuneScape wiki. Scraper, indexer, and query pipeline as separable scripts.
OSRS has one of the densest community knowledge bases on the internet, which makes it a good corpus to learn retrieval on.
The system is split into a handful of small scripts that each do one job:
scraper.pypulls down the wiki.indexer.pychunks and embeds it.query.pyis the front door for asking it questions.update.pyre-syncs when the wiki changes.
Most of the learning was in the boring parts — chunk-size tuning, when reranking actually helps, and how badly an LLM can hallucinate when it confidently mistakes one quest for another. The biggest takeaway: most “good RAG” is chunking, not embeddings.