Advent of Haystack

Try out Haystack 2.0-Beta to discover what’s coming in the next major release

with 10 challenges in the month of December 🎉

Every few days one of the doors in this page will open to reveal a new challenge

Submit your results and discuss solutions with the community 🎄

Haystack is quite a large project. So, unsurprisingly, the Haystack elves always maintain documentation. But, they’re also releasing new versions of Haystack regularly. To make it easier for people who use Haystack in production, they also version the documentation. So, if there’s something you need to learn about a specific version of Haystack, it’s best to switch to the documentation of that version!

In this challenge, Elf Silvano wanted to showcase how you can do metadata filtering to do RAG and question-answering on only a specific segment of your data in your database. However, he simply forgot to actually implement the filters. He’s prepared a Colab where the documents and their metadata are already indexed. Your task is to use a querying pipeline with metadata filtering. You need to specify a filtering condition to retrieve only documents that fulfill that condition. In this challenge, your task is to filter to the specific version of documentation that you want to do question-answering on, which is Haystack 2.0 🚀

🕊️ Hint: For this challenge, the InMemoryBM25Retriever component and the Pipeline are going to be very useful.

💜 Here is a Starter Colab

📚 Useful Docs: