See the script load.py
which scrapes wiki articles and stores it in Postgres. It uses the FRAMES dataset available in test.tsv
See generate_and_store_embeddings.py
and generate_and_store_embeddings_title.py
- Naive RAG:
rag.py
is implementation of a naive RAG technique on top-k chunks of wikipedia articles - Naive RAG on titles:
rag_titles.py
is implementation of a RAG technique which gets top-k complete articles - Agentic RAG:
agentic_rag.py
is implemnetation of an agentic RAG technique on top-k chunks of wikipedia articles - Agentic RAG on titles:
agentic_rag_titles.py
is implemetation of an agentic RAG technique on top-k complete articles
- See the PromptQL project in
my-assistant/
directory - Run it via
ddn run docker-start