Internal Document Search with AI: Build a…

May 31

In this proof of concept, I share how I built a local RAG system using Qwen1.5 (0.5B), Ollama and LangChain, a step-by-step pipeline to query your documents and understand how it works.

Read →

4 Comments

Pritam Sonavne

Jun 2Edited

This is great for start. Also, Do we have similar apis for Java ?

Expand full comment

Reply (1)

Nina

Jun 10Edited

Thanks Pritam! Yes, but not end-to-end in Java yet

The best approach now is to use Python + LangChain to handle embeddings, chunking, and vector DB setup (Qdrant, Weaviate, etc.).

Then, your Java app just queries the vector DB and calls the LLM (via Ollama’s HTTP API).

Cleaner separation, better tools, and no need to force everything into Java.

Expand full comment

Jawahar

Jun 1

Thanks for sharing . Great way to get hands-on with RAG

Expand full comment

Reply (1)

Nina

Jun 1

Thanks a lot, Jawhar!

Expand full comment