home

Hello, My name is Pavel

I'm a UI Architect working at EIS insurance software company.

This website is my collection of notes on software development and software architecture. It’s not meant to be a traditional personal blog. The idea is to capture and explain technical concepts, mostly through charts rather than long articles. At the end of each post, you’ll find diagrams or charts that present the core idea visually.

How I learn:

  • Reading is one way you can learn.
  • Taking notes on what you’ve read is even better.
  • Applying knowledge in practice is the best way to learn.

I’m also one of the main contributors to the Kraken rules engine — a tool that evaluates rules across both runtimes (UI and Java) and ensures consistent results.

I'm constantly learning and documenting new knowledge bits ( buzzwords ) in charts.

This is my last post:

RAG

May 12, 2025
2 min
1 chart

What is RAG?

RAG stands for Retrieval Augmented Generation. This is the process when the Large Language Models (LMM) input is enriched with a contextual information from the external sources by a provided prompt.

What problem it solves?

  • Stale or outdated data. Retrieving relevant data from the external sources and data which is up to date.

  • Invalid answers or hallucinations. Adds data which may be missing, new or proprietary.

What is the Process?

The RAG process consists of three main stages: Retrieval

  • orchestrator accepts the prompt in a free text form. "Where are libraries in my district?"

  • creates a vector query from the prompt. Vector is also called vector embedding. Vector is a floating point number (1.345). Vector embedding is a series of vectors (e.g. [1.34, 4.78, -0.35]).

  • vector query is used to retrieve information from vector database.

  • vector database can match similar vector by a vector query and returns top matching documents or text passages Augmented

  • The retrieved text passages are integrated into the original prompt, adding specific context. For example: "Context: Libraries in users district are located in X street 1km away form his location. Question: Where are libraries in my district?" Generation

  • LLM generates response based on prompt and context

Is Vector database mandatory?

No, a vector database is not mandatory for the RAG process. However, it is highly recommended due to its efficiency, scalability, and high accuracy in matching semantically similar texts, significantly improving the quality and reliability of retrieval results.

© 2025 buzzchart.info