A user asks how to cut agent token use with a personal RAG setup

A Reddit user says they keep marketing content, notes, and learnings in PDF, Word, and Excel files. Each time they work on a new idea, they upload the documents to an agent and ask it to design a system, which takes time and uses many tokens. The user asks how to build a selfhosted RAG that stores files by topic and lets them pull information when needed.

Key points

  • The user stores marketing files in PDF, Word, and Excel formats.
  • They currently upload documents to an agent for each new idea.
  • They say this workflow is time-consuming and token-heavy.
  • They want a selfhosted RAG for topic-wise file storage.
  • The goal is to reuse personal files when needed without uploading everything again.

Quick term guide

marketing
The activities used to tell people about a product and encourage them to buy it.
Content
Information or experiences, like articles or videos, provided through digital media.
Upload
The process of sending data from your computer to the internet.
selfhosted
Run on your own computer or server instead of relying fully on an outside service.
release
A published version of software that people can download or use.
context
The information an AI uses to understand your request, such as files, notes, and past messages.
workflow
A repeatable set of steps for getting a task done.
persona
A specific personality or role that an AI agent is set to play.
Read original