Run the full raged stack locally using Docker Compose. No cloud services required.
```mermaid
sequenceDiagram
    participant U as Developer
    participant DC as Docker Compose
    participant PG as Postgres
    participant OL as Ollama
    participant API as RAG API
    participant WK as Worker

    U->>DC: docker compose up -d
    DC->>PG: Start (port 5432)
    DC->>OL: Start (port 11434)
    DC->>API: Start (port 8080)
    API->>PG: Connect
    API->>OL: Connect

    Note over U,WK: For enrichment stack:
    U->>DC: docker compose --profile enrichment up -d
    DC->>WK: Start enrichment worker
    WK->>PG: Poll for tasks using SKIP LOCKED

    U->>API: curl /healthz
    API-->>U: {"ok":true}
    U->>OL: Pull nomic-embed-text
    OL-->>U: Model ready
```
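The worker's `SKIP LOCKED` polling shown in the diagram follows the standard Postgres job-queue pattern: each worker claims one pending task inside a transaction, and rows already locked by other workers are skipped rather than waited on, so concurrent workers never block each other. An illustrative sketch of that claim query (the table and column names here are hypothetical, not raged's actual schema):

```sql
-- Claim a single pending task; concurrent workers skip rows that are
-- already locked instead of blocking on them.
UPDATE tasks
SET status = 'running'
WHERE id = (
  SELECT id
  FROM tasks
  WHERE status = 'pending'
  ORDER BY created_at
  FOR UPDATE SKIP LOCKED
  LIMIT 1
)
RETURNING id;
```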
```bash
# 1. Start core services (Postgres, Ollama, API)
docker compose up -d

# 2. Verify the API is running
curl -s http://localhost:8080/healthz
# → {"ok":true}

# 3. Pull the embedding model (first time only)
curl http://localhost:11434/api/pull -d '{"name":"nomic-embed-text"}'
```
```bash
# 1. Start all services including the enrichment worker
docker compose --profile enrichment up -d

# 2. Verify the API is running
curl -s http://localhost:8080/healthz
# → {"ok":true}

# 3. Pull the embedding model (first time only)
curl http://localhost:11434/api/pull -d '{"name":"nomic-embed-text"}'

# 4. (Optional) Pull an LLM for tier-3 extraction
curl http://localhost:11434/api/pull -d '{"name":"llama3"}'

# 5. Verify enrichment is enabled
curl -s http://localhost:8080/enrichment/stats
# → {"queue":{"pending":0,...},"totals":{...}}
```
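A drained queue (`pending` at 0) means the worker has caught up. A minimal Python sketch of checking this from the stats payload — only the `queue.pending` field shown in the example above is assumed; the rest of the real response is elided here:

```python
import json

# Sample payload in the shape of the /enrichment/stats example above
# (only queue.pending is taken from the docs; other fields are elided).
payload = '{"queue": {"pending": 0}, "totals": {}}'

stats = json.loads(payload)
# The queue is idle when no tasks are pending.
idle = stats["queue"]["pending"] == 0
print(idle)
# → True
```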
| Service | Port | Purpose |
|---|---|---|
| api | 8080 | RAG API (Fastify) |
| postgres | 5432 | Vector database (with pgvector extension) |
| ollama | 11434 | Embedding model runtime |
| Service | Port | Purpose |
|---|---|---|
| enrichment-worker | - | Python enrichment worker (background service) |
Set `RAGED_API_TOKEN` in `docker-compose.yml` under the `api` service:

```yaml
environment:
  RAGED_API_TOKEN: "my-dev-token"
```

Then pass `--token my-dev-token` to CLI commands (or set the `RAGED_API_TOKEN` env var).
Enrichment is enabled via Docker Compose profiles. To customize enrichment behavior, set environment variables in `docker-compose.yml`:

API service:

```yaml
environment:
  ENRICHMENT_ENABLED: "true"  # Enable enrichment features
  DATABASE_URL: "postgresql://raged:password@postgres:5432/raged"  # Postgres connection
```

Worker service:

```yaml
environment:
  DATABASE_URL: "postgresql://raged:password@postgres:5432/raged"
  OLLAMA_URL: "http://ollama:11434"
  WORKER_CONCURRENCY: "4"            # Number of concurrent tasks
  EXTRACTOR_PROVIDER: "ollama"       # Options: ollama, anthropic, openai
  EXTRACTOR_MODEL_FAST: "llama3"     # Fast model for quick extraction
  EXTRACTOR_MODEL_CAPABLE: "llama3"  # Capable model for complex extraction
  EXTRACTOR_MODEL_VISION: "llava"    # Vision model for image inputs
```
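Put together, the relevant portion of a `docker-compose.yml` might look like the following sketch (service names, the profile name, and the credentials are illustrative; keep whatever your compose file already defines):

```yaml
services:
  api:
    environment:
      ENRICHMENT_ENABLED: "true"
      DATABASE_URL: "postgresql://raged:password@postgres:5432/raged"
  enrichment-worker:
    # Only started when --profile enrichment is passed to docker compose
    profiles: ["enrichment"]
    environment:
      DATABASE_URL: "postgresql://raged:password@postgres:5432/raged"
      OLLAMA_URL: "http://ollama:11434"
      WORKER_CONCURRENCY: "4"
```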
```bash
# Stop services (keep data)
docker compose down

# Stop services and delete data volumes
docker compose down -v
```
For hot-reload during API development:
```bash
cd api
npm install
DATABASE_URL=postgresql://raged:password@localhost:5432/raged OLLAMA_URL=http://localhost:11434 npm run dev
```
This runs the API directly on your machine while Postgres and Ollama run in Docker.
```bash
cd cli
npm install
npm run dev -- index --repo <url> --api http://localhost:8080

# Test URL ingestion
npm run dev -- ingest --url https://example.com/article --api http://localhost:8080
```
The API supports server-side URL fetching for web pages, PDFs, and other content types:
Ingest a web article:
```bash
curl -s -X POST http://localhost:8080/ingest \
  -H "Content-Type: application/json" \
  -d '{
    "items": [{
      "url": "https://example.com/article"
    }]
  }'
```
Ingest a PDF from URL:
```bash
curl -s -X POST http://localhost:8080/ingest \
  -H "Content-Type: application/json" \
  -d '{
    "items": [{
      "url": "https://example.com/whitepaper.pdf",
      "source": "Example Whitepaper"
    }]
  }'
```
Mixed batch (URLs + text):
```bash
curl -s -X POST http://localhost:8080/ingest \
  -H "Content-Type: application/json" \
  -d '{
    "items": [
      {"url": "https://example.com/article"},
      {"text": "Direct text content", "source": "notes/snippet.txt"}
    ]
  }'
```
```bash
# Ingest a web page
node dist/index.js ingest --url https://example.com/article --api http://localhost:8080

# Ingest a PDF
node dist/index.js ingest --url https://example.com/whitepaper.pdf --api http://localhost:8080
```
**Supported Content Types:**

**SSRF Protection:** URL ingestion includes automatic security protections: