Cold Outreach Swarm - Multi-Agent B2B Prospecting Engine

Built an autonomous prospecting engine that fuses a research swarm, a writing agent grounded on the prospect company, and a deliverability supervisor. Replaces the work of a 4-SDR team on a per-account basis.

The Challenge

This early-stage outbound sales agency needed to scale personalized B2B outreach across forty distinct client accounts simultaneously without hiring a massive SDR team. Templated email marketing and LinkedIn outreach sequences generated extremely low reply rates of under 2% and triggered high domain blacklisting risks due to over-sending. The agency required an autonomous system that could research prospect organizations, extract technographic data, write highly personalized messaging that read as human-written copy, and rotate senders safely. The primary technical challenge involved building a multi-tenant orchestration pipeline that could store, analyze, and optimize campaign learnings. It was critical to maintain strict database isolation across client vectors and tenant spaces to prevent sensitive target metrics and proprietary campaign data from leaking between competing clients.

Pain points we set out to solve

×Sub-2% reply rates on templated cold sequences
×High domain blacklist risk from over-sending
×Manual research took 20+ minutes per account
×No feedback loop from replies back to copy improvement

Objectives

01Push qualified reply rates above 8% on seeded cohorts
02Research an account end-to-end in under 60 seconds
03Keep per-domain send rates under deliverability thresholds
04Let each client tenant keep its data and learnings isolated

Approach

How we delivered — phased, with clear checkpoints and evidence at each step.

Week 1-3
Agent graph design
Mapped the outbound workflow into four agent roles - researcher, writer, scheduler, and supervisor - wired through a LangGraph state machine with explicit handoff contracts and retry logic.
Week 4-7
Research swarm
Built a parallel research swarm that scrapes the prospect site, pulls LinkedIn via Playwright, summarizes recent news, and detects the tech stack via Wappalyzer. Outputs a structured brief the writer uses as grounded context.
Week 8-11
Writer and RAG corpus
The writer agent drafts email and LinkedIn copy grounded on the research brief plus a per-tenant RAG corpus of prior winning sends. No generic templates - every send is composed from first principles for the account.
Week 12-14
Deliverability supervisor and feedback loop
A supervisor checks every draft for spam triggers, enforces per-domain daily caps, and rotates sender identities. A reply-ingestion job tags responses by outcome and feeds them back into the RAG corpus as positive or negative examples.

The Solution

Four agents on a LangGraph state machine handle the full outbound loop: research -> write -> schedule -> supervise. Each tenant has its own RAG corpus of prior wins, so the writer agent compounds on what works for that client without cross-contamination. The supervisor is the deliverability brake that keeps campaigns safe. The system rotations and daily schedules ensure compliance with major email hosts.

Research swarm

Parallel agents pull site content, LinkedIn profile, recent news, and tech stack into a structured brief in under 60 seconds per account.

RAG-grounded writer

Every send is composed from the research brief plus retrieved examples of prior wins from the same tenant corpus - no generic templates.

Deliverability supervisor

Validates spam triggers, enforces per-domain daily caps, rotates sender identities, and pauses campaigns automatically when reply quality drops.

Multi-tenant isolation

Each client gets its own Postgres schema, vector namespace, and sender pool - so learnings and data never leak between tenants.

Reply-outcome feedback loop

A daily job tags inbound replies as positive, neutral, or negative and mints training examples for the RAG corpus, so the system gets sharper every week.

Technology stack

Picked for latency, cost, and long-term maintainability — not for novelty.

AI / Agent

LangGraphLangChainGPT-4oClaude 3.5 Sonnettext-embedding-3-large

Scraping / Research

PlaywrightPatchrightScraplingWappalyzer

Messaging / Infra

SMTP rotation poolPostgrespgvectorRedisCeleryFastAPI

Observability

LangSmithGrafanaSentry

Results

9.4%Qualified reply rate (from ~1.8% baseline)

47sMedian account-research time

4xAccounts per SDR-hour of effort

0Blacklisted sender domains in 90 days

Business impact

The agency onboarded 12 new clients in the quarter without adding headcount. Two existing clients renewed at 2.5x prior spend after seeing reply-rate lift in the first 30 days.

Key takeaways

Outbound writing quality is bounded by research quality - invest in the research swarm first
A deliverability supervisor is non-negotiable at scale, otherwise you will lose a domain before you learn what did not work
Multi-tenant RAG corpora turn each client into a compounding advantage instead of a one-shot campaign

Visual reference

Cold Outreach Swarm - Multi-Agent B2B Prospecting Engine — reference 1

The Challenge

Pain points we set out to solve

Objectives

Approach

Agent graph design

Research swarm

Writer and RAG corpus

Deliverability supervisor and feedback loop

The Solution

Research swarm

RAG-grounded writer

Deliverability supervisor

Multi-tenant isolation

Reply-outcome feedback loop

Technology stack

AI / Agent

Scraping / Research

Messaging / Infra

Observability

Results

Business impact

Key takeaways

Visual reference

Ready to start something similar?