← Back to sonar

The web speaks. Now your agent can listen.

Audio has always been the unindexed half of the internet. News broadcasts, podcast discussions, earnings calls, committee hearings — spoken, then gone. Sonar changes that for AI agents.

May 2025 · The Sonar Team · ~3 min read · Public Beta

Half the Web, None of the Signal

Agents are text-literate but audio-blind

If you've spent time building an AI agent — a research assistant, a market intelligence tool, a news monitor — you've bumped into the same ceiling. Your agent can read an article. It can parse a PDF. It can search the web and pull back a dozen URLs to scrape and summarize.

But the moment the relevant information exists only as audio, your agent goes quiet. Not because the information isn't out there. The Fed chair gave a speech last Tuesday. The CEO said something interesting on a panel in Davos. A senator made an explicit commitment during a committee hearing. All of it happened. All of it is technically "on the internet." None of it is retrievable by your agent.

Audio is arguably where the most unguarded, substantive information lives — and it's been invisible to AI agents until now.

Executives say things on earnings calls that never make it into press releases. Scientists describe their findings at conferences weeks before the paper drops. Politicians commit to positions in hearings that don't surface in any searchable text. The gap isn't in what was said. It's in what was indexed.

Why the obvious workarounds fall short

Approach
Agent-compatible?
The actual problem
Scrape YouTube captions
Partial
Only indexed content; no cross-source search; auto-captions are unreliable; no speaker attribution
Run Whisper on podcast feeds
Impractical
You need the file before you can transcribe it — no way to search across sources you haven't already discovered
RSS + transcript services
Fragmented
Podcast-only; no news radio, social audio, earnings calls, or historical archives
Web search + hoping
Indirect
Returns articles about what was said — not the source clip with timestamp and speaker
Sonar API
Yes
Natural language query in; ranked clips with accurate transcripts, timestamps, and speaker attribution out

What Sonar is

One tool call. Ranked clips. Citable sources.

Sonar is a search API designed specifically for the AI agent loop. You pass a natural language question — a topic, event, speaker, or claim — and you get back ranked audio clips with accurate transcripts, precise timestamps, and speaker attribution. Your agent can quote them, summarize them, or chain further searches from them.

The index spans news broadcasts, podcasts, radio, earnings calls, social audio, and archives. It updates continuously. You describe what you're looking for; Sonar finds where it was said.

01 — ASK

Agent calls Sonar

A natural language question — topic, event, speaker, or claim — passed as a tool call in your agent loop

02 — FIND

Sonar searches audio

Sonar searches news, podcasts, radio, earnings calls, social audio, and archives — and returns what matches.

03 — CITE

Results your agent uses

Transcript, timestamp, speaker, and source — ready to quote, summarize, or follow up on


Real scenarios

What this actually unlocks

The clearest way to understand what Sonar enables is to think about specific agents and the questions that would previously stop them.

Market intelligence

An analyst agent tracking a public company can now catch what the CFO said on a CNBC appearance — not just the quarterly filing. Executives hedge and forecast in interviews in ways that never reach investor relations documents.

"What has Jensen Huang said about TSMC dependence?"
Policy monitoring

Committee hearings produce hours of audio that takes days to become searchable text — if ever. A policy agent can now track what senators and agency heads are actually saying in real time, before the summary gets written.

"What are lawmakers saying about grid storage?"
Media intelligence

News radio and podcast discussions often run ahead of the written record. A media monitoring agent can track narratives as they emerge in audio — surfacing a talking point in syndicated radio six hours before it hits a headline.

"How is the tariff reversal being framed on business radio?"
Research & fact-checking

Conference talks and expert panels contain direct primary-source statements. A research agent can now cite a scientist's conference remarks with a timestamp — the way you'd cite a paper — weeks before the preprint exists.

"What did Demis Hassabis say about protein folding at NeurIPS?"

The API

Designed to drop into your existing agent

Sonar exposes two modes: Retrieve for fast semantic search returning ranked clips, and Research for a full synthesis — a summary with key themes and cited audio evidence. Both are reachable through the same endpoint.

Retrieve ranked clips Research synthesized summary
POST https://api.sonarapi.dev/v1/audio.search
query string · req Natural language question, topic, or claim. No boolean syntax needed.
mode string retrieve (default) for ranked clips · research for synthesized summary with citations
sources array Filter by source type: news, podcast, radio, earnings, social, archive
date_range object { from, to } ISO 8601 dates. Omit for all-time.
top_k integer Number of results to return. Default 5, max 20.

Each result in the response looks like this:

JSON response
{
  "results": [
    {
      "source":     "Bloomberg Surveillance",
      "source_type": "news",
      "speaker":    "Mary Daly, SF Fed President",
      "timestamp":  "2025-03-18T09:14:32Z",
      "clip_start": 412,  // seconds into recording
      "clip_end":   447,
      "transcript": "We're not seeing the kind of labor market softening
        that would give us confidence to move faster. I think
        two cuts this year is the right framing.",
      "relevance":  0.94,
      "source_url": "https://..."
    }
  ]
}

The response is citation-ready. Your agent gets the source, the speaker, the exact timestamp, and a clean transcript — everything needed to ground a claim in a primary source rather than paraphrasing something that paraphrased something else.

Using it as an LLM tool

Sonar is built to live inside a tool loop. Here's the full integration as a function definition for an OpenAI-compatible tool schema — paste this in and it works:

TypeScript
// Tool definition for your agent
const sonarTool = {
  type: "function",
  function: {
    name: "search_audio",
    description: "Search public audio — news, podcasts, radio,
      earnings calls, social audio, and archives.
      Returns ranked clips with transcripts,
      timestamps, and speaker attribution.",
    parameters: {
      type: "object",
      properties: {
        query: { type: "string" },
        mode:  { type: "string", enum: ["retrieve", "research"] }
      },
      required: ["query"]
    }
  }
};

// Handler
async function search_audio({ query, mode = "retrieve" }) {
  const res = await fetch("https://api.sonarapi.dev/v1/audio.search", {
    method: "POST",
    headers: {
      "Content-Type": "application/json",
      "Authorization": `Bearer ${SONAR_API_KEY}`
    },
    body: JSON.stringify({ query, mode })
  });
  return res.json();
}

What a result looks like in practice

Say your agent is building a brief on Federal Reserve rate expectations. It calls Sonar with "Fed rate cut outlook 2025". Here's what comes back:

Sonar retrieve — top results
Bloomberg Surveillance NEWS Mar 18, 2025 · 6:54 AM
"We're not seeing the kind of labor market softening that would give us confidence to move faster. I think two cuts this year is the right framing, and I'd caution against pricing in anything more aggressive than that."
Mary Daly · SF Fed President · relevance 0.94
Odd Lots Podcast PODCAST Mar 14, 2025 · 38:12 into episode
"The thing the market keeps getting wrong is the sequencing. The first cut isn't the story — it's whether they can deliver on the path afterward. Right now the path looks very uncertain."
Neil Dutta · Renaissance Macro · relevance 0.91
Senate Banking Committee Hearing ARCHIVE Feb 11, 2025 · 2:08:44 into session
"Chairman Powell, I want a direct answer: is the committee committed to two cuts before the midterm election? ... [Powell]: I want to be clear that our decisions are entirely data-dependent and calendar considerations play no role."
Sen. Warren + Jerome Powell · relevance 0.88

Three primary sources, three speakers, three timestamps. Your agent didn't scrape anything, didn't pre-download any files, and didn't rely on a journalist's summary of what was said. It has the actual words, with provenance.

This is the difference between your agent knowing what people are saying and your agent knowing what people wrote about what was said. For anything that requires grounding a claim in a primary source, the gap is significant.


Under the hood

How the index works

Sonar continuously indexes audio from public sources — news networks, podcast directories, radio station feeds, public earnings calls, congressional archives, and social audio platforms. The corpus grows every day.

When your agent asks a question, Sonar searches that index and returns the moments that matter — ranked clips with transcript, timestamp, speaker, and source. Not pages to scrape. Not articles to parse. The actual spoken record, made usable.

The two modes reflect how agents actually use audio search. Retrieve is for when your agent needs raw evidence — the actual clip — to cite or quote. Research is for when your agent needs to synthesize across multiple sources: it finds related clips and returns a structured summary with every claim pinned to its audio source.


What's next

We're building this with early users

Sonar is in public beta. If you're building agents that need to listen to the web — not just read it — we'd love to hear from you at sonarapi.com.

Early users get a direct line to the founding team. We're onboarding a limited number of developers and teams right now, and the people who build on Sonar in the next few months will directly shape what the API becomes. If you have a use case we're not covering yet — a source type, a workflow, a constraint we should know about — we want to hear from you.

Audio is half the web. It's time agents could use it.

See what Sonar can do Try a live query in the playground, or tell us what you're building.

Questions? Reach us at contact@sonarapi.com.

— The Sonar team