> For the complete documentation index, see [llms.txt](https://petes-organization-3.gitbook.io/speechmatics-docs/llms.txt). Markdown versions of documentation pages are available by appending `.md` to page URLs; this page is available as [Markdown](https://petes-organization-3.gitbook.io/speechmatics-docs/speech-to-text/overview.md).

# Overview

## Speech to text overview

Learn how to turn audio into text.

Use speech to text to transcribe using one of the modes:

* **Real-time processing** \[anchor ↓ ]: Stream audio from an input device or file and receive instant updates of the transcription as it happens
* **Batch processing** \[anchor ↓ ]: Submit an audio file and receive a complete text transcription once the processing is finished

### Developer quickstart

<table data-card-size="large" data-view="cards"><thead><tr><th></th><th></th></tr></thead><tbody><tr><td><strong>Transcribe in real-time</strong></td><td>Instantly convert streaming audio to text with Real-time processing</td></tr><tr><td><strong>Transcribe a file</strong></td><td>Use Batch processing to accurately turn your audio files into text</td></tr></tbody></table>

{% hint style="info" %}
**Tip**\
The quickest way to transcribe voice from audio is in our web portal.\
Click 'View config' button when using the transcription to get the code and use it in the API.
{% endhint %}

### Deployments

Turn live audio into accurate transcripts — instantly.

The Speechmatics real-time speech to text API \[link: RT API ref] converts spoken audio into text with low latency and high accuracy.

### Real-time processing

Turn live audio into accurate transcripts — instantly.

The Speechmatics real-time speech to text API \[link: RT API ref] converts spoken audio into text with low latency and high accuracy.

{% hint style="info" %}
**Info**

* Use when speed matters
* Transcribe live broadcasts or events
* Caption webinars, meetings, or podcasts in real time
* Power voice assistants or AI agents with live input
* Monitor contact center calls as they happen
* Build accessibility features like live captions
  {% endhint %}

Operating points

Choose between two accuracy models when configuring your real-time session:

* **Standard** — fast, efficient, and suitable for most use cases
* **Enhanced** — offers improved accuracy, especially for complex audio (e.g. noisy environments, varied accents), with slightly higher resource usage

### Batch processing

*TBC…*


---

# Agent Instructions
This documentation is published with GitBook. GitBook is the documentation platform designed so that both humans and AI agents can read, navigate, and reason over technical content effectively. Learn more at gitbook.com.

## Querying This Documentation
If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter, and the optional `goal` query parameter:

```
GET https://petes-organization-3.gitbook.io/speechmatics-docs/speech-to-text/overview.md?ask=<question>&goal=<endgoal>
```

`ask` is the immediate question: it should be specific, self-contained, and written in natural language.
`goal` is optional and describes the broader end goal you are ultimately trying to accomplish on behalf of the user. GitBook uses it to tailor the answer towards what is most useful for that goal.

The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
