Audio Transcription API

The Audio Transcription service transcribes audio files to text and can optionally translate the result into other languages.

Overview

The transcription API accepts an uploaded audio file and returns its text transcription. Audio in multiple languages is supported.

This endpoint is OpenAI-compatible.

Transcription Endpoint

POST /v1/audio/transcriptions

This endpoint transcribes an audio file into text, with optional translation into another language.

Request Parameters

| Parameter | Type | Required | Description |
| --- | --- | --- | --- |
| file | file | Yes | The audio file to transcribe (mp3, mp4, wav, m4a, webm) |
| model | string | No | Model to use (default: "KrosMLingualSTT1.0.0") |
| language | string | No | Optional language specification |
| prompt | string | No | Text to guide the transcription |
| response_format | string | No | Output format (default: "json") |
| temperature | float | No | Model temperature (default: 0.0) |

Request Body

file (file, required): The audio file to be transcribed. Supported formats: mp3, mp4, wav, m4a, webm.
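
The parameters above can be assembled and checked on the client before anything is uploaded. The following is a minimal sketch, not part of the official client: `build_form_fields` is a hypothetical helper, and the local extension check is an assumption about what is convenient rather than something the API requires. The supported formats and default values are the ones listed in this section.

```python
from pathlib import Path
from typing import Optional

# Formats and defaults as listed in the Request Parameters section.
SUPPORTED_EXTENSIONS = {".mp3", ".mp4", ".wav", ".m4a", ".webm"}
DEFAULT_MODEL = "KrosMLingualSTT1.0.0"


def build_form_fields(
    audio_path: str,
    model: str = DEFAULT_MODEL,
    language: Optional[str] = None,
    prompt: Optional[str] = None,
    response_format: str = "json",
    temperature: float = 0.0,
) -> dict:
    """Return the non-file form fields for a transcription request.

    Hypothetical helper: rejects unsupported file extensions locally and
    leaves out optional fields that were not supplied.
    """
    suffix = Path(audio_path).suffix.lower()
    if suffix not in SUPPORTED_EXTENSIONS:
        raise ValueError(f"unsupported audio format: {suffix or '(none)'}")

    fields = {
        "model": model,
        "response_format": response_format,
        "temperature": str(temperature),  # form fields are sent as text
    }
    if language is not None:
        fields["language"] = language
    if prompt is not None:
        fields["prompt"] = prompt
    return fields
```

For example, `build_form_fields("meeting.mp3", language="en")` returns the default model, format, and temperature plus the language field, while a path ending in `.txt` raises `ValueError` before any request is made.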

Response

choices (array):
- text (string): The resulting transcribed and optionally translated text.
- index (integer): The array index of the transcription choice.
- finish_reason (string): Explanation of why the transcription process concluded.
model (string): Identifies the transcription/translation model used.
object (string): Specifies the type of response object.

Example Request

curl -X POST \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -F "file=@path/to/your/audio.mp3" \
  -F "target_language=en" \
  https://api.krosai.com/api/v1/audio/transcriptions
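
For readers calling the endpoint from code, here is a rough Python equivalent of the curl command, using only the standard library. It is a sketch under stated assumptions: `build_transcription_request` is a hypothetical helper, the multipart body is assembled by hand, and the file part is labelled `application/octet-stream` because the expected content type is not documented here. The URL, the bearer header, and the `target_language` field are copied from the curl example. Note that `target_language` does not appear in the parameter list, so its exact behavior is not confirmed by this page.

```python
import urllib.request
import uuid

# URL taken from the curl example in this section.
API_URL = "https://api.krosai.com/api/v1/audio/transcriptions"


def build_transcription_request(api_key, filename, audio_bytes, fields):
    """Build (but do not send) a multipart/form-data POST request.

    Hypothetical helper: `fields` is a dict of plain text form fields,
    `audio_bytes` is the raw content of the audio file.
    """
    boundary = uuid.uuid4().hex
    parts = []

    # One part per text field.
    for name, value in fields.items():
        header = (
            f"--{boundary}\r\n"
            f'Content-Disposition: form-data; name="{name}"\r\n\r\n'
        )
        parts.append(header.encode("utf-8") + str(value).encode("utf-8") + b"\r\n")

    # The file part; the content type is an assumption.
    file_header = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="file"; filename="{filename}"\r\n'
        "Content-Type: application/octet-stream\r\n\r\n"
    )
    parts.append(file_header.encode("utf-8") + audio_bytes)

    # Closing boundary.
    parts.append(f"\r\n--{boundary}--\r\n".encode("utf-8"))

    return urllib.request.Request(
        API_URL,
        data=b"".join(parts),
        method="POST",
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": f"multipart/form-data; boundary={boundary}",
        },
    )
```

Passing the returned object to `urllib.request.urlopen` would send the request. Building and sending are kept separate so the request can be inspected or tested without network access.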

Example Response

{
  "id": "trans_1234567890",
  "object": "transcription",
  "created": 1682456789,
  "model": "KrosMLingualSTT1.0.0",
  "text": "This is the transcribed text from the audio file.",
  "usage": {
    "prompt_tokens": 0,
    "completion_tokens": 45,
    "total_tokens": 45
  }
}
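
Client code usually needs only the transcribed text from this payload. The sketch below assumes a JSON response and is deliberately tolerant: the Example Response shows a top-level `text` field, while the Response section describes a `choices` array whose items carry `text`, so the hypothetical helper `extract_text` accepts either shape instead of guessing which one the service actually returns.

```python
import json


def extract_text(response_body: str) -> str:
    """Return the transcribed text from a JSON response body.

    Handles both shapes that appear in this document: a top-level "text"
    field (Example Response) and a "choices" array whose items carry
    "text" (Response section).
    """
    payload = json.loads(response_body)

    # Shape 1: {"text": "..."}
    if isinstance(payload.get("text"), str):
        return payload["text"]

    # Shape 2: {"choices": [{"text": "...", "index": 0, ...}]}
    choices = payload.get("choices") or []
    if choices and isinstance(choices[0].get("text"), str):
        return choices[0]["text"]

    raise ValueError("no transcription text found in response")
```

If `response_format` is set to something other than "json", the body may not be JSON at all and this helper would not apply.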

Best Practices

Transcription

- Use high-quality audio for better results.
- Specify the language when known for improved accuracy.
- Keep background noise to a minimum.

Error Handling

All APIs return standard HTTP status codes:

200: Success
400: Bad request (check parameters)
401: Unauthorized (check API key)
429: Rate limit exceeded
500: Server error

Error responses include a detail field with more information about the error.
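
One way to act on these status codes is sketched below. Everything here is illustrative rather than prescribed by the API: `call_with_retry` and `TranscriptionError` are hypothetical names, `send` stands for any callable that performs the request and returns a `(status_code, body_text)` pair, and treating 429 and 500 as retryable with exponential backoff is an assumption about sensible client behavior. The `detail` field is read from the error body as described above.

```python
import json
import time

# Assumption: only rate limiting and server errors are worth retrying.
RETRYABLE = {429, 500}


class TranscriptionError(Exception):
    """Raised when the API returns a non-200 status that is not retried."""

    def __init__(self, status, detail):
        super().__init__(f"HTTP {status}: {detail}")
        self.status = status
        self.detail = detail


def call_with_retry(send, max_attempts=3, base_delay=1.0, sleep=time.sleep):
    """Call send() until it returns 200, a non-retryable status,
    or the attempts run out.

    `send` is a hypothetical callable returning (status_code, body_text).
    """
    for attempt in range(max_attempts):
        status, body = send()
        if status == 200:
            return json.loads(body)

        # Pull the "detail" field if the body is a JSON object.
        try:
            detail = json.loads(body).get("detail", body)
        except (ValueError, AttributeError):
            detail = body

        if status not in RETRYABLE or attempt == max_attempts - 1:
            raise TranscriptionError(status, detail)

        # Exponential backoff: base_delay, then 2x, then 4x, ...
        sleep(base_delay * 2 ** attempt)
```

With this shape, a 400 or 401 fails immediately with the server's `detail` message, while a 429 followed by a 200 succeeds after one pause.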

