Palabra AI TypeScript Library

๐ŸŒ A TypeScript library for Palabra AI's real-time speech-to-speech translation API. ๐Ÿš€ Break down language barriers and enable seamless communication across 25+ languages.

Overview

The @palabra-ai/translator TypeScript library enables you to integrate real-time speech translation into your web applications. Whether you're building a new application, enhancing an existing product, or streamlining business processes, this library has the tools you need. With Palabra AI, you can:

  • Translate live speech in real time, making conversations smooth and natural
  • Preserve the original speaker's voice and tone in translated speech
  • Convert spoken language instantly into accurate, readable text for captions, accessibility, and analysis

Installation

npm install @palabra-ai/translator
# or
pnpm add @palabra-ai/translator
# or
yarn add @palabra-ai/translator

Prerequisites

To use the library you need Palabra API credentials: a Client ID and Client Secret (or a user token), as used in the steps below.

Quick Start

Follow the steps below to run your first translation using Palabra AI's TypeScript library.

1. Get a local audio track

Provide a function that returns a MediaStreamTrack from the user's microphone. The library exports a getLocalAudioTrack helper for this:

import { getLocalAudioTrack } from '@palabra-ai/translator';
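
If you need more control over capture, handleOriginalTrack can be any function that returns (or resolves to) a MediaStreamTrack. A minimal sketch using the standard getUserMedia API; the constraints shown are illustrative, not required by the library:

// Hypothetical custom track provider; any function resolving to a
// MediaStreamTrack can be passed as handleOriginalTrack.
const getMicrophoneTrack = async (): Promise<MediaStreamTrack> => {
  const stream = await navigator.mediaDevices.getUserMedia({
    audio: { echoCancellation: true, noiseSuppression: true },
  });
  return stream.getAudioTracks()[0];
};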

2. Initialize the client

import { PalabraClient } from '@palabra-ai/translator';

const client = new PalabraClient({
  auth: {
    clientId: 'YOUR_CLIENT_ID',
    clientSecret: 'YOUR_CLIENT_SECRET',
  },
  translateFrom: 'en', // Source language code
  translateTo: 'es',   // Target language code
  handleOriginalTrack: getLocalAudioTrack, // Function returning a MediaStreamTrack
});

3. Start translation

await client.startTranslation();

4. Play translated audio

await client.startPlayback();

5. Stop translation and playback

await client.stopPlayback();
await client.stopTranslation();

6. Change the audio output device

Note

Audio output device switching is supported only in browsers that implement setSinkId(). In unsupported browsers like Safari, this method will have no effect.

await client.changeAudioOutputDevice('deviceId');
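
Because setSinkId() support varies, you may want to feature-detect it before offering device switching. A small sketch using standard Web APIs (picking the first available output device is just an example):

// Guard against browsers (e.g. Safari) that do not implement setSinkId().
if ('setSinkId' in HTMLMediaElement.prototype) {
  const devices = await navigator.mediaDevices.enumerateDevices();
  const output = devices.find((d) => d.kind === 'audiooutput');
  if (output) {
    await client.changeAudioOutputDevice(output.deviceId);
  }
}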

7. Change the volume of a language's audio track

Note

Volume should be a value between 0.0 and 1.0, where 0.0 is muted and 1.0 is maximum volume.

client.setVolume('es', 0.7);
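
To drive setVolume from a UI control, map the control's range onto 0.0-1.0. A sketch assuming a hypothetical <input type="range" id="es-volume" min="0" max="100"> element:

// Hypothetical slider with values 0-100, mapped to the 0.0-1.0 range.
const slider = document.getElementById('es-volume') as HTMLInputElement;
slider.addEventListener('input', () => {
  client.setVolume('es', Number(slider.value) / 100);
});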

Note

Browsers may restrict audio playback initiated without user interaction. Each browser may also define user interaction differently. (For example, Safari on iOS is restrictive.)
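
A common way to satisfy these autoplay policies is to start playback from a user gesture. A sketch assuming a hypothetical button with the id "start":

// Starting playback inside a click handler counts as user interaction
// in most browsers.
document.getElementById('start')?.addEventListener('click', async () => {
  await client.startTranslation();
  await client.startPlayback();
});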

API Reference

See TypeScript types for full API documentation.

PalabraClient

The PalabraClient class is the main entry point for integration with the Palabra API. It manages connection setup, session lifecycle, audio handling, transcription and translation events, and playback of translated speech.

Features

Key features of PalabraClient:

  • Connects to the Palabra API
  • Manages translation sessions
  • Manages language settings
  • Emits events for transcription and translation results
  • Plays translated audio in the browser
  • Manages target languages and session configuration

Constructor

new PalabraClient(options: PalabraClientData)

Parameters

  • auth: Authentication data (either clientId/clientSecret or a userToken; see the sketch after this list)
  • translateFrom: Source language code (e.g., 'en')
  • translateTo: Target language code (e.g., 'es')
  • handleOriginalTrack: Function returning the original audio track (MediaStreamTrack)
  • apiBaseUrl (optional): API URL (defaults to Palabra cloud)
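
As a sketch of the alternative auth shape, assuming you already have a user token (how tokens are issued is outside the scope of this example):

// Same constructor, authenticating with a user token instead of
// clientId/clientSecret. 'YOUR_USER_TOKEN' is a placeholder.
const tokenClient = new PalabraClient({
  auth: { userToken: 'YOUR_USER_TOKEN' },
  translateFrom: 'en',
  translateTo: 'es',
  handleOriginalTrack: getLocalAudioTrack,
});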

Public Methods

  • startTranslation(): Promise<boolean>
    Starts a translation session and connects the audio stream. Returns true on success.

  • stopTranslation(): Promise<void>
    Stops a translation session and disconnects the transport.

  • startPlayback(): Promise<void>
    Enables playback of translated audio in the browser.

  • stopPlayback(): Promise<void>
    Stops playback of translated audio.

  • setTranslateFrom(langCode: SourceLangCode): Promise<void>
    Changes the source language for translation on the fly.

  • setTranslateTo(langCode: TargetLangCode): Promise<void>
    Changes the target language for translation on the fly.

  • addTranslationTarget(langCode: TargetLangCode): Promise<void>
    Adds a target language for translation (see the sketch after this list).

  • removeTranslationTarget(langCode: TargetLangCode | TargetLangCode[]): Promise<void>
    Removes one or more target languages from translation.

  • muteOriginalTrack(): void
    Mutes the original audio track (microphone).

  • unmuteOriginalTrack(): void
    Unmutes the original audio track (microphone).

  • setVolume(language: string, volume: number): void
    Sets the volume of the audio track for the given language. Volume must be between 0.0 (muted) and 1.0 (maximum).

  • changeAudioOutputDevice(deviceId: string): Promise<void>
    Changes the audio output device.

    Note: Audio output device switching is supported only in browsers that implement setSinkId(). In unsupported browsers like Safari, this method will have no effect.

  • cleanup(): Promise<void>
    Stops translation and playback, releases resources, and resets the client to its initial state.
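
Putting several of these methods together, a sketch of managing languages and the microphone during a running session (the language codes are examples):

// Change languages and targets on the fly during an active session.
await client.setTranslateFrom('de');        // source is now German
await client.setTranslateTo('fr');          // primary target is now French
await client.addTranslationTarget('es');    // also translate into Spanish
client.setVolume('es', 0.5);                // Spanish track at half volume
await client.removeTranslationTarget('es'); // drop the Spanish target again

// Temporarily mute the microphone without ending the session.
client.muteOriginalTrack();
client.unmuteOriginalTrack();

// Release all resources when you are completely done.
await client.cleanup();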

Events

The PalabraClient class provides events that let you track connection status, receive audio tracks, and handle transcription and translation results.

You can use these events to update your UI, handle errors, and get real-time updates during the speech processing flow, from connecting to receiving translated audio and text.

import { PalabraClient, getLocalAudioTrack, EVENT_REMOTE_TRACKS_UPDATE } from '@palabra-ai/translator';

const client = new PalabraClient({
  auth: {
    clientId: 'YOUR_CLIENT_ID',
    clientSecret: 'YOUR_CLIENT_SECRET',
  },
  translateFrom: 'en', // Source language code
  translateTo: 'es',   // Target language code
  handleOriginalTrack: getLocalAudioTrack, // Function returning a MediaStreamTrack
});

client.on(EVENT_REMOTE_TRACKS_UPDATE, (tracksData) => {
  // Process tracks
});

EVENT_REMOTE_TRACKS_UPDATE - An update has occurred to the set of remote audio tracks. (Use this event to access new audio streams.)
EVENT_ROOM_CONNECTED - The WebRTC room connection was established.
EVENT_ROOM_DISCONNECTED - The WebRTC room connection closed or lost.
EVENT_CONNECTION_STATE_CHANGED - The connection state has changed (e.g., connecting, connected, disconnected).
EVENT_DATA_RECEIVED - Custom data or messages have been received from the server via the WebRTC data channel.
EVENT_START_TRANSLATION - The translation process has started.
EVENT_STOP_TRANSLATION - The translation process has stopped.
EVENT_TRANSCRIPTION_RECEIVED - The full transcription (recognized text) of the source audio has been received.
EVENT_TRANSLATION_RECEIVED - The full, written translation of the source audio has been received.
EVENT_PARTIAL_TRANSLATED_TRANSCRIPTION_RECEIVED - A partial translation of the transcription has been received.
EVENT_PARTIAL_TRANSCRIPTION_RECEIVED - A partial transcription has been received. (Useful for real-time updates; see the caption sketch after this list.)
EVENT_PIPELINE_TIMINGS_RECEIVED - Timing or performance data about the translation pipeline has been received. (Useful for diagnostics and/or analytics.)
EVENT_ERROR_RECEIVED - An error in the translation or streaming process has occurred.
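
To see how these events fit into a caption or transcript flow, here is a minimal sketch that subscribes to the text and error events and logs their payloads (the payload shapes are defined by the library's TypeScript types and are not assumed here):

import {
  PalabraClient,
  getLocalAudioTrack,
  EVENT_PARTIAL_TRANSCRIPTION_RECEIVED,
  EVENT_TRANSCRIPTION_RECEIVED,
  EVENT_TRANSLATION_RECEIVED,
  EVENT_ERROR_RECEIVED,
} from '@palabra-ai/translator';

const client = new PalabraClient({
  auth: { clientId: 'YOUR_CLIENT_ID', clientSecret: 'YOUR_CLIENT_SECRET' },
  translateFrom: 'en',
  translateTo: 'es',
  handleOriginalTrack: getLocalAudioTrack,
});

// Inspect the payloads to map them onto your caption UI.
client.on(EVENT_PARTIAL_TRANSCRIPTION_RECEIVED, (data) => {
  console.log('partial transcription:', data);
});
client.on(EVENT_TRANSCRIPTION_RECEIVED, (data) => {
  console.log('final transcription:', data);
});
client.on(EVENT_TRANSLATION_RECEIVED, (data) => {
  console.log('written translation:', data);
});
client.on(EVENT_ERROR_RECEIVED, (error) => {
  console.error('translation error:', error);
});

await client.startTranslation();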

Usage Examples

Basic: Start translation and playback

import { PalabraClient, getLocalAudioTrack } from '@palabra-ai/translator';

// 1. Create the client
const client = new PalabraClient({
  auth: {
    clientId: 'YOUR_CLIENT_ID',
    clientSecret: 'YOUR_CLIENT_SECRET',
  },
  translateFrom: 'en',
  translateTo: 'es',
  handleOriginalTrack: getLocalAudioTrack,
});

// 2. Start translation session
await client.startTranslation();

// 3. Start playback of translated audio
await client.startPlayback();

// 4. Stop translation and playback when done
await client.stopPlayback();
await client.stopTranslation();

Advanced: Output translated audio to a custom <audio> element

Listen for the EVENT_REMOTE_TRACKS_UPDATE event to get the translated audio tracks and play them in your own <audio> element:

import { PalabraClient, getLocalAudioTrack } from '@palabra-ai/translator';
import { EVENT_REMOTE_TRACKS_UPDATE } from '@palabra-ai/translator';

// Create an <audio> element in your code
const audioElement = new Audio();

const client = new PalabraClient({
  auth: {
    clientId: 'YOUR_CLIENT_ID',
    clientSecret: 'YOUR_CLIENT_SECRET',
  },
  translateFrom: 'en',
  translateTo: 'fr',
  handleOriginalTrack: getLocalAudioTrack,
});

client.on(EVENT_REMOTE_TRACKS_UPDATE, (tracks) => {
  // tracks - RemoteTrackInfo[]
  // tracks is an array of { track: MediaStreamTrack, ... }
  audioElement.srcObject = new MediaStream(tracks.map(t => t.track));
  audioElement.play();
});

// Start translation as usual
await client.startTranslation();

// Handle playback
const stopPlayback = () => {
  audioElement.pause();
};

const startPlayback = () => {
  audioElement.play();
};

The examples above show how to integrate Palabra's real-time translation into any web application and control audio output as needed.

Monorepo Structure

This project contains two main packages:

  • @palabra-ai/translator: The main library package (packages/lib)
  • dev-app: A Vue.js development application for testing the library (packages/dev-app)

Development Setup

Prerequisites

Node.js and pnpm must be installed.

Installation

# Install dependencies for all packages
pnpm install

Running in Development Mode

Library Development

Run the library in watch mode (auto-rebuild on changes):

cd packages/lib
pnpm dev

Development Application

Run the dev app with hot-reload:

cd packages/dev-app
pnpm dev

Open http://localhost:5173 in your browser to view the dev app.

More Commands

Library Package (packages/lib)

  • pnpm build - Build the library
  • pnpm test - Run tests
  • pnpm lint - Run linting

Development App (packages/dev-app)

  • pnpm build - Build for production
  • pnpm dev - Run dev app

Supported Languages

Speech Recognition Languages

Arabic (AR), Chinese (ZH), Czech (CS), Danish (DA), Dutch (NL), English (EN), Finnish (FI), French (FR), German (DE), Greek (EL), Hebrew (HE), Hungarian (HU), Italian (IT), Japanese (JA), Korean (KO), Polish (PL), Portuguese (PT), Russian (RU), Spanish (ES), Turkish (TR), Ukrainian (UK)

Translation Languages

Arabic (AR), Bulgarian (BG), Chinese Mandarin (ZH), Czech (CS), Danish (DA), Dutch (NL), English UK (EN_GB), English US (EN_US), Finnish (FI), French (FR), German (DE), Greek (EL), Hebrew (HE), Hungarian (HU), Indonesian (ID), Italian (IT), Japanese (JA), Korean (KO), Polish (PL), Portuguese (PT), Portuguese Brazilian (PT_BR), Romanian (RO), Russian (RU), Slovak (SK), Spanish (ES), Spanish Mexican (ES_MX), Swedish (SV), Turkish (TR), Ukrainian (UK), Vietnamese (VN)

License

MIT
