Tõlk.fm help

Frequently asked questions

Quick answers about getting started, event modes, languages, audio, billing, and privacy. For deeper guidance, see the Help Center.

Still have questions?Email hello@tolk.fm
Billing questionsEmail billing@tolk.fm
Privacy & legalEmail privacy@tolk.fm

Getting started

What is Tõlk.fm?

Tõlk.fm is a live AI translation platform for events. Organisers create an event room, stream or upload source audio, choose one or more target languages, and share a QR code. Audience members scan the code, pick their preferred language, and listen to a real-time translated audio stream — no app download required.

Who is Tõlk.fm for?

Tõlk.fm is designed for event organisers running multilingual conferences, summits, workshops, tours, panel discussions, and corporate events. Anyone who needs to address an audience in more than one language simultaneously can use Tõlk.fm in place of, or alongside, human interpretation.

Do I need to install anything?

No. Organisers manage events through the Tõlk.fm web dashboard at tolk.fm. Audience members open the join link in any modern mobile browser — no app download, no login, no account needed.

How do I create my first event?

Sign in to your Tõlk.fm account, click New event in the dashboard, choose your event mode (live translation, scheduled playback, or hybrid), select your source language and up to eight target languages, and save. You will receive a shareable join link and a QR code that you can display on screen or print.

How many listeners can join at the same time?

There is no hard cap on simultaneous listeners per channel. Tõlk.fm uses a broadcast architecture: the AI generates one translated audio stream per target language and distributes it to all listeners on that channel. Adding more listeners does not increase translation cost.

Can I test Tõlk.fm before my event?

Yes. You can run a test event at any time from the dashboard. For live translation, test in the same room and with the same microphone or mixer you plan to use at the real event. Pay attention to latency, audio quality, and language accuracy before going live. Credit usage accrues during test runs at the same rate as production events.

Always rehearse with your actual audio source. Microphone placement and room acoustics strongly affect translation quality.

Event modes

What is live translation mode?

In live translation mode, source audio is captured in real time — from a microphone, mixer, or other audio input — and streamed to the AI translation engine. The translated audio is broadcast to listeners within a few seconds. This mode is ideal for conferences, keynotes, and any event where the content is not known in advance.

What is scheduled playback mode?

In scheduled playback mode, the organiser uploads a pre-recorded audio or video file before the event. Tõlk.fm prepares translated audio tracks in all target languages. At the scheduled start time, listeners on each language channel hear the translated version of the recording in perfect sync.

What is hybrid mode?

Hybrid mode combines prepared translated tracks for pre-recorded segments with live AI translation for live segments such as Q&A, panel discussions, or speaker remarks.

What are audience capture modes?

Audience capture modes control how speaker audio is collected:

  • Conference / broadcast — Audio is captured centrally through the organiser's mixer or microphone. This is the default and works for most events.
  • Open floor — Audience members can request the microphone from their phone. One person speaks at a time. The organiser or a moderator approves requests.
  • Hybrid panel — Panelists use dedicated speaker-monitor devices while the floor remains open to the audience.
What is speaker monitor mode?

Speaker monitor mode gives the event speaker or interpreter a dedicated view showing live translations as they are generated. The speaker can also approve or deny open-mic requests from the audience.

Can I switch event mode or add languages after an event has started?

You can add or remove target languages during a live event from the runtime controls in the dashboard. You can also switch between translation engines (OpenAI and Gemini) while the event is running. Changing the fundamental event mode requires ending and recreating the event.

Languages & translation engines

Which languages does Tõlk.fm support?

The available languages depend on the translation engine chosen for the event.

OpenAI realtime translate — 13 target languages

  • EnglishEnglish
  • SpanishEspañol
  • PortuguesePortuguês
  • FrenchFrançais
  • Japanese日本語
  • RussianРусский
  • Chinese中文
  • GermanDeutsch
  • Korean한국어
  • Hindiहिन्दी
  • IndonesianBahasa Indonesia
  • VietnameseTiếng Việt
  • ItalianItaliano

Google Gemini live translate — 81 target languages (includes all OpenAI languages)

  • AfrikaansAfrikaans
  • AkanAkan
  • AlbanianShqip
  • Amharicአማርኛ
  • Arabicالعربية
  • ArmenianՀայերեն
  • AzerbaijaniAzərbaycan dili
  • BasqueEuskara
  • BelarusianБеларуская
  • Bengaliবাংলা
  • BulgarianБългарски
  • Burmese (Myanmar)မြန်မာစာ
  • CatalanCatalà
  • Chinese中文
  • Chinese (Simplified)简体中文
  • Chinese (Traditional)繁體中文
  • CroatianHrvatski
  • CzechČeština
  • DanishDansk
  • DutchNederlands
  • EnglishEnglish
  • EstonianEesti
  • FilipinoFilipino
  • FinnishSuomi
  • FrenchFrançais
  • GalicianGalego
  • Georgianქართული
  • GermanDeutsch
  • GreekΕλληνικά
  • Gujaratiગુજરાતી
  • HausaHausa
  • Hebrewעברית
  • Hindiहिन्दी
  • HungarianMagyar
  • IcelandicÍslenska
  • IndonesianBahasa Indonesia
  • ItalianItaliano
  • Japanese日本語
  • JavaneseBasa Jawa
  • Kannadaಕನ್ನಡ
  • KazakhҚазақ тілі
  • Khmerខ្មែរ
  • KinyarwandaIkinyarwanda
  • Korean한국어
  • Laoລາວ
  • LatvianLatviešu
  • LithuanianLietuvių
  • MacedonianМакедонски
  • MalayBahasa Melayu
  • Malayalamമലയാളം
  • Marathiमराठी
  • MongolianМонгол
  • Nepaliनेपाली
  • NorwegianNorsk
  • Norwegian (Bokmål)Norsk bokmål
  • Persianفارسی
  • PolishPolski
  • PortuguesePortuguês
  • Portuguese (Brazil)Português (Brasil)
  • Portuguese (Portugal)Português (Portugal)
  • Punjabiਪੰਜਾਬੀ
  • RomanianRomână
  • RussianРусский
  • SerbianСрпски
  • Sindhiسنڌي
  • Sinhalaසිංහල
  • SlovakSlovenčina
  • SlovenianSlovenščina
  • SpanishEspañol
  • SundaneseBasa Sunda
  • SwahiliKiswahili
  • SwedishSvenska
  • Tamilதமிழ்
  • Teluguతెలుగు
  • Thaiไทย
  • TurkishTürkçe
  • UkrainianУкраїнська
  • Urduاردو
  • UzbekOʻzbekcha
  • VietnameseTiếng Việt
  • ZuluisiZulu

If your target language is not in the OpenAI list, switch to Gemini in the event settings to access the full catalogue.

What is the difference between OpenAI and Gemini for translation?

Both engines provide real-time speech-to-speech translation, but they differ in several ways:

FeatureOpenAIGemini
Languages1381
Voice styleSelectable synthetic voice, adjustable speedPreserves speaker's original voice and intonation
TranscriptionSource-language captionsTarget-language captions
Best forClear synthetic output, common languagesNatural voice, rare or regional languages

You can switch engines live during an event. Test both before your event to choose the one that best fits your audience and content style.

How many target languages can I run at the same time?

You can run up to eight target language channels simultaneously per event. Each active channel is billed independently.

How accurate is the translation?

Translation accuracy depends on audio quality, speaking pace, accents, technical vocabulary, and background noise. For most conference and presentation contexts, AI translation is clear and understandable. However, AI outputs can contain omissions, mistranslations, or unsuitable phrasing.

Do not use Tõlk.fm as the sole translation source for medical, legal, financial, safety, emergency, or other high-stakes decisions. Always provide human alternatives where accuracy is critical or legally required.

Can I customise the translation voice?

When using the OpenAI engine, you can select from a range of synthetic voices and adjust playback speed. When using Gemini, the engine preserves the speaker's original voice characteristics.

Audio & sound setup

What audio input does Tõlk.fm use?

Tõlk.fm captures audio from the organiser's device microphone or from a connected external audio interface, USB mixer, or audio adapter. For large venues, connect a direct feed from the PA or mixing desk.

How much latency should I expect?

Live translation introduces a few seconds of latency. The AI engine needs to hear a phrase before it can translate it. Listeners should understand that translated audio will trail the speaker slightly.

Why does translation quality drop when multiple people speak at once?

AI translation is optimised for one speaker at a time. When multiple voices overlap, accuracy may drop. Use open floor or hybrid panel mode to give the floor to one speaker at a time.

What audio formats are supported for scheduled playback?

For scheduled playback, you can upload standard audio and video formats. The source audio is extracted, translated, and prepared before the scheduled start time.

Can listeners adjust the volume or mix original and translated audio?

Listeners receive the translated audio channel they selected. Volume is controlled through their device.

Listeners & joining

How do audience members join an event?

Share the join link or QR code with your audience. Listeners open the link, choose their preferred language, and start listening — no account or app download required.

Can I create a custom join URL for my event?

Yes. In event settings you can define a custom join slug — for example tolk.fm/my-conference. Custom slugs must be unique across all events.

Do listeners need headphones?

Headphones are strongly recommended to prevent feedback loops that degrade translation quality for everyone.

How many listeners can an event support?

There is no listener limit. All listeners on the same language channel receive the same broadcast stream.

Can listeners speak during the event?

With Open floor or Hybrid panel mode, listeners can request the microphone. The organiser or speaker approves the request and that audio is translated and broadcast.

What happens if a listener loses internet connection mid-event?

The listener's browser will attempt to reconnect automatically. Refreshing the join page reconnects them to the live stream at the current position.

Billing & credits

How does billing work?

Tõlk.fm charges $0.09 per minute per active translation channel, with a minimum charge of $15.00 per event regardless of actual usage.

How do I add funds to my wallet?

Go to the Billing page in your dashboard and choose a top-up plan or enter a custom amount (minimum $5). Payments are processed by Stripe. Funds appear in your wallet immediately after payment.

PlanYou payWallet receivesBonus
Starter$12$12
Pro$108$120+$12
Scale$480$600+$120
CustomAny amount ≥$5Same amount
When is my wallet charged?

Your wallet is charged at the end of each event based on actual minutes used and active language channels, subject to the $15.00 per-event minimum.

What happens if my wallet balance runs out during an event?

Active translation channels will stop when the wallet balance reaches zero. Top up before each event with enough funds to cover the expected duration.

How do I estimate the cost of my event?

Cost = duration in minutes × number of target languages × $0.09, with a $15.00 minimum. Example: 90-minute talk with 3 languages = $24.30.

Are there refunds?

Refunds are provided where required by law or approved case-by-case. Contact billing@tolk.fm with any billing concerns.

Privacy & data

Does Tõlk.fm store the audio from my events?

Live audio is processed transiently for real-time translation and is not retained by default. Uploaded media for scheduled playback is stored until you delete it or close your workspace.

Is event audio shared with OpenAI or Google?

Yes — audio is sent to the AI provider you select (OpenAI or Google Gemini) for translation. We use business-tier APIs under which your data is not used to train their models.

Do listeners need to create an account or share personal data?

No. Listeners join through a link or QR code without registering. Session and connectivity logs are collected to provide the stream and troubleshoot issues.

Is Tõlk.fm GDPR compliant?

Tõlk.fm is operated by Pixibit OÜ, an Estonian company subject to GDPR. Organisers are responsible for ensuring their use complies with applicable local laws.

What should I tell my event attendees about AI translation?

We recommend informing attendees:

  • Audio from this event is being streamed to AI services to produce real-time translation.
  • Translation is generated by an automated system and may not be perfectly accurate.
  • Listeners who use the open-mic feature should be aware their voice is also being translated and broadcast.

Local laws in some jurisdictions require advance disclosure or consent before recording or streaming audience audio. Check your local requirements before the event.

Developers & API

Does Tõlk.fm have a public API?

Yes. The Tõlk.fm Public API is a REST interface for partner products to programmatically create translation events, launch live sessions, stop sessions, and retrieve public listener links. API keys are profile-scoped, managed in the Developers dashboard, and billed against your existing prepaid wallet. Documentation is at tolk.fm/developers/docs.

What is streamUrl in the API response?

streamUrl is the public browser listener page for the live event (for example https://tolk.fm/your-code). For broadcast tools, use directStreamLinks from launch or POST /v1/events/{id}/direct-stream-links — WebSocket PCM16, HLS/AAC, or RTMP pull URLs on the media worker.

Can I use Tõlk.fm with YouTube, Vimeo, OBS, or vMix?

Yes. Route program audio into Tõlk.fm, share streamUrl with browser listeners, and mint directStreamLinks (WebSocket, HLS, RTMP) for OBS, vMix, or Castr. YouTube/Vimeo do not accept third-party translation tracks natively — run translated audio in parallel via HLS/RTMP or listener links.

Troubleshooting

Translation stopped or froze mid-event — what do I do?

First, check your internet connection and your wallet balance. If both are fine, try the following:

  1. Refresh the organiser control room page.
  2. Stop and restart the affected language channel.
  3. Switch to the other translation engine (OpenAI ↔ Gemini) to see if the issue is provider-side.
  4. Check the event error log at the bottom of the runtime view.

If the problem persists, email hello@tolk.fm with the event ID shown in the dashboard URL.

Listeners say the audio is choppy or cutting out — how do I fix it?

Audio instability is usually caused by one of:

  • Weak listener Wi-Fi or mobile data — advise listeners to move closer to the Wi-Fi access point.
  • Organiser's upload bandwidth — the audio capture device needs a stable connection to the Tõlk.fm media worker.
  • Too many devices on the same Wi-Fi — ensure the event Wi-Fi is separate from the public guest network.
The translation is accurate but very delayed — is that normal?

A few seconds of latency is normal for live AI translation. Very high latency (more than 10–15 seconds) may indicate high network latency, overloaded AI provider capacity, or an unusually slow audio stream.

  • High network latency between the organiser's device and our media worker
  • Overloaded AI provider capacity (rare)
  • An unusually slow or noisy audio stream requiring more buffering

Try switching engines or restarting the channel. If the delay is consistently above 10 seconds, contact support with the event ID.

The microphone is not being detected in the browser — what should I check?

Make sure you have granted microphone permissions to the Tõlk.fm site in your browser. If you accidentally denied permission:

  • In Chrome: click the lock icon in the address bar → Microphone → Allow.
  • In Firefox: click the lock icon → Clear permissions → Reload and try again.
  • In Safari: open Settings → Websites → Microphone → allow tolk.fm.
I see a 'Wallet balance too low to start' error — what should I do?

Your wallet needs to have at least the event minimum ($15.00) available before a live event can start. Go to the Billing page and top up your wallet, then return to the event and try starting again.

Where can I find my event ID for support requests?

The event ID appears in the URL of the event control room page: tolk.fm/events/[event-id].

Developers & integrations

Bring Tõlk.fm into your applications

The Tõlk.fm Public API lets partner products integrate real-time voice translation into their own software and automated workflows — without building interpretation infrastructure, media workers, or listener delivery yourself. Expand how and where multilingual access shows up for your users and clients.

Programmatic control

Create events, add language channels, launch live translation, and stop sessions from your backend — no custom media infrastructure required.

Public listener links

Each launch returns a streamUrl: the public browser page where listeners pick a language and hear translated audio in real time.

Automated workflows

Wire Tõlk.fm into registration systems, venue apps, AV runbooks, and partner products so multilingual access ships with your own UX.

LIVESTREAMS & PRODUCTION

Pair your main program feed on YouTube, Vimeo, or your encoder with public listener links from the API, or mint direct WebSocket, HLS, and RTMP pull URLs for OBS, vMix, Castr, and custom production workflows. Route program audio into Tõlk.fm from a mixer, interface, or production feed.

Still have questions?

We're happy to help

For general enquiries email hello@tolk.fm, or explore the Help Center for in-depth guides on events, billing, audio setup, and privacy.