Custom Vocabulary Implementation for STT System

We design and deploy artificial intelligence systems: from prototype to production-ready solutions. Our team combines expertise in machine learning, data engineering and MLOps to make AI work not in the lab, but in real business.
Showing 1 of 1 servicesAll 1566 services
Custom Vocabulary Implementation for STT System
Simple
from 1 business day to 3 business days
FAQ
AI Development Areas
AI Solution Development Stages
Latest works
  • image_website-b2b-advance_0.png
    B2B ADVANCE company website development
    1212
  • image_web-applications_feedme_466_0.webp
    Development of a web application for FEEDME
    1161
  • image_websites_belfingroup_462_0.webp
    Website development for BELFINGROUP
    852
  • image_ecommerce_furnoro_435_0.webp
    Development of an online store for the company FURNORO
    1041
  • image_logo-advance_0.png
    B2B Advance company logo design
    561
  • image_crm_enviok_479_0.webp
    Development of a web application for Enviok
    822

Implementing a custom vocabulary for the STT system. A custom vocabulary is the fastest way to improve recognition of specific terms, names, and abbreviations without retraining the model. It acts as a hint to the STT engine: "pay special attention to these words." ### Implementation for the main AWS Transcribe Custom Vocabulary providers:

import boto3

transcribe = boto3.client('transcribe')

# Создаём словарь из файла (S3)
transcribe.create_vocabulary(
    VocabularyName='corporate-terms-v1',
    LanguageCode='ru-RU',
    VocabularyFileUri='s3://my-bucket/vocabulary.txt'
)

# Формат файла vocabulary.txt:
# Phrase\tSoundsLike\tIPA\tDisplayAs
# Б-Ф-И-О\tбэ эф и о\t\tБФИО
# ИНН\tин эн эн\t\tИНН
```**Azure Custom Speech**:```python
# Добавляем domain adaptation data через Azure Portal или REST API
# Поддерживает: pronunciation dictionary, phrase list
import requests

phrase_list = {
    "kind": "PhraseList",
    "locale": "ru-RU",
    "phrases": ["ОГРН", "СНИЛС", "КПП", "расчётный счёт"]
}
```**faster-whisper with hints via initial prompt**:```python
model = WhisperModel("large-v3", device="cuda")

# Начальный промпт помогает модели ориентироваться на нужную лексику
initial_prompt = "ИНН, ОГРН, СНИЛС, КПП, расчётный счёт, генеральный директор."

segments, _ = model.transcribe(
    audio,
    initial_prompt=initial_prompt,
    language="ru"
)
```The initial_prompt method works unreliably for long files—the prompt is processed only for the first window. ### Dictionary maintenance - Dictionary versioning (v1, v2...) - Automatic updates when new terms appear - A/B testing of versions on representative audio Timeframe: 1–2 days for basic integration, including dictionary filling.