GigaChat (Sber) Language Model Fine-Tuning

We design and deploy artificial intelligence systems: from prototype to production-ready solutions. Our team combines expertise in machine learning, data engineering and MLOps to make AI work not in the lab, but in real business.
Showing 1 of 1 servicesAll 1566 services
GigaChat (Sber) Language Model Fine-Tuning
Complex
from 1 week to 3 months
FAQ
AI Development Areas
AI Solution Development Stages
Latest works
  • image_website-b2b-advance_0.png
    B2B ADVANCE company website development
    1212
  • image_web-applications_feedme_466_0.webp
    Development of a web application for FEEDME
    1161
  • image_websites_belfingroup_462_0.webp
    Website development for BELFINGROUP
    852
  • image_ecommerce_furnoro_435_0.webp
    Development of an online store for the company FURNORO
    1041
  • image_logo-advance_0.png
    B2B Advance company logo design
    561
  • image_crm_enviok_479_0.webp
    Development of a web application for Enviok
    822

Fine-Tuning GigaChat Language Model (Sber)

GigaChat is Sber's language model, available through GigaChat API. Model fine-tuning is implemented within the corporate segment (GigaChat Pro / GigaChat Enterprise) and allows you to adapt the model to your business specifics. The model is originally oriented toward Russian-language market, making it a priority choice for tasks involving Russian content, Russian accounting documents (RSBU), and interactions within Russian legal framework.

Access to GigaChat Fine-Tuning

GigaChat API is provided through Sber Cloud platform. For fine-tuning, an enterprise contract and access to appropriate tier are required. Standard API access provides base models GigaChat Lite, GigaChat Pro, and GigaChat Max; fine-tuning is available through GigaChat Enterprise or special request to corporate department.

Authentication via OAuth 2.0:

import requests
import base64

credentials = base64.b64encode(
    f"{client_id}:{client_secret}".encode()
).decode()

response = requests.post(
    "https://ngw.devices.sberbank.ru:9443/api/v2/oauth",
    headers={
        "Authorization": f"Basic {credentials}",
        "RqUID": "unique-request-id",
    },
    data={"scope": "GIGACHAT_API_CORP"}
)
access_token = response.json()["access_token"]

Data Format for Fine-Tuning

GigaChat accepts training data as dialogue pairs in JSON format. Each example is a complete conversation context with roles:

{
  "messages": [
    {
      "role": "system",
      "content": "You are an insurance company assistant. Help customers understand insurance product conditions."
    },
    {
      "role": "user",
      "content": "What is covered under CASCO insurance for accidents caused by third parties?"
    },
    {
      "role": "assistant",
      "content": "For accidents caused by third parties under CASCO: collision damage regardless of fault is covered, restoration repair costs are covered..."
    }
  ]
}

Advantages for Russian Market

Financial regulatory compliance: GigaChat is trained on Russian regulatory documents, including Central Bank guidelines, RSBU standards, tax reporting forms. This reduces data volume needed for financial task fine-tuning.

Medical documentation: Ministry of Health templates, clinical guidelines, ICD-10, Social Fund integration. Native support for Russian medical terminology.

Government services: work with SMEV documents, department formats, specific public sector terminology.

Data security: data remains within Sber Cloud perimeter, critical for banks, insurance companies, government organizations.

Practical Example: Assistant for Banking Chatbot

Task: fine-tune GigaChat Pro for processing incoming requests to retail bank — answering product questions, routing complex requests to operators.

Dataset: 3500 dialogues from real correspondence (anonymized), covering 45 topics (loans, deposits, cards, transfers, transaction disputes).

Data preparation stages:

  1. Extract dialogues from CRM
  2. Depersonalize (replace names, card numbers, phones)
  3. Filter dialogues with negative outcomes (customer got no answer)
  4. Mark complex cases requiring routing
  5. Balance by topics (no more than 15% from one category)

Results:

  • CSAT (customer satisfaction with bot answer): 3.2 → 4.1 out of 5
  • Correct routing percentage: 71% → 94%
  • Escalation rate reduction (requests passed to operator): 61% → 38%
  • Average dialogue time: -22%

GigaChat Fine-Tuning Limitations

  • Closed weights: like GPT-4o, you get hosted endpoint without weight access
  • Infrastructure lock-in: only Sber Cloud, no on-premise deployment
  • Corporate threshold: fine-tuning unavailable on free tiers
  • Context size: 32K tokens — smaller than Qwen2.5 or Claude 3.5 Sonnet

Comparison with Related Solutions

Parameter GigaChat YandexGPT Llama (self-hosted)
Ecosystem Sber Cloud Yandex Cloud Arbitrary
Russian language Excellent Excellent Good
152-FZ compliance Yes Yes Yes (on-prem)
Integrations SberBusiness API Yandex Tracker/Telemost REST/OpenAI-compat
Fine-tuning access Enterprise Enterprise Open

Project Timeline

  • Task audit, dataset evaluation: 3–5 days
  • Data preparation and depersonalization: 2–4 weeks
  • Iterative training: 1–2 weeks
  • Testing, A/B: 1 week
  • Integration, monitoring: 1–2 weeks
  • Total: 5–9 weeks