Customer Request Classification by Topic

We design and deploy artificial intelligence systems: from prototype to production-ready solutions. Our team combines expertise in machine learning, data engineering and MLOps to make AI work not in the lab, but in real business.
Showing 1 of 1 servicesAll 1566 services
Customer Request Classification by Topic
Medium
~3-5 business days
FAQ
AI Development Areas
AI Solution Development Stages
Latest works
  • image_website-b2b-advance_0.png
    B2B ADVANCE company website development
    1212
  • image_web-applications_feedme_466_0.webp
    Development of a web application for FEEDME
    1161
  • image_websites_belfingroup_462_0.webp
    Website development for BELFINGROUP
    852
  • image_ecommerce_furnoro_435_0.webp
    Development of an online store for the company FURNORO
    1041
  • image_logo-advance_0.png
    B2B Advance company logo design
    561
  • image_crm_enviok_479_0.webp
    Development of a web application for Enviok
    822

Implementation of Customer Request Topic Classification

Topic classification of requests — automatic determination of the subject of incoming customer request. Incoming stream of emails, chats, calls (after transcription) is distributed across topic categories without operator involvement.

Design of Topic Taxonomy

Before training the model, it's necessary to create the correct hierarchy. Common mistakes: too few classes (everything falls into "Other"), too many (500+ classes train poorly), blurry boundaries between classes.

Two-level hierarchy works well:

Technical Issues
    ├── Connection Problem
    ├── Slow Speed
    └── Personal Account Errors
Financial Questions
    ├── Payment and Tariffs
    ├── Refunds
    └── Debt
Contract Questions
    ...

First level: 5–15 categories — high recall. Second level: detail for routing to the right specialist.

Training the Classifier

Stage 1 — data collection: 200–500 examples per class from request archive with historical operator labels. Important: operators often label inaccurately → quality control stage for labeling is needed.

Stage 2 — baseline: TF-IDF + Logistic Regression. Trains quickly, interpretable, good for starting. Accuracy on typical tasks: 82–88%.

Stage 3 — BERT fine-tuning: DeepPavlov/rubert-base-cased. Accuracy: 90–95% with 500+ examples per class.

Stage 4 — LLM zero/few-shot: GPT-4o-mini with class descriptions — for new categories without retraining.

Handling Multi-Topic Requests

"My connection doesn't work and I want to change my tariff plan" — two classes. Strategies:

  • Multilabel classification (sigmoid + threshold 0.5)
  • Split by sentences, classify each
  • Select primary + secondary class

Monitoring and Drift

Topic distribution changes during sales, seasons, incidents. Monitoring: chi-square test for deviation from historical distribution. Alert on significant deviation → reassess classifier relevance.