Language (NLP)

Text Analytics, NER, sentiment, classification, translation, and Q&A with Azure Cognitive Services.


Core services

Text Analytics

Detect sentiment, language, key phrases, and PII, and surface insights from text and documents.

Named Entity Recognition (NER)

Extract entities such as people, organizations, places, tax IDs, IBANs (where supported).

Text classification

Ticket routing, moderation, and content categorization (single or multi‑label).
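The difference between single-label and multi-label output can be sketched in a few lines. This is a minimal illustration, assuming the classifier returns one score per category. The function names, the 0.5 threshold, and the sample scores are hypothetical and not part of any Azure SDK.

```python
from typing import Dict, List


def pick_single_label(scores: Dict[str, float]) -> str:
    """Single-label: return the category with the highest score."""
    return max(scores, key=scores.get)


def pick_multi_labels(scores: Dict[str, float], threshold: float = 0.5) -> List[str]:
    """Multi-label: return every category at or above the threshold, best first."""
    kept = [label for label, score in scores.items() if score >= threshold]
    return sorted(kept, key=lambda label: scores[label], reverse=True)


# Hypothetical scores for one support ticket.
ticket_scores = {"billing": 0.82, "outage": 0.61, "sales": 0.07}

routed_queue = pick_single_label(ticket_scores)   # one queue per ticket
ticket_tags = pick_multi_labels(ticket_scores)    # several tags per ticket
```

Routing usually wants exactly one destination, while tagging and moderation often benefit from several labels; the threshold should be tuned on labeled samples rather than taken from this sketch.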

Translation

Neural translation in real‑time with custom glossaries and format preservation.

Question Answering

Build knowledge bases from FAQs/documents and enable Q&A in chat and portals.

Integration patterns

APIs & SDKs

Call the regional endpoint of your resource and authenticate with a subscription key or a role assignment. Text Analytics example:

POST /text/analytics/v3.1/analyze
Ocp-Apim-Subscription-Key: <key>
Content-Type: application/json
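As a rough illustration of the headers above in client code, the sketch below builds (but does not send) a key-authenticated request using only the Python standard library. It assumes the synchronous v3.1 sentiment route and the `documents` body shape; the endpoint host, the helper name `build_sentiment_request`, and the sample text are placeholders, so check the path and API version against your own resource.

```python
import json
import urllib.request
from typing import List


def build_sentiment_request(endpoint: str, key: str, texts: List[str],
                            language: str = "en") -> urllib.request.Request:
    """Build a POST request for sentiment analysis without sending it."""
    # Each document needs a unique id within the request.
    documents = [
        {"id": str(i), "language": language, "text": text}
        for i, text in enumerate(texts, start=1)
    ]
    body = json.dumps({"documents": documents}).encode("utf-8")
    # Assumed route; adjust the version and feature to what your resource supports.
    url = endpoint.rstrip("/") + "/text/analytics/v3.1/sentiment"
    return urllib.request.Request(
        url,
        data=body,
        method="POST",
        headers={
            "Ocp-Apim-Subscription-Key": key,
            "Content-Type": "application/json",
        },
    )


# Placeholder endpoint and key; nothing is sent over the network here.
req = build_sentiment_request(
    "https://example-resource.cognitiveservices.azure.com",
    "<key>",
    ["Great service."],
)
```

Passing `req` to `urllib.request.urlopen` would send it; in production code the official SDK is usually a better fit, since it also handles retries and token-based authentication.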

Data pipelines

Process events with Azure Functions and Service Bus, run batch jobs through Data Factory, and move to Azure ML when you need custom models.
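Whichever trigger is used, a pipeline step typically has to split incoming texts into request-sized batches before calling the service. The sketch below shows one way to do that, assuming a per-request document cap. The default of 10 is a placeholder, since the real limit depends on the feature and API version, and `batch_documents` is a hypothetical helper.

```python
from typing import Dict, Iterable, Iterator, List


def batch_documents(texts: Iterable[str], max_docs: int = 10) -> Iterator[List[Dict[str, str]]]:
    """Yield lists of {"id", "text"} documents, each with at most max_docs items."""
    batch: List[Dict[str, str]] = []
    for i, text in enumerate(texts, start=1):
        # Ids keep counting across batches so results can be traced to inputs.
        batch.append({"id": str(i), "text": text})
        if len(batch) == max_docs:
            yield batch
            batch = []
    if batch:
        # Emit the final, partially filled batch.
        yield batch


# 23 sample texts split with the placeholder limit of 10 per request.
batches = list(batch_documents([f"doc {n}" for n in range(23)], max_docs=10))
batch_sizes = [len(b) for b in batches]
```

In an event-driven setup each batch would map to one function invocation or message; in a Data Factory job the same helper could run inside a custom activity.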

Quick comparison

Service | When to use | Output
Text Analytics | General insights from text | Sentiment, key phrases, language, PII
NER | Extract structured fields | Entities + types + offsets/confidence
Classification | Routing, moderation, categories | Labels (single/multi) with scores
Translation | Localization & multilingual | Translated text + glossaries
Q&A | Self‑service information | Answers from KB/FAQs

Best practices

Data quality

Normalize encoding, remove boilerplate, handle language and domain specifics.

Evaluation

Precision/recall/F1, error analysis and A/B tests on real samples.
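For reference, the three metrics can be computed per label from gold and predicted labels as in the sketch below. The sample labels are made up for illustration, and the helper name is hypothetical; libraries such as scikit-learn offer equivalent functions for larger evaluations.

```python
from typing import Dict, List


def precision_recall_f1(gold: List[str], predicted: List[str], label: str) -> Dict[str, float]:
    """Compute precision, recall, and F1 for one label over paired predictions."""
    pairs = list(zip(gold, predicted))
    tp = sum(1 for g, p in pairs if g == label and p == label)  # correct hits
    fp = sum(1 for g, p in pairs if g != label and p == label)  # false alarms
    fn = sum(1 for g, p in pairs if g == label and p != label)  # misses
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return {"precision": precision, "recall": recall, "f1": f1}


# Made-up evaluation set of five tickets.
gold = ["billing", "billing", "outage", "sales", "billing"]
pred = ["billing", "outage", "outage", "billing", "billing"]
metrics = precision_recall_f1(gold, pred, "billing")
```

Here "billing" has 2 true positives, 1 false positive, and 1 false negative, so precision, recall, and F1 all come out to 2/3.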

Governance

PII masking, logging, rate limiting, and drift/cost monitoring.
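PII masking can be sketched as a post-processing step over detected entity spans. The example below assumes each entity is a dict with `offset` and `length` fields and that offsets count Unicode code points, as Python string indices do; the entity list is hand-written for illustration and is not real service output.

```python
from typing import Dict, List


def mask_entities(text: str, entities: List[Dict], mask_char: str = "*") -> str:
    """Replace each entity span with mask characters of the same length."""
    chars = list(text)
    for entity in entities:
        start = entity["offset"]
        # Clamp the end so a bad span cannot lengthen the text.
        end = min(start + entity["length"], len(chars))
        # Same-length masking keeps later offsets valid.
        chars[start:end] = mask_char * (end - start)
    return "".join(chars)


text = "Contact Ana at ana@example.com today."
# Hand-written spans standing in for a PII detection result.
entities = [
    {"category": "Person", "offset": 8, "length": 3},
    {"category": "Email", "offset": 15, "length": 15},
]
masked = mask_entities(text, entities)
```

Masking before logging keeps raw identifiers out of log stores; if the service reports offsets in a different unit (for example UTF-16 code units), convert them before slicing.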

FAQ

Multilingual support?

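Yes. Many services support dozens of languages, but coverage varies by feature. Check the languages you need, validate quality on your own samples, and use custom glossaries for translation where terminology matters.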

Can I train a custom classifier?

Yes, with custom classification where available or with Azure ML for full pipelines.

How to handle sensitive text?

Enable PII detection, apply masking and access controls; use compliant regions and limited retention.