NER & NLP

30+

Domain-specific NER and natural language understanding

Named entity recognition models specialized for healthcare (BioBERT, ClinicalBERT), finance (FinBERT, SecBERT), legal (LegalBERT), and general purpose (BERT-NER, GLiNER). Extract entities, classify text, and understand domain-specific language in pure C#.

Medical Records Financial Analysis Legal Document Review News Processing Customer Support Research Papers Compliance Data Extraction

General Purpose NER

Extract persons, organizations, locations, and custom entities from any text.

BERT-NER

BERT-based token classification for standard named entity recognition.

GLiNER

Generalist model for open-vocabulary NER without predefined entity types.

UniNER

Universal NER with instruction tuning for any entity type.

LUKE

Language Understanding with Knowledge-based Embeddings for entity-aware NER.

Flair NER

Contextual string embeddings for sequence labeling.

Healthcare & Biomedical

Extract medical entities, drug names, diseases, and clinical concepts.

BioBERT

BERT pre-trained on PubMed abstracts for biomedical text mining.

PubMedBERT

Domain-specific pre-training on PubMed for biomedical NLP.

ClinicalBERT

BERT adapted for clinical notes and electronic health records.

BioMegatron

NVIDIA large-scale biomedical language model.

Finance & Legal

Extract financial entities, legal terms, and regulatory concepts.

FinBERT (NER)

Financial entity recognition for company names, tickers, and financial terms.

SecBERT

NER trained on SEC filings for regulatory entity extraction.

LegalBERT

Pre-trained on legal corpora for legal entity and clause identification.

Scientific & Multilingual

Specialized NER for scientific text and multiple languages.

SciBERT

BERT pre-trained on scientific papers for research entity extraction.

MatSciBERT

Materials science BERT for extracting materials, properties, and processes.

XLM-RoBERTa NER

Multilingual NER covering 100+ languages.

mBERT NER

Multilingual BERT for cross-lingual entity recognition.

NER with AiModelBuilder

C#
using AiDotNet;

// Train an NER model with AiModelBuilder
var result = await new AiModelBuilder<float, float[], float>()
    .ConfigureModel(new BioBERT<float>(variant: "v1.1"))
    .ConfigureOptimizer(new AdamOptimizer<float>())
    .ConfigurePreprocessing()
    .ConfigureDataLoader(nerDataLoader)
    .BuildAsync();

var entities = result.Predict(textEmbedding);

Start building with NER & NLP

All 30+ implementations are included free under Apache 2.0.