Multi-task efficient zero-shot sequence classification models
AI & ML interests
Text classification, relations extraction, NER, computational biology
Recent Activity
View all activity
Papers
The Million-Label NER: Breaking Scale Barriers with GLiNER bi-encoder
GLiClass: Generalist Lightweight Model for Sequence Classification Tasks
-
knowledgator/gliner-bi-edge-v2.0
Token Classification • Updated • 88 • 7 -
knowledgator/gliner-bi-small-v2.0
Token Classification • Updated • 23 • 4 -
knowledgator/gliner-bi-base-v2.0
Token Classification • Updated • 147 • 5 -
knowledgator/gliner-bi-large-v2.0
Token Classification • Updated • 113 • 12
A joint encoder-decoder GLiNER model for a scalable open-ontology entity recognition
The Multilingual Named Entity Recognition (NER) model which is capable of identifying any entity type.
-
knowledgator/gliner-x-large
Token Classification • Updated • 356 • 35 -
knowledgator/gliner-x-base
Token Classification • Updated • 73 • 9 -
knowledgator/gliner-x-small
Token Classification • Updated • 56 • 16 -
knowledgator/gliner-x-small-v0.5
Token Classification • Updated • 174 • 4
GLiClass with ModernBERT backbone
-
knowledgator/gliclass-modern-large-v2.0
0.4B • Updated • 27 • 3 -
knowledgator/gliclass-modern-base-v2.0
0.2B • Updated • 104 • 2 -
knowledgator/gliclass-base-v2.0-rac-init
Zero-Shot Classification • 0.2B • Updated • 70 • 11 -
knowledgator/gliclass-modern-base-v2.0-init
Zero-Shot Classification • 0.2B • Updated • 499 • 25
Bi-encoder and poly-encoder architectures of GLiNER
-
knowledgator/gliner-bi-small-v1.0
Token Classification • Updated • 69 • 10 -
knowledgator/gliner-bi-base-v1.0
Token Classification • Updated • 7 • 4 -
knowledgator/gliner-bi-large-v1.0
Token Classification • Updated • 56 • 24 -
knowledgator/gliner-poly-small-v1.0
Token Classification • Updated • 135 • 14
Generalist and Light-weighted Models for Zero-shot Text Classification
-
GLiClass SandBox
🌖13Classify text with zero-shot classification
-
knowledgator/gliclass-large-v1.0-init
Zero-Shot Classification • 0.4B • Updated • 24 • 14 -
knowledgator/gliclass-base-v1.0-init
Zero-Shot Classification • 0.2B • Updated • 2 • 2 -
knowledgator/gliclass-small-v1.0-init
Zero-Shot Classification • 0.1B • Updated • 6 • 5
Collection of the best zero-shot text classification models. Fine-tune them with few examples using LiqFit - https://github.com/Knowledgator/LiqFit.
-
knowledgator/comprehend_it-base
Zero-Shot Classification • Updated • 7.26k • • 87 -
MoritzLaurer/deberta-v3-large-zeroshot-v1.1-all-33
Zero-Shot Classification • 0.4B • Updated • 3.69k • • 58 -
MoritzLaurer/mDeBERTa-v3-base-xnli-multilingual-nli-2mil7
Zero-Shot Classification • 0.3B • Updated • 224k • • 351 -
MoritzLaurer/deberta-v3-base-zeroshot-v1.1-all-33
Zero-Shot Classification • Updated • 17k • • 30
Collection of auto-regressive models tuned for text classification
Collection of pre-trained encoder models trained on large molecules databases.
GLiNER-bi-Encoder models for entity linking with the GLiNKER framework
PII detection models developed in collaboration with Wordcab
-
knowledgator/gliner-pii-large-v1.0
Token Classification • Updated • 2.93k • 34 -
knowledgator/gliner-pii-base-v1.0
Token Classification • Updated • 2.97k • 12 -
knowledgator/gliner-pii-small-v1.0
Token Classification • Updated • 211 • 7 -
knowledgator/gliner-pii-edge-v1.0
Token Classification • Updated • 338 • 10
Models for zero-shot text classification that are up to 50 times faster than Cross-Encoders and show the same or higher accuracy.
-
GLiClass: Generalist Lightweight Model for Sequence Classification Tasks
Paper • 2508.07662 • Published • 9 -
knowledgator/gliclass-edge-v3.0
Text Classification • 32.7M • Updated • 168 • 17 -
knowledgator/gliclass-modern-base-v3.0
Text Classification • 0.2B • Updated • 488 • 4 -
knowledgator/gliclass-modern-large-v3.0
Text Classification • 0.4B • Updated • 78 • 14
Collection of high-quality GLiNER models tuned for working with biomedical data
-
GLiNER-biomed: A Suite of Efficient Models for Open Biomedical Named Entity Recognition
Paper • 2504.00676 • Published • 5 -
Ihor/gliner-biomed-large-v1.0
Token Classification • Updated • 207 • 14 -
Ihor/gliner-biomed-base-v1.0
Token Classification • Updated • 69 • 5 -
Ihor/gliner-biomed-small-v1.0
Token Classification • Updated • 20 • 3
GLiNER models based on modern encoder architectures
Collection of initial models and models that use converted decoders to encoders as backbones
-
knowledgator/Qwen-encoder-0.5B
Question Answering • 0.5B • Updated • 114 • 10 -
knowledgator/Llama-encoder-1.0B
Question Answering • 1B • Updated • 765 • 3 -
knowledgator/Sheared-LLaMA-encoder-1.3B
Question Answering • 1B • Updated • 9 • 2 -
knowledgator/Qwen-encoder-1.5B
Question Answering • 2B • Updated • 12 • 2
Collection of universal token classification (UTC) models capable in prompt-tuned manner to solve many information extraction tasks.
-
UTC HandyLab
📚3 -
GLiNER multi-task: Generalist Lightweight Model for Various Information Extraction Tasks
Paper • 2406.12925 • Published • 25 -
knowledgator/UTC-DeBERTa-large-v2
Token Classification • 0.4B • Updated • 12 • 24 -
knowledgator/UTC-DeBERTa-base-v2
Token Classification • 0.2B • Updated • 41
Knowledgator GLiNER models for information extraction
-
knowledgator/gliner-multitask-v1.0
Token Classification • Updated • 5.51k • 37 -
knowledgator/gliner-multitask-large-v0.5
Token Classification • Updated • 668 • 138 -
GLiNER HandyLab
⚡84Perform multiple NLP tasks on your text
-
GLiNER multi-task: Generalist Lightweight Model for Various Information Extraction Tasks
Paper • 2406.12925 • Published • 25
Collection of models for converting chemical formats between each other.
-
knowledgator/SMILES2IUPAC-canonical-small
Text Generation • 5.97M • Updated • 9.81k • 7 -
knowledgator/IUPAC2SMILES-canonical-base
Text Generation • Updated • 2.22k • 6 -
knowledgator/IUPAC2SMILES-canonical-small
Text Generation • 5.79M • Updated • 6 • 5 -
knowledgator/SMILES2IUPAC-canonical-base
Text Generation • Updated • 24k • 9
Collection of datasest for various information extraction tasks.
Multi-task efficient zero-shot sequence classification models
GLiNER-bi-Encoder models for entity linking with the GLiNKER framework
-
knowledgator/gliner-bi-edge-v2.0
Token Classification • Updated • 88 • 7 -
knowledgator/gliner-bi-small-v2.0
Token Classification • Updated • 23 • 4 -
knowledgator/gliner-bi-base-v2.0
Token Classification • Updated • 147 • 5 -
knowledgator/gliner-bi-large-v2.0
Token Classification • Updated • 113 • 12
PII detection models developed in collaboration with Wordcab
-
knowledgator/gliner-pii-large-v1.0
Token Classification • Updated • 2.93k • 34 -
knowledgator/gliner-pii-base-v1.0
Token Classification • Updated • 2.97k • 12 -
knowledgator/gliner-pii-small-v1.0
Token Classification • Updated • 211 • 7 -
knowledgator/gliner-pii-edge-v1.0
Token Classification • Updated • 338 • 10
A joint encoder-decoder GLiNER model for a scalable open-ontology entity recognition
Models for zero-shot text classification that are up to 50 times faster than Cross-Encoders and show the same or higher accuracy.
-
GLiClass: Generalist Lightweight Model for Sequence Classification Tasks
Paper • 2508.07662 • Published • 9 -
knowledgator/gliclass-edge-v3.0
Text Classification • 32.7M • Updated • 168 • 17 -
knowledgator/gliclass-modern-base-v3.0
Text Classification • 0.2B • Updated • 488 • 4 -
knowledgator/gliclass-modern-large-v3.0
Text Classification • 0.4B • Updated • 78 • 14
The Multilingual Named Entity Recognition (NER) model which is capable of identifying any entity type.
-
knowledgator/gliner-x-large
Token Classification • Updated • 356 • 35 -
knowledgator/gliner-x-base
Token Classification • Updated • 73 • 9 -
knowledgator/gliner-x-small
Token Classification • Updated • 56 • 16 -
knowledgator/gliner-x-small-v0.5
Token Classification • Updated • 174 • 4
Collection of high-quality GLiNER models tuned for working with biomedical data
-
GLiNER-biomed: A Suite of Efficient Models for Open Biomedical Named Entity Recognition
Paper • 2504.00676 • Published • 5 -
Ihor/gliner-biomed-large-v1.0
Token Classification • Updated • 207 • 14 -
Ihor/gliner-biomed-base-v1.0
Token Classification • Updated • 69 • 5 -
Ihor/gliner-biomed-small-v1.0
Token Classification • Updated • 20 • 3
GLiClass with ModernBERT backbone
-
knowledgator/gliclass-modern-large-v2.0
0.4B • Updated • 27 • 3 -
knowledgator/gliclass-modern-base-v2.0
0.2B • Updated • 104 • 2 -
knowledgator/gliclass-base-v2.0-rac-init
Zero-Shot Classification • 0.2B • Updated • 70 • 11 -
knowledgator/gliclass-modern-base-v2.0-init
Zero-Shot Classification • 0.2B • Updated • 499 • 25
GLiNER models based on modern encoder architectures
Bi-encoder and poly-encoder architectures of GLiNER
-
knowledgator/gliner-bi-small-v1.0
Token Classification • Updated • 69 • 10 -
knowledgator/gliner-bi-base-v1.0
Token Classification • Updated • 7 • 4 -
knowledgator/gliner-bi-large-v1.0
Token Classification • Updated • 56 • 24 -
knowledgator/gliner-poly-small-v1.0
Token Classification • Updated • 135 • 14
Collection of initial models and models that use converted decoders to encoders as backbones
-
knowledgator/Qwen-encoder-0.5B
Question Answering • 0.5B • Updated • 114 • 10 -
knowledgator/Llama-encoder-1.0B
Question Answering • 1B • Updated • 765 • 3 -
knowledgator/Sheared-LLaMA-encoder-1.3B
Question Answering • 1B • Updated • 9 • 2 -
knowledgator/Qwen-encoder-1.5B
Question Answering • 2B • Updated • 12 • 2
Generalist and Light-weighted Models for Zero-shot Text Classification
-
GLiClass SandBox
🌖13Classify text with zero-shot classification
-
knowledgator/gliclass-large-v1.0-init
Zero-Shot Classification • 0.4B • Updated • 24 • 14 -
knowledgator/gliclass-base-v1.0-init
Zero-Shot Classification • 0.2B • Updated • 2 • 2 -
knowledgator/gliclass-small-v1.0-init
Zero-Shot Classification • 0.1B • Updated • 6 • 5
Collection of universal token classification (UTC) models capable in prompt-tuned manner to solve many information extraction tasks.
-
UTC HandyLab
📚3 -
GLiNER multi-task: Generalist Lightweight Model for Various Information Extraction Tasks
Paper • 2406.12925 • Published • 25 -
knowledgator/UTC-DeBERTa-large-v2
Token Classification • 0.4B • Updated • 12 • 24 -
knowledgator/UTC-DeBERTa-base-v2
Token Classification • 0.2B • Updated • 41
Collection of the best zero-shot text classification models. Fine-tune them with few examples using LiqFit - https://github.com/Knowledgator/LiqFit.
-
knowledgator/comprehend_it-base
Zero-Shot Classification • Updated • 7.26k • • 87 -
MoritzLaurer/deberta-v3-large-zeroshot-v1.1-all-33
Zero-Shot Classification • 0.4B • Updated • 3.69k • • 58 -
MoritzLaurer/mDeBERTa-v3-base-xnli-multilingual-nli-2mil7
Zero-Shot Classification • 0.3B • Updated • 224k • • 351 -
MoritzLaurer/deberta-v3-base-zeroshot-v1.1-all-33
Zero-Shot Classification • Updated • 17k • • 30
Knowledgator GLiNER models for information extraction
-
knowledgator/gliner-multitask-v1.0
Token Classification • Updated • 5.51k • 37 -
knowledgator/gliner-multitask-large-v0.5
Token Classification • Updated • 668 • 138 -
GLiNER HandyLab
⚡84Perform multiple NLP tasks on your text
-
GLiNER multi-task: Generalist Lightweight Model for Various Information Extraction Tasks
Paper • 2406.12925 • Published • 25
Collection of auto-regressive models tuned for text classification
Collection of models for converting chemical formats between each other.
-
knowledgator/SMILES2IUPAC-canonical-small
Text Generation • 5.97M • Updated • 9.81k • 7 -
knowledgator/IUPAC2SMILES-canonical-base
Text Generation • Updated • 2.22k • 6 -
knowledgator/IUPAC2SMILES-canonical-small
Text Generation • 5.79M • Updated • 6 • 5 -
knowledgator/SMILES2IUPAC-canonical-base
Text Generation • Updated • 24k • 9
Collection of pre-trained encoder models trained on large molecules databases.
Collection of datasest for various information extraction tasks.