QCRI/LlamaLens
LlamaLens is a specialized multilingual large language model developed by QCRI for analyzing news and social media content. It is designed to perform 18 natural language processing tasks across Arabic, English, and Hindi, leveraging 52 distinct datasets. This model excels in tasks such as sentiment classification, hate speech detection, and news categorization, offering enhanced performance for content analysis in these languages.
Loading preview...
LlamaLens: Specialized Multilingual Content Analysis LLM
LlamaLens is a specialized multilingual large language model developed by QCRI, focusing on the analysis of news and social media content. It is uniquely designed to handle 18 distinct Natural Language Processing (NLP) tasks across three languages: Arabic, English, and Hindi. The model was trained on the comprehensive LlamaLens dataset, which incorporates 52 different datasets to achieve its specialized capabilities.
Key Capabilities
- Multilingual Processing: Proficient in Arabic, English, and Hindi for various NLP tasks.
- Broad Task Coverage: Addresses 18 NLP tasks including attentionworthiness, checkworthiness, claim, cyberbullying, emotion, factuality, harmfulness, hate speech, news categorization, news credibility, news summarization, offensive language, propaganda, sarcasm, sentiment, and stance detection.
- Performance: Demonstrates competitive performance, often surpassing or closely matching State-of-the-Art (SOTA) benchmarks on specific tasks and datasets, particularly with native language instructions.
Good for
- Social Media Monitoring: Analyzing sentiment, detecting hate speech, and identifying offensive content in social media streams.
- News Analysis: Categorizing news, assessing credibility, and detecting claims or propaganda.
- Multilingual NLP Applications: Developers working on applications requiring specialized content analysis in Arabic, English, or Hindi.