57 402

MOHAMMED ABDALLAH

melsiddieg

AI & ML interests

biomedical NLP, knowledge graphs, genomics

Recent Activity

published a dataset 39 minutes ago

melsiddieg/qari-arabic-ocr-10k

updated a dataset about 1 hour ago

melsiddieg/qari-arabic-ocr-10k

liked a model about 11 hours ago

nvidia/nemotron-ocr-v1

Organizations

None yet

upvoted a paper 2 months ago

Reactive Transformer (RxT) -- Stateful Real-Time Processing for Event-Driven Reactive Language Models

Paper • 2510.03561 • Published Oct 3 • 24

upvoted 2 collections 3 months ago

MultiCaRe

MultiCaRe: Open-Source Clinical Case Dataset • 4 items • Updated Sep 25 • 16

FastVLM

Efficient Vision Encoding for Vision Language Models • 9 items • Updated Sep 2 • 104

upvoted an article 6 months ago

Introducing the SQL Console on Datasets

Article • Sep 17, 2024 • 24

upvoted a collection 6 months ago

SARD: Synthetic Arabic Recognition Dataset

A large-scale synthetic Arabic OCR dataset comprising 843,622 book-style document images across 10 fonts, designed to advance VLM for Arabic Texts • 2 items • Updated May 19 • 5

upvoted a collection 9 months ago

BD3-LMs

https://m-arriola.com/bd3lms/ • 4 items • Updated Sep 2 • 27

upvoted a paper 10 months ago

SANA 1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer

Paper • 2501.18427 • Published Jan 30 • 23

upvoted a collection 11 months ago

Reasoning Datasets

Reasoning datasets that are trending 🔥 • 10 items • Updated Jan 3 • 25

upvoted a paper over 1 year ago

Med42-v2: A Suite of Clinical LLMs

Paper • 2408.06142 • Published Aug 12, 2024 • 52

upvoted a collection over 1 year ago

FalconMamba 7B

This collection features the FalconMamba 7B base model, the instruction-tuned version, their 4-bit and GGUF variants, and the demo. • 15 items • Updated about 1 month ago • 34

upvoted an article over 1 year ago

Welcome Falcon Mamba: The first strong attention-free 7B model

Article • Aug 12, 2024 • 113

upvoted 2 papers over 1 year ago

Data Mixture Inference: What do BPE Tokenizers Reveal about their Training Data?

Paper • 2407.16607 • Published Jul 23, 2024 • 23

PaliGemma: A versatile 3B VLM for transfer

Paper • 2407.07726 • Published Jul 10, 2024 • 72

upvoted an article over 1 year ago

Introducing the Open Arabic LLM Leaderboard

Article • May 14, 2024 • 101

upvoted 2 papers over 1 year ago

Jamba: A Hybrid Transformer-Mamba Language Model

Paper • 2403.19887 • Published Mar 28, 2024 • 111

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Paper • 2403.03507 • Published Mar 6, 2024 • 189

upvoted a collection almost 2 years ago

💫 StarCoder2

StarCoder2 models and datasets! • 8 items • Updated Mar 1, 2024 • 89

upvoted 3 papers almost 2 years ago

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Paper • 2402.17764 • Published Feb 27, 2024 • 626

MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases

Paper • 2402.14905 • Published Feb 22, 2024 • 134

Mixtures of Experts Unlock Parameter Scaling for Deep RL

Paper • 2402.08609 • Published Feb 13, 2024 • 36