Writer

Enterprise

company

Verified

https://writer.com/

Get_Writer

writer

Activity Feed

AI & ML interests

AGI, LLMs, Knowledge Graph, Palmyra, Domain Specific LLM

Articles

Introducing the Palmyra-mini family: Powerful, lightweight, and ready to reason!

Sep 11

zhenyu-writer

updated a dataset 1 day ago

Writer/stock_analysis_public_eval_v1

Viewer • Updated 1 day ago • 120 • 10

zhenyu-writer

published a dataset 1 day ago

Writer/stock_analysis_public_eval_v1

Viewer • Updated 1 day ago • 120 • 10

zhenyu-writer

updated a dataset 2 days ago

Writer/financebench_multihop_qa_v1_filtered_tool_calls

Viewer • Updated 2 days ago • 270 • 27

zhenyu-writer

published a dataset 2 days ago

Writer/financebench_multihop_qa_v1_filtered_tool_calls

Viewer • Updated 2 days ago • 270 • 27

zhenyu-writer

updated a dataset 2 days ago

Writer/Toucan_50k_openai_fixed_tool_calls_stringified

Viewer • Updated 2 days ago • 49.8k • 14

zhenyu-writer

published a dataset 2 days ago

Writer/Toucan_50k_openai_fixed_tool_calls_stringified

Viewer • Updated 2 days ago • 49.8k • 14

zhenyu-writer

updated a dataset 3 days ago

Writer/Toucan_1.5M_7500_fixed_tool_calls_consolidated_turns

Viewer • Updated 3 days ago • 7.5k • 8

zhenyu-writer

published a dataset 3 days ago

Writer/Toucan_1.5M_7500_fixed_tool_calls_consolidated_turns

Viewer • Updated 3 days ago • 7.5k • 8

zhenyu-writer

updated a dataset 3 days ago

Writer/Toucan_1.5M_7500_fixed_tool_calls

Viewer • Updated 3 days ago • 7.5k • 81

zhenyu-writer

updated a dataset 4 days ago

Writer/DA-Code-data

Viewer • Updated 4 days ago • 170 • 125

sanderland

authored a paper about 1 month ago

Global PIQA: Evaluating Physical Commonsense Reasoning Across 100+ Languages and Cultures

Paper • 2510.24081 • Published Oct 28 • 16

dmytro-writer

authored a paper 6 months ago

Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning

Paper • 2505.24726 • Published May 30 • 277

sanderland

authored a paper 6 months ago

RewardBench 2: Advancing Reward Model Evaluation

Paper • 2506.01937 • Published Jun 2 • 7

wassemgtk

authored a paper 6 months ago

Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning

Paper • 2505.24726 • Published May 30 • 277

sanderland

authored a paper 7 months ago

Fishing for Magikarp: Automatically Detecting Under-trained Tokens in Large Language Models

Paper • 2405.05417 • Published May 8, 2024 • 1

wassemgtk

posted an update 8 months ago

Post

3238

I’ve been diving into the iRoPE architecture from Llama 4, a game-changer for long-context models! It interleaves local attention (with RoPE) for short contexts and global attention (with inference-time temperature scaling) for long-range reasoning, aiming for infinite context. I’m going to try writing iRoPE. Who wants to help?

Code: https://github.com/wassemgtk/iRoPE-try/blob/main/iRoPE.ipynb
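
As a rough illustration of the interleaving described in this post, the sketch below builds a per-layer attention schedule and a position-dependent scale for the global layers. The function names, the one-in-four ratio of global layers, and the logarithmic scaling formula with its constants are all assumptions made for this example; they are not taken from the linked notebook.

```python
import math
from typing import List


def layer_schedule(num_layers: int, global_every: int = 4) -> List[str]:
    """Label each layer as local (RoPE) or global attention.

    Assumption: every `global_every`-th layer is global; the rest are local.
    """
    return [
        "global" if (i + 1) % global_every == 0 else "local_rope"
        for i in range(num_layers)
    ]


def query_scale(position: int, floor_scale: int = 8192, attn_scale: float = 0.1) -> float:
    """Hypothetical inference-time temperature scale for global layers.

    The scale stays at 1.0 for short contexts and grows slowly (logarithmically)
    with the token position, so distant tokens are not washed out.
    """
    return math.log(position // floor_scale + 1) * attn_scale + 1.0


schedule = layer_schedule(8)
short_scale = query_scale(0)
long_scale = query_scale(100_000)
```

With these illustrative defaults, layers 4 and 8 are global and positions below `floor_scale` are left unscaled.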

1 reply

sanderland

authored a paper 8 months ago

Command A: An Enterprise-Ready Large Language Model

Paper • 2504.00698 • Published Apr 1 • 29

wassemgtk

posted an update 9 months ago

Post

2133

For fun, a new project: SuperTokenizer! A BPE tokenizer trained on C4 to beat GPT-4. Byte-level, A100-powered, and open-source. Messing around with tokens!
https://github.com/wassemgtk/SuperTokenizer
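
For readers unfamiliar with byte-level BPE, here is a minimal sketch of the training loop in general terms: start from raw UTF-8 bytes and repeatedly merge the most frequent adjacent pair into a new token ID. It is a generic illustration only; the function names are hypothetical and nothing here reflects how SuperTokenizer itself is implemented.

```python
from collections import Counter
from typing import Dict, List, Tuple

Pair = Tuple[int, int]


def merge(ids: List[int], pair: Pair, new_id: int) -> List[int]:
    """Replace every non-overlapping occurrence of `pair` with `new_id`."""
    out: List[int] = []
    i = 0
    while i < len(ids):
        if i + 1 < len(ids) and (ids[i], ids[i + 1]) == pair:
            out.append(new_id)
            i += 2
        else:
            out.append(ids[i])
            i += 1
    return out


def train_bpe(text: str, num_merges: int) -> Tuple[List[int], Dict[Pair, int]]:
    """Learn up to `num_merges` merges over the UTF-8 bytes of `text`."""
    ids = list(text.encode("utf-8"))  # byte-level: base vocabulary is 0..255
    merges: Dict[Pair, int] = {}
    for step in range(num_merges):
        if len(ids) < 2:
            break
        # Most frequent adjacent pair in the current sequence.
        pair = Counter(zip(ids, ids[1:])).most_common(1)[0][0]
        new_id = 256 + step  # new tokens are numbered after the 256 bytes
        ids = merge(ids, pair, new_id)
        merges[pair] = new_id
    return ids, merges


tokens, learned = train_bpe("aaaa", num_merges=2)
```

On the toy input `"aaaa"`, the first merge fuses the byte pair `(97, 97)` and the second fuses the resulting pair of new tokens, leaving a single token.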

1 reply

wassemgtk

posted an update 9 months ago

Post

1918

# GESAL: Real-Time Adaptation for LLMs

We’re excited to unveil **Graph-Enhanced Singular Adaptive Learning (GESAL)**, a framework that lets LLMs like meta-llama/Llama-3.2-1B adapt in real time using user feedback. Check out the code and white paper on GitHub!

🔗 **Code**: [https://github.com/writer/AI-Adaptive-Learning-GESAL](https://github.com/writer/AI-Adaptive-Learning-GESAL)

---

## Why GESAL?

Static LLMs struggle to adapt without heavy retraining. GESAL solves this with:
- **SVF**: Adapts weights via \( W' = U (\Sigma \cdot z) V^T \), using few parameters.
- **Graph Memory**: Stores adaptations in nodes for scalability.
- **RL**: Updates via \( J(z) = \mathbb{E}[\log \pi_z(y|x) r] \) based on feedback.
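
To make the SVF formula above concrete, here is a minimal pure-Python sketch that rebuilds a weight matrix from given factors after scaling each singular value by the matching entry of `z`. Reading \( \Sigma \cdot z \) as elementwise scaling of the singular values is an assumption, and the helper names and toy factors are hypothetical; this is not code from the GESAL repository.

```python
from typing import List

Matrix = List[List[float]]


def matmul(a: Matrix, b: Matrix) -> Matrix:
    """Plain matrix product of two nested-list matrices."""
    return [
        [sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
        for i in range(len(a))
    ]


def transpose(m: Matrix) -> Matrix:
    return [list(row) for row in zip(*m)]


def svf_adapt(u: Matrix, sigma: List[float], v: Matrix, z: List[float]) -> Matrix:
    """Return W' = U diag(sigma * z) V^T (assumed elementwise scaling)."""
    scaled = [s * zi for s, zi in zip(sigma, z)]
    n = len(scaled)
    diag = [[scaled[i] if i == j else 0.0 for j in range(n)] for i in range(n)]
    return matmul(matmul(u, diag), transpose(v))


# Toy orthogonal factors (a permutation and the identity), chosen for readability.
U = [[0.0, 1.0], [1.0, 0.0]]
SIGMA = [3.0, 2.0]
V = [[1.0, 0.0], [0.0, 1.0]]

W = svf_adapt(U, SIGMA, V, [1.0, 1.0])          # z = 1 recovers the original W
W_ADAPTED = svf_adapt(U, SIGMA, V, [0.5, 2.0])  # only two numbers were changed
```

Only the vector `z` (one value per singular value) is learned, which is why this style of adaptation needs so few parameters compared with updating the full matrix.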

---

## How It Works

Ask "How many R’s in ‘strawberry’?" If it says "2" and you say "no," GESAL learns to say "3" next time, avoiding repeats.

---

## Try It

Built with Hugging Face’s transformers:

pip install transformers torch numpy
python "Adaptive_Learning_(GESAL).py"

Needs a Hugging Face token for Llama-3.2-1B.

---

## Results

GESAL reaches 95% accuracy after 5 rounds of feedback, versus 70% for LoRA. It’s efficient (~0.5M params) and scalable.

15 replies

wassemgtk

authored a paper 10 months ago

Expect the Unexpected: FailSafe Long Context QA for Finance

Paper • 2502.06329 • Published Feb 10 • 133

Team members 176
