131 33 431

Jeonghwan Park PRO

maywell

https://www.linkedin.com/in/jeonghwan-park-6b97b1245

AI & ML interests

None yet

Recent Activity

liked a model 10 days ago

Tongyi-MAI/Z-Image-Turbo

upvoted a paper 13 days ago

Flash Sparse Attention: An Alternative Efficient Implementation of Native Sparse Attention Kernel

liked a model 20 days ago

Human-CentricAI/LLM-Refusal-Classifier

View all activity

Organizations

upvoted a paper 13 days ago

Flash Sparse Attention: An Alternative Efficient Implementation of Native Sparse Attention Kernel

Paper • 2508.18224 • Published Aug 25 • 1

upvoted a paper 20 days ago

MMaDA-Parallel: Multimodal Large Diffusion Language Models for Thinking-Aware Editing and Generation

Paper • 2511.09611 • Published 25 days ago • 68

upvoted a paper about 1 month ago

KORMo: Korean Open Reasoning Model for Everyone

Paper • 2510.09426 • Published Oct 10 • 81

upvoted an article 2 months ago

Article

Vocabulary is the most important element of Sparse Retrieval

Oct 4

•

upvoted an article 3 months ago

Article

Training and Finetuning Reranker Models with Sentence Transformers v4

Mar 26

•

175

upvoted an article 4 months ago

Article

ChatML vs Harmony: Understanding the new Format from OpenAI 🔍

Aug 9

•

upvoted a paper 9 months ago

Kanana: Compute-efficient Bilingual Language Models

Paper • 2502.18934 • Published Feb 26 • 65

upvoted a paper 12 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 376

upvoted 2 papers about 1 year ago

When Precision Meets Position: BFloat16 Breaks Down RoPE in Long-Context Training

Paper • 2411.13476 • Published Nov 20, 2024 • 16

Stronger Models are NOT Stronger Teachers for Instruction Tuning

Paper • 2411.07133 • Published Nov 11, 2024 • 38

upvoted an article about 1 year ago

Article

Navigating Korean LLM Research #1: Models

Oct 22, 2024

•

upvoted a paper about 1 year ago

Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free

Paper • 2410.10814 • Published Oct 14, 2024 • 51

upvoted a collection about 1 year ago

Gemma-APS Release

Collection

Gemma models for text-to-propositions segmentation. The models are distilled from fine-tuned Gemini Pro model applied to multi-domain synthetic data. • 3 items • Updated Jul 10 • 22

upvoted an article about 1 year ago

Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Sep 18, 2024

•

272

upvoted 2 articles over 1 year ago

Article

dstack: Your LLM Launchpad - From Fine-Tuning to Serving, Simplified

Aug 22, 2024

•

Article

Training and Finetuning Embedding Models with Sentence Transformers v3

May 28, 2024

•

261

upvoted a paper over 1 year ago

Improving Text Embeddings with Large Language Models

Paper • 2401.00368 • Published Dec 31, 2023 • 82

upvoted an article over 1 year ago

Article

Putting RL back in RLHF

Jun 12, 2024

•

109

upvoted 2 papers over 1 year ago

ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools

Paper • 2406.12793 • Published Jun 18, 2024 • 33

Aligning to Thousands of Preferences via System Message Generalization

Paper • 2405.17977 • Published May 28, 2024 • 7

Jeonghwan Park PRO

AI & ML interests

Recent Activity

Organizations

maywell's activity

Vocabulary is the most important element of Sparse Retrieval

Training and Finetuning Reranker Models with Sentence Transformers v4

ChatML vs Harmony: Understanding the new Format from OpenAI 🔍

Navigating Korean LLM Research #1: Models

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

dstack: Your LLM Launchpad - From Fine-Tuning to Serving, Simplified

Training and Finetuning Embedding Models with Sentence Transformers v3

Putting RL back in RLHF