Jun's Blog

๐Ÿ‘จ๐Ÿปโ€๐Ÿ’ป Tech

HyperConnection, mHC

Architecture ...

2025-12-05    2373 words    5 min    Jun    LLM

PD Disaggregation

Inference ...

2025-04-05    425 words    1 min    Jun    LLM

Scaling LLMs in Depth

Architecture ...

2025-03-05    767 words    2 min    Jun    LLM

MoE Models

MoE Models ...

2025-02-18    1487 words    3 min    Jun    ML

Distillation

Knowledge distillation ...

2025-01-05    834 words    2 min    Jun    LLM

Entropy Collapsing in RL Training

Decoding ...

2025-01-05    494 words    1 min    Jun    ML

JAX

JAX ...

2024-11-18    825 words    2 min    Jun    ML

Flux

Efficient Training ...

2024-10-10    1476 words    3 min    Jun    AI Infra

vLLM

vLLM ...

2024-10-05    1252 words    3 min    Jun    Inference

AsyncIO

Async concurrency ...

2024-08-09    1545 words    4 min    Jun    Python

veRL

RL training framework ...

2024-08-05    474 words    1 min    Jun    RL

PPO and Its Implementation

Proximal Policy Optimization ...

2024-07-05    2025 words    5 min    Jun    RL

Diffusion Probabilistic Models

Dive into diffusion ...

2024-06-18    404 words    1 min    Jun    ML

Loss Reduction

Losses in ML ...

2024-06-18    843 words    2 min    Jun    ML

Ray

Async Ops in Ray ...

2024-06-18    2398 words    5 min    Jun    ML



Copyright © 2020-2026 Jun's Blog All Rights Reserved