logoJun's Blog
  • |
  • ๐Ÿ  Home
  • ๐Ÿ™‹๐Ÿปโ€โ™‚๏ธ About
  • ๐Ÿ“š Posts
  • ๐Ÿงฉ Tags
  • โฑ๏ธ Archives
  • ๐Ÿ” Search
๐Ÿ  Home ยป ๐Ÿ“šArticles

๐Ÿ‘จ๐Ÿปโ€๐Ÿ’ป Tech

Distillation

Knowledge distillation ...

2025-01-05    823 words    2 min    Jun    LLM

Entropy Collapsing in RL Training

Decoding ...

2025-01-05    494 words    1 min    Jun    ML

vLLM

VLLM ...

2024-10-05    1200 words    3 min    Jun    Inference

AsyncIO

Async concurrency ...

2024-08-09    1299 words    3 min    Jun    Python

veRL

RL training framework ...

2024-08-05    474 words    1 min    Jun    RL

PPO and Its Implementation

Proximal Policy Optimization ...

2024-07-05    2025 words    5 min    Jun    RL

Diffusion Probabilistic Models

Dive into diffusion ...

2024-06-18    404 words    1 min    Jun    ML

Loss Reduction

Losses in ML ...

2024-06-18    498 words    1 min    Jun    ML

Ray

Async Ops in Ray ...

2024-06-18    2287 words    5 min    Jun    ML

VQ-VAE

Multimodality ...

2024-05-18    685 words    2 min    Jun    Multimodality

Pipe in Multiprocessing

Python ...

2024-04-09    1169 words    3 min    Jun   

Monte Carlo Tree Search

MCTS ...

2024-04-05    907 words    2 min    Jun    ML

Autograd

PyTorch.. ...

2024-03-11    1362 words    3 min    Jun    ML

Whisper Model

Whisper Model ...

2024-03-05    276 words    1 min    Jun    LLM

RPC in Torch

PyTorch.. ...

2024-01-11    418 words    1 min    Jun    Infra


1 2  ...  4

Copyright © 2020-2025 Jun's Blog All Rights Reserved