Jun's Blog

ML

Entropy Collapsing in RL Training

Decoding ...

2025-01-05    494 words    1 min    Jun    ML

Diffusion Probabilistic Models

Dive into diffusion ...

2024-06-18    404 words    1 min    Jun    ML

Loss Reduction

Losses in ML ...

2024-06-18    498 words    1 min    Jun    ML

Ray

Async Ops in Ray ...

2024-06-18    2287 words    5 min    Jun    ML

Monte Carlo Tree Search

MCTS ...

2024-04-05    907 words    2 min    Jun    ML

Autograd

PyTorch ...

2024-03-11    1362 words    3 min    Jun    ML

Coder Training

Large coder model pretraining ...

2023-12-18    411 words    1 min    Jun    LLM  ML

Model Evaluation

Model Evaluation

2023-10-18    102 words    1 min    Jun    ML

MoE Models

MoE Models ...

2023-10-18    987 words    2 min    Jun    ML

Retrieval Augmented Generation

RAG system ...

2023-10-18    1858 words    4 min    Jun    ML

Flash Attention

Large language model pretraining ...

2023-06-18    603 words    2 min    Jun    LLM  ML

Distributed Optimizer

Distributed optimizer ...

2023-05-05    673 words    2 min    Jun    ML

LoRA Model Fine-tuning

LoRA finetuning ...

2023-05-05    211 words    1 min    Jun    ML

ML System

ML System

2023-03-05    47 words    1 min    Jun    ML

InstructGPT and ChatGPT

RLHF ...

2023-01-18    574 words    2 min    Jun    LLM  ML



Copyright © 2020-2025 Jun's Blog All Rights Reserved