Jun's Blog

Tech

MoE Models

MoE Models ...

2023-10-18    987 words    2 min    Jun    ML

Retrieval Augmented Generation

RAG system ...

2023-10-18    1858 words    4 min    Jun    ML

Flash Attention

Large language model pretraining ...

2023-06-18    603 words    2 min    Jun    LLM  ML

Data Processing in Distributed Training

Distributed data processing ...

2023-05-05    952 words    2 min    Jun    Distributed Training

Distributed Optimizer

Distributed optimizer ...

2023-05-05    673 words    2 min    Jun    ML

LoRA Model Fine-tuning

LoRA finetuning ...

2023-05-05    211 words    1 min    Jun    ML

ML System

ML System

2023-03-05    47 words    1 min    Jun    Blog

InstructGPT and ChatGPT

RLHF ...

2023-01-18    574 words    2 min    Jun    LLM  ML

Large Scale Pretraining

Large language model pretraining ...

2022-12-18    2203 words    5 min    Jun    LLM  ML

Kubernetes

Kubernetes is everywhere. ...

2022-12-08    652 words    2 min    Jun   

ANTLR Parser Generator

How to use the ANTLR framework to generate a code parser ...

2022-08-08    428 words    1 min    Jun    Parsing

Parallelism in LLM Training

Parallelism in LLM training ...

2022-07-08    1118 words    3 min    Jun   

PyTorch Multiple-GPU Training

PyTorch ...

2022-06-11    1053 words    3 min    Jun    Blog

Distributed Training Infra

Distributed training infrastructure ...

2022-05-05    668 words    2 min    Jun    Blog

A Walk in the Cloud

AWS ...

2022-05-05    1104 words    3 min    Jun    Tech



Copyright © 2020-2025 Jun's Blog All Rights Reserved