본문으로 건너뛰기
AI Trends
피드
트렌딩
콜로세움
로그인
피드
트렌딩
콜로세움
학습/파인튜닝
학습/파인튜닝 관련 AI 뉴스를 한국어로 요약해 드립니다.
LoRA
79
Reinforcement Learning
69
RLHF
54
PPO
34
GRPO
26
Distillation
18
RLVR
13
DPO
11
DQN
10
SFT
9
Fine-tuning
9
SAC
8
Self-Supervised Learning
7
Contrastive Learning
6
Imitation Learning
6
Active Learning
6
Zero-shot Learning
5
Sim-to-Real
5
Online Learning
5
Bayesian Inference
5
Continual Learning
5
Flow Matching
4
Federated Learning
4
Variational Inference
4
Behavior Cloning
4
EMA
4
Multimodal RL
4
FSDP
4
MARL
3
HACRL
3
Self-distillation
3
ArcFace
3
Test-Time Training
3
HACPO
3
Distributed Training
3
Recursive Self-Improvement
2
Digital Red Queen
2
RLVF
2
SoftDTW
2
Physics-informed ML
2
μP
2
DiLoCo
2
Nova Forge
2
Curriculum Learning
2
Transfer Learning
2
Pretraining
2
AlphaZero
2
SPoT
2
LIPPAX
2
Gumbel Softmax
2
MemexRL
2
Nested Learning
2
SDPO
2
Pre-training
2
Contextual Bandit
2
Q-Learning
2
Gradient Accumulation
2
IMPALA
2
Self-play
2
TD3
2
DRQ
1
MEMIT
1
Post-training
1
Machine Learning
1
Attention Z-Reg
1
Scaled Reinforcement Learning
1
LSVI-UCB
1
Policy Gradient
1
Training Dynamics
1
POET-X
1
Decentralized Training
1
Boosting
1
PRM
1
Transductive Learning
1
Boosted Control Function
1
SGD
1
Causal RL
1
OmniDPO
1
Universal Self-Improvement
1
InfoPO
1
Policy Conditioning
1
SRL
1
NCE
1
DSDR
1
TBPTT
1
BetaZero
1
ConstrainedZero
1
InfoNCE
1
Reverse-KL Divergence
1
Value Iteration
1
RFT
1
Noise Contrastive Estimation
1
Empirical Likelihood
1
Symplectic Euler Method
1
Bilinear Game
1
Instruction Selection
1
Next-Token Prediction
1
Optimization
1
Confession
1
Backpropagation
1
He Initialization
1
Complete(d)P
1
Evolution Strategies
1
Complete(d)P
1
Reward Model
1
VESPO
1
Word2Vec
1
zElo
1
Adversarial Evolution
1
Adaptive Gradient Clipping
1
Selective Learning
1
LoKR
1
μP
1
Oversampling
1
CLaaS
1
Local Extra SGD
1
RLAIF
1
TTRL
1
LESS
1
Nonlinear Regression
1
Coreset Selection
1
BetaZero
1
ReGFT
1
DAPO
1
Stability Metric Φ
1
GAIL
1
Policy Optimization
1
Concept Data Attribution
1
Evolutionary Strategies
1
InSight
1
Pseudo Labeling
1
ELFS
1
Quantization-Aware Training
1
REINFORCE
1
LK Loss
1
Continuous Learning
1
Dueling DDQN
1
Cross-modal Distillation
1
Regression
1
DAgger
1
DPE
1
A3C
1
Multi-Task Learning
1
GSPO
1
Continuous Online Learning
1
SAMPO
1
EWC
1
Alternating Mirror Descent
1
Self Forcing
1
Data Augmentation
1
Dropout
1
Teacher Forcing
1
학습/파인튜닝 관련 모든 뉴스 보기 →