반응형
What is Learning Rate?
- Learning rate is a crucial hyperparameter in machine learning that controls the degree of updates on the model's weights.
- Generally, a loss function represents a hyperplane(초평면) in multi-dimensional space characterized by complex curvatures.
- If the weight updates are too large and overshoot the global minimum(the point where the loss function could be minimized), it becomes impossible to find the optimal solution(optimal weights and biases).
- If the learning rate is too high, the weight cahnges can be too large, causing it to overshoot the global minimum.
- When the learning rate is appropriate, weight changes are stable, allowing the model to efficiently reach the global minimum.
- Therefore, it's necessary to adjust the amount of weight updates to avoid missing the global minimum.
Reference:
Learn to embrace failure, for it is in those moments that you learn the most.- Max Holloway -
반응형
'캐글' 카테고리의 다른 글
[Kaggle Study] 5. Regularization 가중치 규제 (2) | 2024.10.30 |
---|---|
[Kaggle Study] 4. Overfitting, Underfitting, Variance and Bias (2) | 2024.10.29 |
[Kaggle Study] 2. Scale of Features (1) | 2024.10.29 |
[Kaggle Extra Study] 9. Plots with Missing Data (3) | 2024.10.28 |
[Kaggle Extra Study] 8. Imputation Techniques for Time Series Data (0) | 2024.10.27 |