@@ -140,7 +140,8 @@ <h4>✨ <a href="optimizers/index.html">Optimizers</a></h4>
 <li><a href="optimizers/adam_warmup.html">Adam Optimizer with warmup</a></li>
 <li><a href="optimizers/noam.html">Noam Optimizer</a></li>
 <li><a href="optimizers/radam.html">Rectified Adam Optimizer</a></li>
-<li><a href="optimizers/ada_belief.html">AdaBelief Optimizer</a></li></ul>
+<li><a href="optimizers/ada_belief.html">AdaBelief Optimizer</a></li>
+<li><a href="optimizers/sophia.html">Sophia-G Optimizer</a></li></ul>
 <h4>✨ <a href="normalization/index.html">Normalization Layers</a></h4>
 <ul><li><a href="normalization/batch_norm/index.html">Batch Normalization</a></li>
 <li><a href="normalization/layer_norm/index.html">Layer Normalization</a></li>
@@ -77,7 +77,8 @@ <h2>Optimizer Implementations</h2>
 <li><a href="https://nn.labml.ai/optimizers/adam_warmup.html">Adam Optimizer with warmup</a></li>
 <li><a href="https://nn.labml.ai/optimizers/noam.html">Noam Optimizer</a></li>
 <li><a href="https://nn.labml.ai/optimizers/radam.html">Rectified Adam Optimizer</a></li>
-<li><a href="https://nn.labml.ai/optimizers/ada_belief.html">AdaBelief Optimizer</a></li></ul>
+<li><a href="https://nn.labml.ai/optimizers/ada_belief.html">AdaBelief Optimizer</a></li>
+<li><a href="https://nn.labml.ai/optimizers/sophia.html">Sophia-G Optimizer</a></li></ul>

 </div>
 <div class='code'>

 * [Noam Optimizer](optimizers/noam.html)
 * [Rectified Adam Optimizer](optimizers/radam.html)
 * [AdaBelief Optimizer](optimizers/ada_belief.html)
+* [Sophia-G Optimizer](optimizers/sophia.html)

 #### ✨ [Normalization Layers](normalization/index.html)
 * [Batch Normalization](normalization/batch_norm/index.html)

 * [Noam Optimizer](noam.html)
 * [Rectified Adam Optimizer](radam.html)
 * [AdaBelief Optimizer](ada_belief.html)
+* [Sophia-G Optimizer](sophia.html)

 This [MNIST example](mnist_experiment.html) uses these optimizers.


 * [Noam Optimizer](https://nn.labml.ai/optimizers/noam.html)
 * [Rectified Adam Optimizer](https://nn.labml.ai/optimizers/radam.html)
 * [AdaBelief Optimizer](https://nn.labml.ai/optimizers/ada_belief.html)
+* [Sophia-G Optimizer](https://nn.labml.ai/optimizers/sophia.html)
@@ -106,6 +106,7 @@ Solving games with incomplete information such as poker with CFR.
 * [Noam Optimizer](https://nn.labml.ai/optimizers/noam.html)
 * [Rectified Adam Optimizer](https://nn.labml.ai/optimizers/radam.html)
 * [AdaBelief Optimizer](https://nn.labml.ai/optimizers/ada_belief.html)
+* [Sophia-G Optimizer](https://nn.labml.ai/optimizers/sophia.html)

 #### ✨ [Normalization Layers](https://nn.labml.ai/normalization/index.html)
 * [Batch Normalization](https://nn.labml.ai/normalization/batch_norm/index.html)