File tree Expand file tree Collapse file tree
Expand file tree Collapse file tree Original file line number Diff line number Diff line change @@ -302,4 +302,4 @@ <h3>Double <span ><span class="katex"><span aria-hidden="true" class="katex-html
302302 handleImages ( )
303303</ script >
304304</ body >
305- </ html >
305+ </ html >
Original file line number Diff line number Diff line change 554554
555555 <url >
556556 <loc >https://nn.labml.ai/diffusion/stable_diffusion/latent_diffusion.html</loc >
557- <lastmod >2022-09-15T16 :30:00+00:00</lastmod >
557+ <lastmod >2022-12-21T16 :30:00+00:00</lastmod >
558558 <priority >1.00</priority >
559559 </url >
560560
11771177
11781178 <url >
11791179 <loc >https://nn.labml.ai/transformers/mha.html</loc >
1180- <lastmod >2022-09-07T16 :30:00+00:00</lastmod >
1180+ <lastmod >2022-12-24T16 :30:00+00:00</lastmod >
11811181 <priority >1.00</priority >
11821182 </url >
11831183
Original file line number Diff line number Diff line change @@ -51,7 +51,7 @@ class QFuncLoss(Module):
5151 ### Target network 🎯
5252 In order to improve stability we use experience replay that randomly sample
5353 from previous experience $U(D)$. We also use a Q network
54- with a separate set of paramters $\t extcolor{orange}{\t heta_i^{-}}$ to calculate the target.
54+ with a separate set of parameters $\t extcolor{orange}{\t heta_i^{-}}$ to calculate the target.
5555 $\t extcolor{orange}{\t heta_i^{-}}$ is updated periodically.
5656 This is according to paper
5757 [Human Level Control Through Deep Reinforcement Learning](https://deepmind.com/research/dqn/).
You can’t perform that action at this time.
0 commit comments