|
222 | 222 | - 所以**◆◆**最终给出初始化权重的方法为: |
223 | 223 | ![$$W \sim U[ - {{\sqrt 6 } \over {\sqrt {{n_i} + {n_{i + 1}}} }},{{\sqrt 6 } \over {\sqrt {{n_i} + {n_{i + 1}}} }}]$$](http://latex.codecogs.com/gif.latex?%5Clarge%20%24%24W%20%5Csim%20U%5B%20-%20%7B%7B%5Csqrt%206%20%7D%20%5Cover%20%7B%5Csqrt%20%7B%7Bn_i%7D%20+%20%7Bn_%7Bi%20+%201%7D%7D%7D%20%7D%7D%2C%7B%7B%5Csqrt%206%20%7D%20%5Cover%20%7B%5Csqrt%20%7B%7Bn_i%7D%20+%20%7Bn_%7Bi%20+%201%7D%7D%7D%20%7D%7D%5D%24%24) |
224 | 224 | - 这就是**Xavier初始化**方法 |
| 225 | + |
| 226 | +------------------------------------------------------ |
| 227 | + |
| 228 | +## 三、权重初始化问题2_`ReLu`激励函数 |
| 229 | +### 1、`ReLu/PReLu`激励函数 |
| 230 | +- 目前`ReLu`激活函数使用比较多,而上面一篇论文没有讨论,如果还是使用同样初始化权重的方法(**Xavier初始化**)会有问题 |
| 231 | +- PReLu函数定义如下: |
| 232 | + - ![enter description here][22] |
| 233 | + - 等价于: |
| 234 | +- ReLu(左)和PReLu(右)激活函数图像 |
| 235 | +![enter description here][23] |
| 236 | + |
| 237 | +### 2、前向传播推导 |
| 238 | +- 符号说明 |
| 239 | + - ε…………………………………目标函数 |
| 240 | + - μ…………………………………动量 |
| 241 | + - α…………………………………学习率 |
| 242 | + - f()………………………………激励函数 |
| 243 | + - l……………………………………当前层 |
| 244 | + - L……………………………………神经网络总层数 |
| 245 | + - k……………………………………过滤器filter的大小 |
| 246 | + - c……………………………………输入通道个数 |
| 247 | + - x……………………………………k2c*1的向量 |
| 248 | + - d……………………………………过滤器filter的个数 |
| 249 | + - b……………………………………偏置向量 |
| 250 | +- .............................(1) |
| 251 | + |
| 252 | +### 3、 |
| 253 | + |
| 254 | + |
| 255 | + |
| 256 | + |
| 257 | + |
| 258 | + |
| 259 | + |
| 260 | + |
| 261 | + |
| 262 | + |
| 263 | + |
| 264 | + |
| 265 | + |
| 266 | + |
| 267 | + |
| 268 | + |
| 269 | + |
| 270 | + |
| 271 | + |
| 272 | + |
| 273 | + |
| 274 | + |
| 275 | + |
| 276 | + |
| 277 | + |
| 278 | + |
| 279 | + |
| 280 | + |
| 281 | + |
225 | 282 |
|
| 283 | + |
| 284 | + |
226 | 285 | [1]: ./images/CNN_01.gif "CNN_01.gif" |
227 | 286 | [2]: ./images/CNN_02.gif "CNN_02.gif" |
228 | 287 | [3]: ./images/CNN_03.png "CNN_03.png" |
|
243 | 302 | [18]: ./images/Weights_initialization_02.png "Weights_initialization_02.png" |
244 | 303 | [19]: ./images/Weights_initialization_03.png "Weights_initialization_03.png" |
245 | 304 | [20]: ./images/Weights_initialization_04.png "Weights_initialization_04.png" |
246 | | - [21]: ./images/Weights_initialization_05.png "Weights_initialization_05.png" |
| 305 | + [21]: ./images/Weights_initialization_05.png "Weights_initialization_05.png" |
| 306 | + [22]: ./images/Weights_initialization_06.png "Weights_initialization_06.png" |
| 307 | + [23]: ./images/Weights_initialization_07.png "Weights_initialization_07.png" |
0 commit comments