
WiG: Weighted Sigmoid Gate Unit for an Activation Function of Deep Neural Network
Block Diagram

Abstract
An activation function has crucial role in a deep neural network.
A simple rectified linear unit (ReLU) are widely used for the activation function.
In this paper, a weighted sigmoid gate unit (WiG) is proposed as the activation function.
The proposed WiG consists of a multiplication of inputs and the weighted sigmoid gate.
It is shown that the WiG includes the ReLU and same activation functions as a special case.
Many activation functions have been proposed to overcome the performance of the ReLU.
In the literature, the performance is mainly evaluated with an object recognition task.
The proposed WiG is evaluated with the object recognition task and the image restoration task.
Then, the expeirmental comparisons demonstrate the proposed WiG overcomes the existing activation functions including the ReLU.
Evaluations
Comparison of the Validation Accuracy
|
cifar-10 |
cifar-100 |
ReLU |
0.927 |
0.653 |
selu |
0.899 |
0.572 |
ELU |
0.903 |
0.550 |
softplus |
0.908 |
0.598 |
Leaky ReLU |
0.918 |
0.673 |
SiL |
0.919 |
0.638 |
PReLU |
0.935 |
0.678 |
Swish |
0.935 |
0.689 |
WiG (Pro.) |
0.949 |
0.742 |
Comparison of the Denoising Results
Average PSNR
Noise Level |
5 |
10 |
15 |
20 |
25 |
30 |
ReLU |
37.13 |
33.86 |
32.02 |
30.76 |
29.77 |
28.96 |
selu |
36.89 |
33.68 |
31.88 |
30.60 |
29.60 |
28.79 |
ELU |
37.07 |
33.63 |
31.86 |
30.60 |
29.65 |
28.86 |
softplus |
36.41 |
33.42 |
31.67 |
30.45 |
29.50 |
28.69 |
Leaky ReLU |
37.07 |
33.81 |
31.97 |
30.68 |
29.69 |
28.89 |
Swish |
37.13 |
33.70 |
31.91 |
30.69 |
29.72 |
28.93 |
WiG (Pro.) |
37.29 |
34.00 |
32.16 |
30.88 |
29.90 |
29.10 |
Average SSIM
Noise Level |
5 |
10 |
15 |
20 |
25 |
30 |
ReLU |
0.9383 |
0.8971 |
0.8646 |
0.8378 |
0.8142 |
0.7932 |
selu |
0.9355 |
0.8929 |
0.8607 |
0.8328 |
0.8084 |
0.7855 |
ELU |
0.9362 |
0.8910 |
0.8601 |
0.8331 |
0.8094 |
0.7888 |
softplus |
0.9337 |
0.8878 |
0.8547 |
0.8274 |
0.8041 |
0.7822 |
Leaky ReLU |
0.9380 |
0.8959 |
0.8628 |
0.8350 |
0.8104 |
0.7888 |
Swish |
0.9377 |
0.8928 |
0.8621 |
0.8370 |
0.8137 |
0.7928 |
WiG (Pro.) |
0.9390 |
0.8993 |
0.8679 |
0.8412 |
0.8188 |
0.7981 |
Publication
Masayuki Tanaka,
Weighted Sigmoid Gate Unit for an Activation Function of Deep Neural Network,
Pattern Recognition Letters, Vol. 135, pp. 354-359, Jul. 2020. [PRL]
Masayuki Tanaka,
Weighted Sigmoid Gate Unit for an Activation Function of Deep Neural Network,
arXiv preprint arXiv:1810.01829, 2018. [arXiv]
Code
Links
Copyright© Okutomi-Tanaka Lab All Rights Reserved.