博客
关于我
学习笔记之——Normalization
阅读量:515 次
发布时间:2019-03-07

本文共 3662 字,大约阅读时间需要 12 分钟。

Normalization Methods in Deep Learning

Normalization techniques are crucial in deep learning, particularly for training deep neural networks efficiently. These techniques help address issues like Internal Covariate Shift, which can hinder training due to varying input distributions across batches. Here, we explore key normalization methods and their applications.

Batch Normalization (BN)

Batch Normalization (BN) was introduced to address the Internal Covariate Shift issue. By normalizing each batch to have a mean of 0 and variance of 1, BN stabilizes the input distribution, preventing gradient issues like vanishing or exploding gradients. BN introduces learnable parameters β and γ to shift and scale the normalized values, allowing the model to adapt to different data distributions.

Advantages of BN

  • Reduces Internal Covariate Shift, accelerating training.
  • Mitigates gradient issues, especially with activation functions like sigmoid or tanh.
  • Less sensitive to initial parameter values, enhancing training stability.

Disadvantages of BN

  • Batch size sensitivity: Small batch sizes lead to inaccurate mean and variance estimates.
  • Requires storing batch statistics during training, which can be memory-intensive for large batches.
  • Not suitable for RNNs due to varying sequence lengths.

Layer Normalization (LN)

Layer Normalization (LN) normalizes all features in a layer across the batch. Unlike BN, LN does not depend on batch size, making it suitable for small batches. It computes mean and variance across the entire layer, ensuring consistent normalization regardless of batch size. This makes LN more versatile, particularly for RNNs where sequence lengths vary.

Differences Between BN and LN

  • BN normalizes features per batch, while LN normalizes features across the entire layer.
  • LN does not require storing batch statistics, saving memory.

Weight Normalization (WN)

Weight Normalization (WN) normalizes filter weights rather than features. It decouples weights into magnitude (beta) and direction (gamma), allowing independent training of these parameters. Unlike BN and LN, WN does not depend on input data distribution, making it a form of parameter normalization.

Instance Normalization (IN)

Instance Normalization (IN) is ideal for tasks like image style transfer, where the model's output depends on individual image instances rather than the entire batch. IN normalizes features within each image instance, ensuring consistent output while allowing diverse styles across instances.

Group Normalization (GN)

Group Normalization (GN) addresses the limitations of BN by grouping features along the channel dimension. Each group is normalized separately, reducing the impact of batch size on normalization accuracy. GN is particularly effective for small batches and can be seen as a middle ground between BN and LN.

Key Characteristics of GN

  • Divides features into groups and normalizes each group separately.
  • Reduces dependency on batch size, making it suitable for small batches.
  • Combines aspects of BN and LN, providing a flexible normalization approach.

Switchable Normalization (SN)

Switchable Normalization (SN) allows each layer to choose the most appropriate normalization method dynamically during training. This adaptability addresses the limitation of fixed normalization methods, offering a solution for diverse applications without manual tuning.

Summary

Normalization techniques like BN, LN, WN, IN, GN, and SN each address specific challenges in deep learning. Understanding their unique properties and applications can help choose the optimal method for a given task, ensuring efficient and stable training of deep neural networks.

转载地址:http://tiajz.baihongyu.com/

你可能感兴趣的文章
【docker知识】联合文件系统(unionFS)原理
查看>>
ORACEL学习--理解over()函数
查看>>
oracle 10g的安装配置
查看>>
Oracle 11g 使用RMAN备份数据库
查看>>
Oracle 11gR2学习之二(创建数据库及OEM管理篇)
查看>>
Oracle 11g中的snapshot standby特性
查看>>
Oracle 11g忘记sys、system、scott密码该这样修改!
查看>>
Oracle 11g数据库安装和卸载教程
查看>>
Oracle 11g超详细安装步骤
查看>>
ORACLE Bug 4431215 引发的血案—原因分析篇
查看>>
oracle dblink 创建使用 垮库转移数据
查看>>
oracle dblink结合同义词的用法 PLS-00352:无法访问另一数据库
查看>>
Oracle dbms_job.submit参数错误导致问题(ora-12011 无法执行1作业)
查看>>
oracle dg switchover,DG Switchover fails
查看>>
Oracle EBS-SQL (BOM-15):检查多层BOM(含common BOM).sql
查看>>
Oracle EBS环境下查找数据源(OAF篇)
查看>>
Oracle GoldenGate Director安装和配置(无图)
查看>>
oracle ogg 单实例双向复制搭建(oracle-oracle)--Oracle GoldenGate
查看>>
oracle scott趣事
查看>>
oracle script
查看>>