数据预处理

Revision as of 17:36, 14 March 2013 (view source)

Revision as of 17:44, 14 March 2013 (view source)

Line 73:

对于彩色图像，色彩通道间并不存在平稳特性。因此我们通常首先对数据进行特征缩放（使像素值位于<math>[0, 1]</math>区间），然后使用足够大的<tt>epsilon</tt>来做PCA/ZCA。注意在进行PCA变换前需要对特征进行分量均值归零化。

-

=== ~~Audio (MFCC/Spectrograms)/~~音频 (MFCC/频谱图) ===

+

=== 音频 (MFCC/频谱图) ===

-

~~【原文】~~

+

对于音频数据 (MFCC 和频谱图)，每一维度的取值范围（方差）不同。例如MFCC的第一分量是直流分量，通常其幅度远大于其他分量，尤其当特征中包含时域导数(temporal derivatives)时（这是音频处理中的常用方法）更是如此。因此，对这类数据的预处理通常从简单的数据标准化开始（即使得数据的每一维度均值为零、方差为1），然后进行PCA/ZCA白化（使用合适的<tt>epsilon</tt>）。

-

~~For audio data~~ (MFCC ~~and Spectrograms~~)~~, each dimension usually have different scales~~ (~~variances); the first component of MFCCs, for example, is the DC component and usually has a larger magnitude than the other components. This is especially so when one includes the~~ temporal derivatives ~~(a common practice in audio processing~~)~~. As a result, the preprocessing usually starts with simple data standardization (zero-mean, unit-variance per data dimension), followed by PCA~~/~~ZCA whitening (with an appropriate~~ <tt>epsilon</tt>).

+

-

~~【初译】~~

+

=== MNIST 手写数字 ===

-

对于音频数据 (MFCC 和频谱图)，每一维度的尺寸不同（方差）。例如FMCC的第一组件，作为直流成分通常其幅度会远比其他成分大，当包含时域微分时（音频处理中常用方法）尤为如此。结果是预处理通常从简单的数据标准化开始（是数据每一维度都呈零均值和单位方差），然后进行PCA/ZCA白化（使用合适的<tt>epsilon</tt>）。

+

MNIST数据集的像素值在 <math>[0, 255]</math>区间中。我们首先将其缩放到<math>[0, 1]</math>区间。实际上，进行逐样本均值消去也有助于特征学习。''注：也可选择以对MNIST进行PCA/ZCA白化，但这在实践中不常用。''

-

+

-

~~【一审】~~

+

-

对于音频数据 (MFCC 和频谱图)，每一维度的尺度（方差）不同。例如MFCC的第一分量是直流分量，通常其幅度远大于其他分量，尤其当特征中包含时域导数时（这是音频处理中的常用方法）更是如此。因此，对这类数据的预处理通常从简单的数据标准化开始（即使得数据的每一维度均值为零、方差为1），然后使用合适的<tt>epsilon</tt>进行PCA/ZCA白化。

+

-

+

-

=== ~~MNIST Handwritten Digits/~~MNIST 手写数字 ===

+

-

~~【原文】~~

+

-

The MNIST dataset has pixel values in the range <math>[0, 255]</math>. We thus start with simple rescaling to shift the data into the range <math>[0, 1]</math>. In practice, removing the mean-value per example can also help feature learning. ''Note: While one could also elect to use PCA/ZCA whitening on MNIST if desired, this is not often done in practice.''

+

-

+

-

~~【初译】~~

+

-

MNIST数据集的像素值在 <math>[0, 255]</math>区间中。我们首先将之重缩放到<math>[0, 1]</math>区间，实际情况中移除均值也有助于特征学习。''值得注意的是对MNIST进行PCA/ZCA白化也是可行的，但在实践中不常用。''

+

-

+

-

~~【一审】~~

+

-

MNIST数据集的像素值在 <math>[0, 255]</math>区间中。我们首先将其缩放到<math>[0, 1]</math>~~区间。实际上，对这些数据做分量均值归零化也有助于特征学习。~~''~~注：也可以对MNIST进行PCA~~/ZCA白化，但这在实践中不常用。''

+

From Ufldl

Revision as of 17:44, 14 March 2013

Views

Personal tools

ufldl resources

wiki

Search

Toolbox

@@ Line 73: / Line 73: @@
 对于彩色图像，色彩通道间并不存在平稳特性。因此我们通常首先对数据进行特征缩放（使像素值位于<math>[0, 1]</math>区间），然后使用足够大的<tt>epsilon</tt>来做PCA/ZCA。注意在进行PCA变换前需要对特征进行分量均值归零化。
-=== Audio (MFCC/Spectrograms)/音频 (MFCC/频谱图) ===
+=== 音频 (MFCC/频谱图) ===
-【原文】
+对于音频数据 (MFCC 和频谱图)，每一维度的取值范围（方差）不同。例如MFCC的第一分量是直流分量，通常其幅度远大于其他分量，尤其当特征中包含时域导数(temporal derivatives)时（这是音频处理中的常用方法）更是如此。因此，对这类数据的预处理通常从简单的数据标准化开始（即使得数据的每一维度均值为零、方差为1），然后进行PCA/ZCA白化（使用合适的<tt>epsilon</tt>）。
-For audio data (MFCC and Spectrograms), each dimension usually have different scales (variances); the first component of MFCCs, for example, is the DC component and usually has a larger magnitude than the other components. This is especially so when one includes the temporal derivatives (a common practice in audio processing). As a result, the preprocessing usually starts with simple data standardization (zero-mean, unit-variance per data dimension), followed by PCA/ZCA whitening (with an appropriate <tt>epsilon</tt>).
-【初译】
+=== MNIST 手写数字 ===
-对于音频数据 (MFCC 和频谱图)，每一维度的尺寸不同（方差）。例如FMCC的第一组件，作为直流成分通常其幅度会远比其他成分大，当包含时域微分时（音频处理中常用方法）尤为如此。结果是预处理通常从简单的数据标准化开始（是数据每一维度都呈零均值和单位方差），然后进行PCA/ZCA白化（使用合适的<tt>epsilon</tt>）。
+MNIST数据集的像素值在 <math>[0, 255]</math>区间中。我们首先将其缩放到<math>[0, 1]</math>区间。实际上，进行逐样本均值消去也有助于特征学习。''注：也可选择以对MNIST进行PCA/ZCA白化，但这在实践中不常用。''
-【一审】
-对于音频数据 (MFCC 和频谱图)，每一维度的尺度（方差）不同。例如MFCC的第一分量是直流分量，通常其幅度远大于其他分量，尤其当特征中包含时域导数时（这是音频处理中的常用方法）更是如此。因此，对这类数据的预处理通常从简单的数据标准化开始（即使得数据的每一维度均值为零、方差为1），然后使用合适的<tt>epsilon</tt>进行PCA/ZCA白化。
-=== MNIST Handwritten Digits/MNIST 手写数字 ===
-【原文】
-The MNIST dataset has pixel values in the range <math>[0, 255]</math>. We thus start with simple rescaling to shift the data into the range <math>[0, 1]</math>. In practice, removing the mean-value per example can also help feature learning. ''Note: While one could also elect to use PCA/ZCA whitening on MNIST if desired, this is not often done in practice.''
-【初译】
-MNIST数据集的像素值在 <math>[0, 255]</math>区间中。我们首先将之重缩放到<math>[0, 1]</math>区间，实际情况中移除均值也有助于特征学习。''值得注意的是对MNIST进行PCA/ZCA白化也是可行的，但在实践中不常用。''
-【一审】
-MNIST数据集的像素值在 <math>[0, 255]</math>区间中。我们首先将其缩放到<math>[0, 1]</math>区间。实际上，对这些数据做分量均值归零化也有助于特征学习。''注：也可以对MNIST进行PCA/ZCA白化，但这在实践中不常用。''