librosa write_wav in mono?
问题描述:
我想重新采样在40.000赫兹到44100赫兹录制的单声道录音。librosa write_wav in mono?
下面的作品,但librosa代码似乎立体声保存,使文件的两倍这是没有必要的规模和我有很多的样品进行处理的。
所以我需要保存结果单。
代码:
# resampling a .wav file to a specific sample rate
import os
import librosa
import resampy
# this is the sample reate we want
sr_target = 44100
directory_in_str = '/home/hugo/test/'
directory = os.fsencode(directory_in_str)
for file in os.listdir(directory):
filename = os.fsdecode(file)
if filename.endswith(".wav"):
file_path = os.path.join(directory_in_str, filename)
print(file_path)
# Load in librosa's example audio file at its native sampling rate
x, sr_orig = librosa.load(file_path, mono=True, sr=None)
print("Original sample rate is : ", sr_orig)
# x is now a 1-d numpy array, with `sr_orig` audio samples per second
# We can resample this to any sampling rate we like, say 16000 Hz
y = resampy.resample(x, sr_orig, sr_target)
file_path_new = os.path.join(directory_in_str+'new/', filename)
# write it back
librosa.output.write_wav(file_path_new, y, sr_target)
continue
else:
continue
问:我想保存单重采样文件,我得到立体声,没有选项只保存单...
答
输出单声道或立体声取决于y
。如果y
具有(n,)的形状,则输出是单声道;如果y
具有(2,n)的形状,则输出是立体声。 librosa.output.write_wav
不会自动将单声道信号转换为立体声。
从你的代码,你的输出音频似乎是一个立体声音频。该文件大小的两倍并不意味着它是立体声。它可能是由输入和输出音频的不同数据类型引起的。