搜索资源列表
LPCshare
- 语音信号处理,其中A/B Tool可以提供语音文件的‘A/B’比较,详情请见文件夹‘abtool_1.2 中的’readme文件,其中详细描述了该程序包可以实现的功能。可以免去手工编写同样功能代码的麻烦。还请站长审核!期待您的通知!-voice signal processing, A / B Tool can provide voice document 'A / B', see details folders' abtool_1.2 the 'readme f
soundchange
- 对两个声音进行傅利叶变换,利用A声音的相位和B声音的幅度进行组合,合成一个新的声音-two voices right for Fourier transform, using the voice of the A and B phase margin voice of the portfolio, Synthesis of a new voice
Energynormalization
- Speech Recognition - Numbers 1 to 5 Energy normalization and time alignment References: [1] L. Rabiner and B.H. Juang,Fundamentals of Speech Recognition, Prentice-Hall, 1993. % [2] P.E. Papamichalis, Practical Approaches to Speech Coding, Pre
EnergyNormalizationCepSpec
- Speech Recognition - Numbers 1 to 5 Energy normalization and time alignment References: [1] L. Rabiner and B.H. Juang,Fundamentals of Speech Recognition, Prentice-Hall, 1993. % [2] P.E. Papamichalis, Practical Approaches to Speech Coding, Pre
EZ-B-SDK-Windows-v2011.10.03.00
- 机器人智能控制C#源代码,包括图像跟踪,语音识别C#工程-Intelligent robot control C# source code, including the image tracking, speech recognition, the C# project
SSpara
- 利用B.Sim提出的广义频谱相减法估算的语音增强法,附有相应程序和文献。-B. Sim made use of the broad spectrum subtraction estimate Speech Enhancement Act, with the corresponding procedures and documentation.
HTS-2.1_for_HTK-3.4
- HTS version 2.1 includes hidden semi-Markov model (HSMM) training/adaptation/synthesis, speech parameter generation algorithm considering global variance (GV), SMAPLR/CSMAPLR adaptation, and other minor new features.
G.729code
- 多媒体技术,G729语音编码源代码。参考协议ITU-G.729,Annex B ANSI-C Source Code-Multimedia technology, G729 speech coding source code. Reference to the agreement ITU-G.729, Annex B ANSI-C Source Code
1234567
- 基于b/s架构的tts转换,为初学者提供比较好的参考-Based on b/s architecture tts conversion, to provide good reference for beginners
021127
- 基于b/s架构的语音合成及识别程序!利用js调用actiex控件进行语音识别和合成,值得做语音这块的程序员参考-Based on b/s speech synthesis and recognition framework program! Js call actiex controls using speech recognition and synthesis, it is worth doing voice-piece of the programmer s reference
dianhuabohaoyuyinshibie
- 双音多频 DTMF( Dual Tone Multi-Frequency )信号,是用两个特定的单 音频率信号的组合来代表数字或功能。在 DTMF 电话机中有 16 个按键,其中 10 个数字键 0 — 9 , 6 个功能键 * 、 # 、 A 、 B 、 C 、 D 。其中 12 个按键是我们比较熟悉的按键,另外由第 4 列确定的按键作为保留,作为功能 键留为今后他用。 根据 CCITT 建议,国际上采用 697Hz 、 770Hz 、 852Hz 、 94lHz 低频群及
voicebox
- 此代码包含了语音信息处理的基本上所有要用到的函数。这是语音信号处理的工具箱,对与研究语音信号处理的人来说,这个工具箱是必不可少的-This code contains the voice information processing essentially all use to the function. This is the speech signal processing toolbox, and research on human speech signal processing, th
iLBC1
- Sound, Image and Video Compression and Coding International Hellenic University - Virtual Labs Entropy Coding Quantization Transform Coding MP3 - Psycoacoustics MP3 Compression JPEG MPEG, Block Matching Motion Estimation - P-Frame
SVioceRecognic
- 使用科大讯飞的语音识别和语音合成引擎作为基础,开发的适合SP的一套语音识别应用程序,可通过简单配置应用于现有的IVR业务中。主要文件简单说明:asr.c 系统初始化,入入口点asrch.c 系统核心处理单元DynGra.c 动态语法生成tts.c 语音合成eventq.c 事件队列asrtimer.c 计时器iblock.c 内存管理. -Science and Technology Institute of China to use speech recognition to fly in
watermark
- A great deal of information is now being created, stored, and distributed in digital form. Newspapers, and magazines, for example, have gone online to provide real-time coverage of stories with high-quality audio, images, and even video sequences. Th
anglecos
- 利用夹角余弦距离进行样本数据分类。实现步骤主要分为以下两部分:a、待测样品X与训练集里每个样品Xi的距离采用夹角余弦距离公式计算。b、循环计算待测样品和训练集中各已知样品之间的距离,找出距离待测样品最近的已知样品,该已知样品的类别就是待测样品的类别。-Using the sample data classification Angle cosine distance.Implementation steps are divided into the following two parts: a,
ampp
- 用于提取语音信号的能量谱的函数,ampp(s,a,b,c),x为输入信号,a,b,c用于指定subplot函数的作图位置-A function for extracting the energy spectrum of the speech signal, ampp (s, a, b, c), x is the input signal, a, b, c is used to specify the location of subplot mapping function
zcrr
- 用于计算语音信号过零率的函数zcrr,zcrr(x,a,b,i),x为输入,a,b,c为指定subplot函数的参数-Function zcrr for calculating zero-crossing rate of the speech signal, zcrr (x, a, b, i), x is the input, a, b, c as a function of the parameter specifies subplot
G729Bcoder(C--coder)
- G.729B语音压缩编解码的实现程序,非常详细,C语言编的,亲测有效。-G. 729 b voice compression codec implementation procedures, very detailed, the C language plait, effective measurement.
chenxu
- (1)录制一段语音信号,完成对信号的采样,画出信号的时域波形和频谱图,确定信号的频谱范围; (2)给信号叠加噪声(噪声类型分为如下几种:a白噪声;b单频噪色(正弦干扰);c多频噪声(多正弦干扰);d其它干扰。),画出受噪声干扰的信号时域波形和频谱图; (3)采用窗函数法设计FIR低通滤波器,画出滤波器的频响特性图; (4)用所设计的滤波器对受噪声影响的信号进行滤波,画出滤波后语音信号的时域波形图和频谱图; (5)对滤波前后的信号进行对比,分析信号的变化;回放语音信号,并与原始语音信号对比