虫虫首页| 资源下载| 资源专辑| 精品软件
登录| 注册

SPEECH

  • 一个自然语言处理的Java开源工具包。LingPipe目前已有很丰富的功能

    一个自然语言处理的Java开源工具包。LingPipe目前已有很丰富的功能,包括主题分类(Top Classification)、命名实体识别(Named Entity Recognition)、词性标注(Part-of SPEECH Tagging)、句题检测(Sentence Detection)、查询拼写检查(Query Spell Checking)、兴趣短语检测(Interseting Phrase Detection)、聚类(Clustering)、字符语言建模(Character Language Modeling)、医学文献下载/解析/索引(MEDLINE Download, Parsing and Indexing)、数据库文本挖掘(Database Text Mining)、中文分词(Chinese Word Segmentation)、情感分析(Sentiment Analysis)、语言辨别(Language Identification)等API。

    标签: LingPipe Java 自然语言处理 开源

    上传时间: 2013-12-04

    上传用户:15071087253

  • 3G Mobile Open Wide Door For E-commerce The 3G mobile Internet business in 3G era will obtain rapid

    3G Mobile Open Wide Door For E-commerce The 3G mobile Internet business in 3G era will obtain rapid development, but it still cannot become the ma in 3G era. In the age of 3G SPEECH business, but is still a subject of value-added business will have great development, the 3G mobile Internet business.

    标签: E-commerce 3G Internet business

    上传时间: 2017-03-23

    上传用户:zjf3110

  • Java Media APIs: Cross-Platform Imaging, Media, and Visualization presents integrated Java media sol

    Java Media APIs: Cross-Platform Imaging, Media, and Visualization presents integrated Java media solutions that demonstrate the best practices for using this diverse collection. According to Sun MicroSystems, "This set of APIs supports the integration of audio and video clips, animated presentations, 2D fonts, graphics, and images, as well as SPEECH input/output and 3D models." By presenting each API in the context of its appropriate use within an integrated media application, the authors both illustrate the potential of the APIs and offer the architectural guidance necessary to build compelling programs.

    标签: Media Java Cross-Platform Visualization

    上传时间: 2013-12-04

    上传用户:hanli8870

  • Face Transfer is a method for mapping videorecorded perfor-mances of one individual to facial animat

    Face Transfer is a method for mapping videorecorded perfor-mances of one individual to facial animations of another. It extracts visemes (SPEECH-related mouth articulations), expressions, and three-dimensional (3D) pose from monocular video or 铿乴m footage.

    标签: videorecorded perfor-mances individual Transfer

    上传时间: 2014-01-11

    上传用户:2467478207

  • The philosophy of the book is to present various pattern recognition tasks in a unified way,

    The philosophy of the book is to present various pattern recognition tasks in a unified way, including image analysis, SPEECH processing, and communication applications. Despite their differences, these areas do share common features and their study can only benefit from a unified approach.

    标签: recognition philosophy pattern present

    上传时间: 2017-05-05

    上传用户:plsee

  • C Algorithms for Real-Time DSP Chapter 5 presents several real-time DSP applications, including spe

    C Algorithms for Real-Time DSP Chapter 5 presents several real-time DSP applications, including SPEECH compression music signal processing radar signal processing and adaptive signal processing techniques.

    标签: applications Algorithms Real-Time DSP

    上传时间: 2013-12-27

    上传用户:王者A

  • this file has codes that describes how to ccmpute the signal spectrum , the power spectrum, how to c

    this file has codes that describes how to ccmpute the signal spectrum , the power spectrum, how to calculate the autocorrelation sequence of a signal, how to calculate the autoregressive coeffecients of a signal,and how to reduce the noisy elements in a SPEECH sample.

    标签: spectrum describes how the

    上传时间: 2017-07-23

    上传用户:金宜

  • Commercially available active noise control headphones rely on fixed analog controllers to drive "an

    Commercially available active noise control headphones rely on fixed analog controllers to drive "anti-noise" loudspeakers. Our design uses an adaptive controller to optimally cancel unwanted acoustic noise. This headphone would be particularly useful for workers who operate or work near heavy machinery and engines because the noise is selectively eliminated. Desired sounds, such as SPEECH and warning signals, are left to be heard clearly. The adaptive control algorithm is implemented on a Texas Instruments (TI™ ) 1 TMS320C30GEL digital signal processor (DSP), which drives a Sony CD550 headphone/microphone system. Our experiments indicate that adaptive noise control results in a dramatic improvement in performance over fixed noise control. This improvement is due to the availability of high-performance programmable DSPs and the self-optimizing and tracking capabilities of the adaptive controller in response to the surrounding noise.

    标签: Commercially controllers headphones available

    上传时间: 2013-12-04

    上传用户:dyctj

  • Microphone Arrays : A Tutorial

    This report presents a tutorial of fundamental array processing and beamforming theory relevant to microphone array SPEECH processing. A microphone array consists of multiple microphones placed at different spatial locations. Built upon a knowledge of sound propagation principles, the multiple inputs can be manipulated to enhance or attenuate signals emanating from particular directions. In this way, microphone arrays provide a means of enhancing a desired signal in the presence of corrupting noise sources. Moreover, this enhancement is based purely on knowledge of the source location, and so microphone array techniques are applicable to a wide variety of noise types. Microphone arrays have great potential in practical applications of SPEECH processing, due to their ability to provide both noise robustness and hands-free signal acquisition.

    标签: Microphone array Tutorial Array Signal Processing

    上传时间: 2016-06-12

    上传用户:halias

  • 基于频率插值的4.0kbps 语音编码器的性能和设计(英文)

    The 4.0 kbit/s SPEECH codec described in this paper is based on a Frequency Domain Interpolative (FDI) coding technique, which belongs to the class of prototype waveform Interpolation (PWI) coding techniques. The codec also has an integrated voice activity detector (VAD) and a noise reduction capability. The input signal is subjected to LPC analysis and the prediction residual is separated into a slowly evolving waveform (SEW) and a rapidly evolving waveform (REW) components. The SEW magnitude component is quantized using a hierarchical predictive vector quantization approach. The REW magnitude is quantized using a gain and a sub-band based shape. SEW and REW phases are derived at the decoder using a phase model, based on a transmitted measure of voice periodicity. The spectral (LSP) parameters are quantized using a combination of scalar and vector quantizers. The 4.0 kbits/s coder has an algorithmic delay of 60 ms and an estimated floating point complexity of 21.5 MIPS. The performance of this coder has been evaluated using in-house MOS tests under various conditions such as background noise. channel errors, self-tandem. and DTX mode of operation, and has been shown to be statistically equivalent to ITU-T (3.729 8 kbps codec across all conditions tested.

    标签: frequency-domain interpolation performance Design kbit_s SPEECH coder based and of

    上传时间: 2018-04-08

    上传用户:kilohorse