Speech Compression Using Wavelets


Team Members:

Deepak Kumar,
3rd B-Tech,
E.C.E Branch,
JNTU College of Engineering,
Kakinada.
E-Mail: deepakjntu427@yahoo.com

Ch. Naresh Kumar,
3rd B-Tech,
E.C.E Branch,
JNTU College of Engineering,
Kakinada.

ABSTRACT
Speech compression is the technology of converting human speech into an efficiently encoded representation that can later be decoded to produce a close approximation of the original signal. The wavelet transform decomposes the original signal into wavelet coefficients at different scales and positions. These coefficients represent the signal in the wavelet domain, and all data operations can be performed using just the corresponding wavelet coefficients. The major issues in the design of this wavelet-based speech coder are choosing optimal wavelets for speech signals, the decomposition level in the DWT, the thresholding criteria for coefficient truncation, and efficient encoding of the truncated coefficients. The performance of the wavelet compression scheme on both male and female spoken sentences is compared. On a male spoken sentence the scheme reaches a signal-to-noise ratio of 17.45 dB and a compression ratio of 3.88, using a level-dependent thresholding approach.
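The pipeline summarized in the abstract (decompose, truncate coefficients per level, reconstruct, then measure signal-to-noise ratio and compression ratio) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and not the authors' coder: it uses the Haar wavelet because it is the simplest to write out, a synthetic two-tone signal instead of recorded speech, a hypothetical `keep_fraction` parameter as the level-dependent thresholding rule, and a compression ratio defined as total coefficients divided by retained nonzero coefficients. All function names are hypothetical.

```python
import math

SQRT_HALF = 1 / math.sqrt(2)

def haar_dwt(signal):
    """One level of the orthonormal Haar DWT: returns (approximation, detail)."""
    approx = [(signal[i] + signal[i + 1]) * SQRT_HALF for i in range(0, len(signal), 2)]
    detail = [(signal[i] - signal[i + 1]) * SQRT_HALF for i in range(0, len(signal), 2)]
    return approx, detail

def haar_idwt(approx, detail):
    """Inverse of haar_dwt: interleaves the reconstructed sample pairs."""
    out = []
    for a, d in zip(approx, detail):
        out.extend([(a + d) * SQRT_HALF, (a - d) * SQRT_HALF])
    return out

def wavedec(signal, levels):
    """Multi-level decomposition, ordered [cA_n, cD_n, ..., cD_1].
    Assumes len(signal) is divisible by 2**levels."""
    coeffs = []
    approx = list(signal)
    for _ in range(levels):
        approx, detail = haar_dwt(approx)
        coeffs.insert(0, detail)
    coeffs.insert(0, approx)
    return coeffs

def waverec(coeffs):
    """Rebuilds the signal from the list produced by wavedec."""
    approx = coeffs[0]
    for detail in coeffs[1:]:
        approx = haar_idwt(approx, detail)
    return approx

def threshold_by_level(coeffs, keep_fraction):
    """Level-dependent hard thresholding (assumed rule): each detail band gets
    its own threshold, set so that roughly keep_fraction of its coefficients
    (the largest in magnitude) survive. The approximation band is untouched."""
    out = [list(coeffs[0])]
    for detail in coeffs[1:]:
        n_keep = max(1, int(len(detail) * keep_fraction))
        thr = sorted((abs(c) for c in detail), reverse=True)[n_keep - 1]
        out.append([c if abs(c) >= thr else 0.0 for c in detail])
    return out

def snr_db(original, reconstructed):
    """Signal-to-noise ratio in dB between original and reconstruction."""
    signal_power = sum(x * x for x in original)
    noise_power = sum((x - y) ** 2 for x, y in zip(original, reconstructed))
    if noise_power == 0:
        return float("inf")
    return 10 * math.log10(signal_power / noise_power)

def compression_ratio(coeffs):
    """Assumed definition: total coefficients / retained nonzero coefficients."""
    total = sum(len(band) for band in coeffs)
    nonzero = sum(1 for band in coeffs for c in band if c != 0.0)
    return total / nonzero

# Synthetic stand-in for a speech frame: two sinusoids, 256 samples.
signal = [math.sin(2 * math.pi * 5 * n / 256) + 0.3 * math.sin(2 * math.pi * 40 * n / 256)
          for n in range(256)]
coeffs = wavedec(signal, levels=3)
compressed = threshold_by_level(coeffs, keep_fraction=0.25)
restored = waverec(compressed)
print("SNR (dB):", round(snr_db(signal, restored), 2))
print("Compression ratio:", round(compression_ratio(compressed), 2))
```

Raising `keep_fraction` trades a lower compression ratio for a higher signal-to-noise ratio. The numbers this sketch prints are not comparable to the 17.45 dB and 3.88 reported above, since the signal, the wavelet and the encoding all differ.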

1. INTRODUCTION
Speech is a very basic way for humans to convey information to one another. With a bandwidth of only 4 kHz, speech can convey information with the emotion of a human voice. People want to be able to hear someone’s voice from anywhere in the world as if the person were in the same room. As a result, greater emphasis is being placed on the design of new and efficient speech coders for voice communication and transmission. Today, applications of speech coding and compression have become very numerous. This paper looks at a new technique for analyzing and compressing speech signals using wavelets. Any signal can be represented by a set of scaled and translated versions of a basic function called the mother…



References:
[1] A. Chen, N. Shehad, A. Virani and E. Welsh, Discrete Wavelet Transform for Audio Compression, (current July 16, 2001).
[2] Nikhil Rao, Speech Compression Using Wavelets.
[3] S. Haykin, Communication Systems, John Wiley & Sons, New York, 2001.
