Co-Channel Speech Analysis

Abstract: Co-channel speech may be defined as speech that is mixed with the speech of another talker. A co-channel speech signal is generated when two or more people speak simultaneously (e.g., at a cocktail party). Processing the co-channel signal, i.e., speaker identification, speech recognition, etc., still poses problems. Questions such as how many speakers are talking together, and whether it is possible to decide with certainty that a particular speaker is present among them, remain to be answered. In the present work, a spectral correlation technique that uses a predefined recorded signal of the target speaker is proposed for both co-channel speech detection and speaker identification, to help resolve some of these problems. …
This signal represents a great challenge for automatic speech applications. The performance of automatic speech recognition (ASR) and automatic speaker identification (ASI) systems has been shown to degrade significantly in the presence of such a signal.
If the speech signal contains more than one speaker (co-channel speech), the speaker identification system will have trouble making a correct decision about the target speaker. Consequently, most work in speech recognition and speaker identification assumes a speech signal with one or two speakers [1][2][3][4][5][6][10][13]. A complete speech database such as KED TIMIT contains 453 utterances spoken by a single US male speaker [7]. Thus, when dealing with a single-speaker signal, the goal of a speaker identification system is to decide whether that speaker is the target speaker or not. When dealing with a co-channel (i.e., multi-speaker) speech signal, the identification system has to identify the presence of the target speaker within the signal [3][4]. If the co-channel speech signal contains three or more speakers, the problem of speaker identification will be more difficult to …
First, previous work on co-channel multi-speaker signal processing and speaker identification is reviewed. Second, the proposed system model is presented to describe the steps of the solution and the operation of the system. Then, the system is represented mathematically, and its algorithms are designed and tested. Finally, speaker signal files collected from the Corbus database, together with some recorded Arabic files, are used in the experimental work to test the algorithms and the system. The Matlab programming environment is used to design and run the system algorithms, and the output results are plotted and tabulated in Matlab. Conclusions are summarized at the end of the paper. Future work concerning the point of research is also …
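
The paper's own algorithms are not reproduced in this excerpt, so the following is only a rough sketch of what a spectral correlation check against a recorded target speaker might look like. It is written in Python rather than Matlab, and every name in it (magnitude_spectrum, spectral_correlation, tone_mix, add_signals) is hypothetical. It also assumes that "spectral correlation" can be approximated by the Pearson correlation between magnitude spectra, and it uses sums of sinusoids as stand-ins for voiced speech frames. None of these assumptions comes from the paper.

```python
import cmath
import math


def magnitude_spectrum(frame):
    """Magnitude of the DFT of a real frame, positive-frequency bins only."""
    n = len(frame)
    return [
        abs(sum(frame[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n)))
        for k in range(n // 2 + 1)
    ]


def spectral_correlation(x, y):
    """Pearson correlation between the magnitude spectra of two equal-length frames.

    Assumption: this stands in for the paper's spectral correlation measure,
    which is not given in the excerpt.
    """
    a, b = magnitude_spectrum(x), magnitude_spectrum(y)
    mean_a, mean_b = sum(a) / len(a), sum(b) / len(b)
    da = [v - mean_a for v in a]
    db = [v - mean_b for v in b]
    den = math.sqrt(sum(p * p for p in da) * sum(q * q for q in db))
    return sum(p * q for p, q in zip(da, db)) / den if den > 0 else 0.0


def tone_mix(freqs_hz, n=256, rate_hz=8000.0):
    """Synthetic stand-in for one speaker's frame: a sum of unit sinusoids."""
    return [
        sum(math.sin(2 * math.pi * f * t / rate_hz) for f in freqs_hz)
        for t in range(n)
    ]


def add_signals(*signals):
    """Sample-wise sum, modelling speakers mixed into a single channel."""
    return [sum(samples) for samples in zip(*signals)]


# Hypothetical speakers, each reduced to three spectral peaks.
target = tone_mix([250.0, 500.0, 750.0])
interferer = tone_mix([1000.0, 1500.0, 2125.0])
third_speaker = tone_mix([1125.0, 1875.0, 2500.0])

# One co-channel frame that contains the target and one that does not.
co_channel_with_target = add_signals(target, interferer)
co_channel_without_target = add_signals(interferer, third_speaker)

score_with_target = spectral_correlation(target, co_channel_with_target)
score_without_target = spectral_correlation(target, co_channel_without_target)

print("target present:", round(score_with_target, 3))
print("target absent: ", round(score_without_target, 3))
```

In this toy setting the score is clearly higher when the target's spectral peaks are present in the mixture, so a threshold on the score could act as a presence detector. Real speech is non-stationary and speakers overlap in frequency, so a practical system would need framing, windowing, and an empirically chosen threshold, none of which is modelled here.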
