KARAOKE FORMATION Pratik Bhanawat (10bec113) Gunjan Gupta Gunjan Gupta (10bec112)

Slides:



Advertisements
Similar presentations
Audio Compression ADPCM ATRAC (Minidisk) MPEG Audio –3 layers referred to as layers I, II, and III –The third layer is mp3.
Advertisements

Sound in multimedia How many of you like the use of audio in The Universal Machine? What about The Universal Computer? Why or why not? Does your preference.
Sound can make multimedia presentations dynamic and interesting.
Chapter 4: Representation of data in computer systems: Sound OCR Computing for GCSE © Hodder Education 2011.
MUSC1010 – WEEK 3 Adobe Audition. Mixing Waves and Phase Figure 1 - Two simple waves in phase Figure 2 - Two simple waves out of phase Figure 3 - two.
Voiceprint System Development Design, implement, test unique voiceprint biometric system Research Day Presentation, May 3 rd 2013 Rahul Raj (Team Lead),
Institute of Information Science Academia Sinica 1 Singer Identification and Clustering of Popular Music Recordings Wei-Ho Tsai
Computational Rhythm and Beat Analysis Nick Berkner.
Improvement of Audio Capture in Handheld Devices through Digital Filtering Problem Microphones in handheld devices are of low quality to reduce cost. This.
Outline Introduction Music Information Retrieval Classification Process Steps Pitch Histograms Multiple Pitch Detection Algorithm Musical Genre Classification.
Introduction The aim the project is to analyse non real time EEG (Electroencephalogram) signal using different mathematical models in Matlab to predict.
/25 Singer Similarity A Brief Literature Review Catherine Lai MUMT-611 MIR March 24,
December 2006 Cairo University Faculty of Computers and Information HMM Based Speech Synthesis Presented by Ossama Abdel-Hamid Mohamed.
Feature Vector Selection and Use With Hidden Markov Models to Identify Frequency-Modulated Bioacoustic Signals Amidst Noise T. Scott Brandes IEEE Transactions.
Toward Semantic Indexing and Retrieval Using Hierarchical Audio Models Wei-Ta Chu, Wen-Huang Cheng, Jane Yung-Jen Hsu and Ja-LingWu Multimedia Systems,
MLP Lyrical Analysis ● % of Unique Words ● # of Unique Words ● Average Word Length ● # of Lyrics ● # of Characters Input Feature Vectors:
Classification of Music According to Genres Using Neural Networks, Genetic Algorithms and Fuzzy Systems.
LYU0103 Speech Recognition Techniques for Digital Video Library Supervisor : Prof Michael R. Lyu Students: Gao Zheng Hong Lei Mo.
Modeling of Mel Frequency Features for Non Stationary Noise I.AndrianakisP.R.White Signal Processing and Control Group Institute of Sound and Vibration.
1 Manipulating Digital Audio. 2 Digital Manipulation  Extremely powerful manipulation techniques  Cut and paste  Filtering  Frequency domain manipulation.
A PRESENTATION BY SHAMALEE DESHPANDE
ABSTRACT: Noise cancellation systems have been implemented to counter the effects of echoes in communications systems. These systems use algorithms that.
GCT731 Fall 2014 Topics in Music Technology - Music Information Retrieval Introduction to MIR Course Overview 1.
IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING MARCH 2010 Lan-Ying Yeh
LE 460 L Acoustics and Experimental Phonetics L-13
Introduction to Interactive Media 10: Audio in Interactive Digital Media.
Knowledge Base approach for spoken digit recognition Vijetha Periyavaram.
SoundSense by Andrius Andrijauskas. Introduction  Today’s mobile phones come with various embedded sensors such as GPS, WiFi, compass, etc.  Arguably,
Multimedia Specification Design and Production 2013 / Semester 2 / week 3 Lecturer: Dr. Nikos Gazepidis
1 ELEN 6820 Speech and Audio Processing Prof. D. Ellis Columbia University Midterm Presentation High Quality Music Metacompression Using Repeated- Segment.
Chapter 11 Audio © 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in.
Speech Enhancement Using Spectral Subtraction
Topic 10 - Image Analysis DIGITAL IMAGE PROCESSING Course 3624 Department of Physics and Astronomy Professor Bob Warwick.
International Conference on Intelligent and Advanced Systems 2007 Chee-Ming Ting Sh-Hussain Salleh Tian-Swee Tan A. K. Ariff. Jain-De,Lee.
Sound and audio. Table of Content 1.Introduction 2.Properties of sound 3.Characteristics of digital sound 4.Calculate audio data size 5.Benefits of using.
Overview of Multimedia A multimedia presentation might contain: –Text –Animation –Digital Sound Effects –Voices –Video Clips –Photographic Stills –Music.
1 Methods for detection of hidden changes in the EEG H. Hinrikus*, M.Bachmann*, J.Kalda**, M.Säkki**, J.Lass*, R.Tomson* *Biomedical Engineering Center.
Jacob Zurasky ECE5526 – Spring 2011
Dan Rosenbaum Nir Muchtar Yoav Yosipovich Faculty member : Prof. Daniel LehmannIndustry Representative : Music Genome.
Audio Thumbnailing of Popular Music Using Chroma-Based Representations Matt Williamson Chris Scharf Implementation based on: IEEE Transactions on Multimedia,
Chapter 15 Recording and Editing Sound. 2Practical PC 5 th Edition Chapter 15 Getting Started In this Chapter, you will learn: − How sound capability.
MUMT611: Music Information Acquisition, Preservation, and Retrieval Presentation on Timbre Similarity Alexandre Savard March 2006.
Structure Discovery of Pop Music Using HHMM E6820 Project Jessie Hsu 03/09/05.
Feature Vector Selection and Use With Hidden Markov Models to Identify Frequency-Modulated Bioacoustic Signals Amidst Noise T. Scott Brandes IEEE Transactions.
Overview ► Recall ► What are sound features? ► Feature detection and extraction ► Features in Sphinx III.
Audio Tempo Extraction Presenter: Simon de Leon Date: February 9, 2006 Course: MUMT611.
Singer similarity / identification Francois Thibault MUMT 614B McGill University.
Voice Activity Detection based on OptimallyWeighted Combination of Multiple Features Yusuke Kida and Tatsuya Kawahara School of Informatics, Kyoto University,
MMDB-8 J. Teuhola Audio databases About digital audio: Advent of digital audio CD in Order of magnitude improvement in overall sound quality.
Singer Similarity Doug Van Nort MUMT 611. Goal Determine Singer / Vocalist based on extracted features of audio signal Classify audio files based on singer.
© 2005, it - instituto de telecomunicações. Todos os direitos reservados. Arlindo Veiga 1,2 Sara Cadeias 1 Carla Lopes 1,2 Fernando Perdigão 1,2 1 Instituto.
Performance Comparison of Speaker and Emotion Recognition
MSc Project Musical Instrument Identification System MIIS Xiang LI ee05m216 Supervisor: Mark Plumbley.
RCC-Mean Subtraction Robust Feature and Compare Various Feature based Methods for Robust Speech Recognition in presence of Telephone Noise Amin Fazel Sharif.
Speech Recognition through Neural Networks By Mohammad Usman Afzal Mohammad Waseem.
Segmenting Popular Music Sentence by Sentence Wan-chi Lee.
[1] National Institute of Science & Technology Technical Seminar Presentation 2004 Suresh Chandra Martha National Institute of Science & Technology Audio.
Vocabulary For Audio Recording.
ARTIFICIAL NEURAL NETWORKS
Vocoders.
Speech Enhancement with Binaural Cues Derived from a Priori Codebook
KARAOKE (Prinsip & Konsep Dasar)
KARAOKE (Prinsip & Konsep Dasar)
Voice Removal from Music
Chapter 4: Representing sound
A maximum likelihood estimation and training on the fly approach
NON-NEGATIVE COMPONENT PARTS OF SOUND FOR CLASSIFICATION Yong-Choon Cho, Seungjin Choi, Sung-Yang Bang Wen-Yi Chu Department of Computer Science &
Presenter: Shih-Hsiang(士翔)
Music Signal Processing
Auditory Morphing Weyni Clacken
Presentation transcript:

KARAOKE FORMATION Pratik Bhanawat (10bec113) Gunjan Gupta Gunjan Gupta (10bec112)

INTRODUCTION Karaoke is a Japanese abbreviated compound word, "kara" comes from "karappo" meaning empty, and "oke" is the abbreviation of "okesutura," or orchestra. Usually, a recorded popular song consists of vocals and accompaniment. Musical works in which only the accompaniment is recorded were named "karaoke." Karaoke singing involves singing to such recorded accompaniments of popular songs in front of a live audience. After the singer chooses a song from a catalogue, lyrics are usually displayed on a monitor, recorded music plays, and it's Showtime for the novice pop star.

INTRODUCTION (cont.) Invented in the late 1970's, the wild popularity of karaoke over the years has swept this form of singing into the mainstream throughout the world. Karaoke creates its own culture, while reflecting much about the wider culture and the place of popular music as a media form.

Brief Of Algorithm Used Aim - This program attempts to create karaoke of a song. It attempts to remove voice and then writes it to a new wav file. Principle - This program utilizes the fact that voice is recorded equally in both channels without any stereo effect.

Brief Of Algorithm Used Method - First one of the channel is high pass filtered. This is done to protect the low end bass which is also recorded equally on both channels. Then this channel is subtracted from the other. The common part that is voice gets cancelled.

Brief Of Algorithm Used Select the high pass filter cut off about Hz. If there is too much voice in the output reduce this value. If it has too little bass increase this value. Note that by doing so voice will also increase.

Drawbacks Output is mono and a bit harsh. Does not remove chorus. Also fails if the voice is not identical in both channels. It takes only.wav as input. You can convert your.mp3 into.wav using any converter.

So, Let’s Begin

Few Other Algorithms ALGORITHM – 1 It is a two step process comprising of a training phase, during which a statistical model is created for a singer’s voice, and a working phase, during which the starting point of the singing voice is detected and a fixed length of testing data is taken from that point. Audio features extracted from this data are then compared against the existing singers’ models to perform singer identification. Singing voice detection is achieved by extracting features of energy, average zero-crossing rate (ZCR), harmonic coefficients and spectral flux computed at regular intervals which are then compared against a set of predetermined thresholds.

Few Other Algorithms ALGORITHM – II A statistical model to classify segments of musical audio into vocal or non-vocal using a Hidden Markov Model (HMM) classifier. The feature extraction is based on sub-band processing that uses the log frequency power coefficients (LFPC) to provide an indication of the energy distribution among sub-bands. The training model also takes into account tempo and song structure information in song modeling based on the observed variations in intra-song signal characteristics.

Future Advancement To Transcribe the lyrics - but this is currently impractical. Transcription of lyrics using speech recognition is an extremely challenging task as singing differs from speech in many ways. The phonetic and timing modification, presence of meaningless syllables often employed by singers and interference of the instrumental background would make an acoustic classifier trained on normal speech a poor match to the acoustics of the sung vocal line.