Lincoln Laboratory, SHBT
 

Daryush's Home
Speech-related Info

Online databases

PubMed (Medical database)
Web of Science (popular scientific journal database)

MIT Libraries (MIT)
Vera catalog (MIT)
MIT Interlibrary Loan (ILLiad)
Treadwell (MGH)

IEEE Xplore (IEEE conference proceedings, magazines, journal articles, ...)
eCommons (MEEI/Harvard)

Reference journals

Acta Acustica united with Acustica (by EAA, since 2001)
    Acustica (1951-1996)
    Acustica united with Acta Acustica (1996-2001)
    Akustische Zeitschrift (1936-1951)
Acta Oto-Laryngologica (eCommons, 1998-Present, except recent 12 months)
Annals of Otology, Rhinology, and Laryngology (eCommons, 2002-Present)
Bell System Technical Journal (MIT archives)
Clinical Otolaryngology (v.24:issue 1 (1999:Jan.) - Present)
Computer Music Journal (v.23 (1999) - present)
Current Opinion in Otolaryngology and Head and Neck Surgery (MIT not subscribed)
European Archives of Otorhinolaryngology (MIT subscribed)
Folia Phoniatrica et Logopaedica - International Journal of Phoniatrics, Speech Therapy, and Communication Pathology
IEEE Signal Processing Letters (1994-Present)
IEEE Transactions on Acoustics, Speech, and Signal Processing (1974-1990)
IEEE Transactions on Audio, Speech, and Language Processing (2005-Present)
IEEE Transactions on Biomedical Signal Processing (1988-Present)
IEEE Transactions on Speech and Audio Processing (1993-2005)
Journal of Phonetics (v.23 (1995) - present)
Journal of Sound and Vibration
Journal of Speech and Hearing Disorders (JSHD) (merged with JSHR in 1991)
Journal of Speech, Language, and Hearing Research (JSHR up to 1997) (4/01/1997 - 12/01/2005)
Journal of the Acoustical Society of America (JASA) (all)
Journal of the Biological Photographic Association - now Journal of Biocommunication
Journal of Voice (MIT not subscribed)
Laryngoscope - The Triological Society (MIT not subscribed, Treadwell)
Lasers in Surgery and Medicine (MIT subscribed, online--v.20:issue 1 - Present)
Logopedics Phoniatrics Vocology - Scandinavian Cooperation Council of Logopedics and Phoniatrics and the British Voice Association (MIT not subscribed)
Machine Vision and Applications (MIT subscribed, v.9:issue 4 - Present)
Methods of Information in Medicine (MIT not subscribed)
Phonoscope (ceased publication, 1999)
Speech Communication (v.16 (1995) - present)


Conferences

Acoustical Society of America (ASA)
INTERSPEECH -- Eurospeech and ICSLP (International Conference on Spoken Language Processing) by Int'l Speech Communication Assoc. (ISCA)
Voice Foundation Annual Symposium
International Conference on Acoustics, Speech, and Signal Processing (ICASSP), see IEEE Xplore

Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), see IEEE Xplore
Vocal Fold Physiology/Vocal Physiology conferences, see Ken Stevens' library

Meeting                  Location                Dates            Abstract/Paper  Deadline      Acceptance
The Triological Society  Orlando, FL             Feb 4-7, 2010    Abstract        Oct 15, 2009  ?
ABEA                     Las Vegas, NV           Apr 28-29, 2010  Abstract        Oct 15, 2009  ?
ALA                      Las Vegas, NV           Apr 28-29, 2010  Abstract        Oct 31, 2009  Feb 2010
Voice Foundation         Philadelphia, PA        Jun 2-6, 2009    Abstract        Oct 31, 2009  ?
ICVPB/ICALB              Madison, WI             Jul 6-10, 2010   Abstract        Nov 1, 2009   Feb 1, 2010
ASA                      Baltimore, MD           Apr 19-23, 2010  Abstract        ?             ?
INTERSPEECH              Makuhari, Japan         Sep 26-30, 2010  Paper           Apr 30, 2010  July 2, 2009
ICASSP                   Prague, Czech Republic  May 22-27, 2011  Paper           ?             ?

Organizations

Acoustical Society of America (ASA)
IEEE Signal Processing Society (SPS)
Voice Foundation
European Acoustics Association (EAA)
American Speech-Language-Hearing Association (ASHA)
Linguistic Data Consortium (LDC)

Labs

MGH Voice Center
Ken Stevens' Speech Communication Group
Speech, Music, and Hearing (TMH) at KTH (Kungliga Tekniska högskolan--Royal Institute of Technology, Stockholm)
UCL Dept. of Phonetics and Linguistics

Center for Spoken Language Research (CSLR), Univ. of Colorado-Boulder
    Robust Speech Processing Laboratory (RSPL)
KayPENTAX

Cochlear Implant Lab (Philip Loizou at UT-Dallas)
National Center for Voice and Speech (Ingo Titze at the Univ. of Iowa)
UCLA Speech Processing and Auditory Perception Laboratory (SPAPL)
CCRMA at Stanford (Julius O. Smith, III)

Speech Tools

Telecommunications and Speech Processing (good for nsp -> wav?)
Audacity (can do wav->mp3)
Goldwave (shareware)
MATLAB (licensed)
    The Mathworks File Exchange for Audio, Video, Speech Processing

    A Matlab Tour of Wavelet Processing by Gabriel Peyré
    Malcolm Slaney's Auditory Toolbox
    Education software for speech coding, Arizona St. Univ.
    Code from RSPL
    COLEA (UT-Dallas)
Praat by Paul Boersma and David Weenink at Univ. of Amsterdam
    Resource links

Speech Filing System (SFS) and associated tools by UCL (University College London):
    Enhance, enhancement of speech intelligibility by subtracting stationary noise and increasing amplitudes, etc.
    ESynth, Synthesis of sounds
    ESystem, learning signals and systems
    RTGram, Real-time spectrogram
    RTSpect, Real-time spectra
    WASP, Windows tool for SPeech analysis
WaveSurfer by KTH
Winpitch (never used it)

Other

The cool AT&T text-to-speech synthesizer (Rosa, Mike, et al.)
Links to acoustics demos and research
Dan Russell's page of acoustics demos

Rarewares (mp3 encoder)
TCL/TK
Audio formats:
    wav
    mp3 (mpeg 1, audio layer 3)
    aac (mpeg 2, part 7)
    xac (extended audio)
    aiff (Apple)
    aifc (Apple-C)
    iff (amiga)
    au (Sun)
    voc (Sound Blaster)
    snd (raw)
    sds (MIDI instrument sample)
    smp (Sample Vision)
    vox (Dialogic)
    flac (free lossless audio codec)
    wma (Windows Media Audio)
    ogg (Ogg Vorbis)

Links for lossy formats [ref]:
    MPEG-4 AAC, used by LiquidAudio and Apple Computer's iTunes Music Store
    AC-3, used in Dolby Digital and one of the authorized audio formats for DVD use
    ATRAC, used in Sony's Minidisc
    MP2, MPEG-1/2 Audio Layer 2, MP3's predecessor
    mp3PRO from Thomson Multimedia combining MP3 with SBR
    MP3, MPEG-1 audio layer 3
    MPC, also known as Musepack (formerly MP+), an open source derivative of MP2 designed for high bit-rates (180 kbit/s)
    QDesign, used in QuickTime at high bitrates
    AMR-WB+, the Extended Adaptive Multi-Rate Wideband codec, optimized for cellular and other limited-bandwidth use
    RealAudio from RealNetworks, frequently in use for streaming on websites

 
Last updated: September 22, 2009