Cepstral Publications
- Laura Mayfield Tomokiyo, Carol J. Sisson, Alan W Black (2006).
Mixed-mode Multilinguality in TTS: The Case of Canadian French. Multiling, Stellenbosch, South Africa, 2006.
- Laura Mayfield Tomokiyo, Alan W Black, Kevin A. Lenzo (2005).
Foreign Accents in Synthetic Speech: Development and Evaluation. Eurospeech, Lisbon, 2005.
- Laura Mayfield Tomokiyo (2005).
Speaking Globally: Building the Voices of the World. SpeechTEK, San Francisco, Spring 2005.
- Alan Black and Kevin Lenzo (2004).
Multilingual Text-to-Speech Synthesis. ICASSP, Montreal, Canada, 2004.
- Laura Mayfield Tomokiyo (2004).
Please Understand Me! Toward Modeling of Non-native Speech in Automatic Speech Recognition. Invited talk, Acoustical Society of America, San Diego, November 2004.
- Alan Black and Kevin Lenzo (2003).
Optimal Utterance Selection for Unit Selection Speech Synthesis Databases. International Journal of Speech Technology, 6(4):357-363, October 2003, Kluwer Academic Publishers.
- Laura Mayfield Tomokiyo, Alan W Black, Kevin A. Lenzo (2003).
Arabic in my Hand: Small-footprint Synthesis of Egyptian Arabic. Eurospeech, Geneva, 2003.
- Waibel, A., Badran, A., Black, A., Frederking, R., Gates, D., Lavie, A., Levin, L., Lenzo, K., Mayfield Tomokiyo, L., Reichert, J., Schultz, T., Wallace, D., Woszczyna, M., and Zhang, J. (2003).
Speechalator: two-way speech-to-speech translation on a consumer PDA. Eurospeech, Geneva, 2003.
- Alan Black (2002).
Perfect Synthesis for all of the people all of the time. Keynote, IEEE TTS Workshop, Santa Monica, CA, 2002.
- Kevin Lenzo and Alan Black (2002).
Customized synthesis: blending and tiering. AVIOS2002, San Jose, CA, 2002.
Related publications: Alan W Black
2005
- Suebvisai, S., Charoenpornsawat, P., Black, A., Woszczyna, M., and
Schultz, T.,
Thai Automatic Speech Recognition,
ICASSP, Philadelphia, Pennsylvania, 2005.
(pdf)
- Langner, B. and Black, A.,
Improving the Understandability of Speech Synthesis by Modeling Speech in Noise
ICASSP, Philadelphia, Pennsylvania, 2005.
(pdf)
- Bennett, C. and Black, A.,
Prediction of Pronunciation Variations for Speech Synthesis: A Data-driven Approach
ICASSP, Philadelphia, Pennsylvania, 2005.
(pdf)
- Toda, T., Black, A., and Tokuda, K.
Spectral Conversion Based on Maximum Likelihood Estimation
Considering Global Variance of Converted Parameter
ICASSP, Philadelphia, Pennsylvania, 2005.
(pdf)
- Carbonell, J., Lavie, A., Levin, L., and Black, A.
Language Technologies for Humanitarian Aid,
in Technology for Humanitarian Action, ed. K. Cahill, Fordham
University Press, 2005.
2004
- Kominek, J., and Black, A. (2004)
A Family-of-Models Approach to HMM-based Segmentation
for Unit Selection Speech Synthesis, ICSLP2004, Jeju, Korea,
(pdf)
- Toda, T., and Black, A., and Tokuda, K. (2004)
Acoustic-to-Articulatory Inversion Mapping with Gaussian
Mixture Model,
ICSLP2004, Jeju, Korea,
(pdf)
- Maskey, S., Tomokiyo, L., and Black, A. (2004)
Bootstrapping Phonetic Lexicons for New Languages,
ICSLP2004, Jeju, Korea,
(pdf)
- Harris, T., Banerjee, S., Rudnicky, A., Sison, J., Bodine, K. and
Black, A. (2004)
A research platform for multi-agent dialogue dynamics
Proceedings of The IEEE International Workshop on Robotics and Human Interactive Communications.
(pdf)
- Tokuda, K., Zen, H. and Black, A. (2004)
An HMM-based approach to multilingual speech synthesis,
in Narayanan, S. and Alwan, A. (eds) "Text to Speech Synthesis: New Paradigms and Advances", Prentice Hall.
-
Toda, T., Black, A. and Tokuda, K. (2004)
Mapping from Articulatory Movements to Vocal Tract Spectrum with Gaussian
Mixture Model for Articulatory Speech Synthesis, pp 31-36,
5th ISCA Speech Synthesis Workshop, Pittsburgh, PA.
(pdf)
-
H/Mariam, S., Kishore, S., Black, A., Kumar, R., and Sangal, R. (2004)
Unit Selection Voice for Amharic Using Festvox
pp 103-107,
5th ISCA Speech Synthesis Workshop, Pittsburgh, PA.
(pdf)
-
Kominek, J. and Black, A. (2004)
Impact of durational outlier removal from unit selection catalogs
pp 155-160,
5th ISCA Speech Synthesis Workshop, Pittsburgh, PA.
(pdf)
-
Zhang, J., Toth, A., Collins-Thompson, K. and Black, A. (2004)
Prominence Prediction For Super-Sentential Prosodic Modeling Based On
A New Database,
pp 203-208,
5th ISCA Speech Synthesis Workshop, Pittsburgh, PA.
(pdf)
-
Kominek, J. and Black, A. (2004)
The CMU Arctic speech databases
pp 223-224,
5th ISCA Speech Synthesis Workshop, Pittsburgh, PA.
(pdf)
-
Langner, B. and Black, A. (2004)
Creating A Database Of Speech In Noise For Unit Selection Synthesis
pp 229-230,
5th ISCA Speech Synthesis Workshop, Pittsburgh, PA.
(pdf)
-
Black, A. and Lenzo, K. (2004)
Multilingual Text-to-Speech Synthesis
ICASSP 2004, Montreal, Canada.
(pdf)
-
Schultz, T., Alexander, D., Black, A., Petersen, K., Suebvisai, S. and
Waibel, A.
(2004)
A Thai Speech Translation System For Medical Dialogs
HLT/NAACL 2004, Boston, MA.
(pdf)
2003
-
Raux, A. and Black, A. (2003)
A Unit Selection Approach to F0 Modeling and Its Application to Emphasis
ASRU 2003, St Thomas, US Virgin Is.
(pdf)
- Black, A. and Lenzo, K. (2003) Optimal Utterance Selection for Unit Selection Speech Synthesis Databases
International Journal of Speech Technology, 6(4):357-363, October 2003,
Kluwer Academic Publishers.
-
Kishore, S., Black, A., Kumar, R., and Sangal, R. (2003) Experiments
with Unit Selection Speech Databases for Indian Languages
Presented at National seminar on Language Technology Tools:
Implementation of Telugu October 2003, Hyderabad, INDIA
(pdf)
- Kominek, J. and Black, A. (2003) CMU ARCTIC databases for speech synthesis
CMU Language Technologies Institute, Tech Report CMU-LTI-03-177
(pdf, data).
-
Black, A. (2003) Unit Selection and Emotional Speech,
Eurospeech 2003, Geneva, Switzerland.
(pdf, html)
-
Mayfield Tomokiyo, L., Black, A. and Lenzo, K. (2003)
Arabic in my Hand: Small-footprint Synthesis of Egyptian Arabic, Eurospeech 2003, Geneva,
Switzerland.
(pdf, html)
-
Kishore, S. and Black, A. (2003) Unit Size in Unit Selection Speech Synthesis, Eurospeech 2003, Geneva, Switzerland.
(pdf, html)
-
Raux, A., Langner, B., Black, A. and Eskenazi, M. (2003)
LET'S GO: Improving Spoken Dialog Systems for the Elderly and Non-natives,
Eurospeech 2003, Geneva, Switzerland.
(pdf, html)
-
Zhang, J., Black, A. and Sproat, R. (2003)
Identifying Speakers in Children's Stories for Speech Synthesis,
Eurospeech 2003, Geneva, Switzerland.
(pdf, html)
-
Waibel, A., Badran, A., Black, A., Frederking, R., Gates, D., Lavie, A.,
Levin, L., Lenzo, K., Mayfield Tomokiyo, L., Reichert, J., Schultz, T.,
Wallace, D., Woszczyna, M., and Zhang, J. (2003)
Speechalator: two-way speech-to-speech translation on a consumer PDA,
Eurospeech 2003, Geneva, Switzerland.
(pdf, html)
-
Bennett, C. and Black, A. (2003) Using Acoustic Models to Choose
Pronunciation Variations for Synthetic Voices, Eurospeech 2003,
Geneva, Switzerland.
(pdf, html)
-
Kominek, J., Bennett, C. and Black, A. (2003) Evaluating and
Correcting Phoneme Segmentation for Unit Selection Synthesis,
Eurospeech 2003, Geneva, Switzerland.
(pdf, html)
-
Waibel, A., Badran, A., Black, A., Frederking, R., Gates, D., Lavie, A.,
Levin, L., Lenzo, K., Mayfield Tomokiyo, L., Reichert, J., Schultz, T.,
Wallace, D., Woszczyna, M., and Zhang, J. (2003)
Speechalator: two-way speech-to-speech translation in your hand
Demo at HLT-NAACL2003, Edmonton, Canada.
(pdf, html)
2002
-
Black, A. (2002)
Perfect Synthesis for all of the people all of the time. Keynote,
IEEE TTS Workshop 2002, Santa Monica, CA.
(pdf, html, slides.pdf)
-
Black, A. and Font Llitjos, A. (2002)
Unit selection without a phoneme set
IEEE TTS Workshop 2002, Santa Monica, CA.
(pdf, html)
-
Tokuda, K., Zen, H., and Black, A. (2002)
An HMM-Based Speech Synthesis System applied to English
IEEE TTS Workshop 2002, Santa Monica, CA.
(pdf)
-
Black, A., Brown, R., Frederking, R., Lenzo, K., Moody, J., Rudnicky, A., Singh, R., and Steinbrecher, E. (2002)
Rapid Development of Speech-to-Speech Translation Systems
ICSLP2002, Denver, CO.
(pdf)
-
Bennett, C., Font Llitjos, A., Shriver, S., Rudnicky, A. and Black, A. (2002)
Building VoiceXML-based applications
ICSLP2002, Denver, CO.
(pdf)
-
Tokuda, K., Zen, H., and Black, A. (2002)
An HMM-based Approach to English Speech Synthesis
Proc. of Autumn Meeting of the Acoustical Society of Japan, 3-10-15, Sep. 2002.
-
Lenzo, K. and Black, A. (2002)
Customized synthesis: blending and tiering
AVIOS2002, San Jose, CA.
-
Frederking, R., Black, A., Brown, R., Rudnicky, A., Moody, J., and Steinbrecher, E. (2002)
Speech Translation on a Tight Budget Without Enough Data,
ACL-02 Workshop on Speech-to-Speech Translation: Algorithms and Systems, Philadelphia, PA.
-
Black, A., Eskenazi, M. and Simmons, R. (2002)
Elderly perception of speech from a computer,
143rd Meeting: Acoustical Society of America, Pittsburgh, PA, June 2002.
(slides)
-
Font Llitjos, A., and Black, A. (2002)
Evaluation and collection of proper name pronunciations online,
LREC2002, Las Palmas, Canary Islands.
(pdf)
-
Frederking, R., Black, A., Brown, R., Moody, J. and Steinbrecher, E. (2002)
Field Testing the Tongues Speech-to-Speech Machine Translation System,
LREC2002, Las Palmas, Canary Islands.
(pdf)
- Black, A., Brown, R., Frederking, R., Singh, R., Moody, J. and Steinbrecher, E. (2002) TONGUES: Rapid Development of a Speech-to-Speech
Translation System, HLT2002, San Diego, California.
(postscript, html)
2001
- Black, A., Dusterhoff, K., and Taylor, P. (2001)
Using the Tilt Intonation Model: A Data-Driven Approach,
in Damper, R. (ed.) "Data-Driven Techniques in Speech Synthesis",
Kluwer, Dordrecht, The Netherlands.
-
Font Llitjos, A. and Black, A. (2001) Knowledge of Language Origin
Improves Pronunciation Accuracy of Proper Names,
Eurospeech 2001, Aalborg, Denmark.
(pdf)
- Eskenazi, M. and Black, A. (2001) A study on speech over the telephone and aging,
Eurospeech 2001, Aalborg, Denmark.
(postscript, html)
- Black, A. and Lenzo, K. (2001) Optimal Data Selection for Unit Selection Synthesis, pp 63-67,
ISCA, 4th Speech Synthesis Workshop, Scotland.
(postscript, html)
- Black, A. and Lenzo, K. (2001) Flite: a small fast run-time synthesis engine, pp 157-162,
ISCA, 4th Speech Synthesis Workshop, Scotland.
(postscript, html)
- Sproat, R., Black, A., Chen, S., Kumar, S., Ostendorf, M. and Richards, C.
(2001) Normalization of Non-standard Words, Computer Speech and
Language 15(3) pp 287-333.
- Taylor, P., Black, A., and Caley, R.
(2001) Heterogeneous Relation Graphs as a Mechanism for Representing
Linguistic Information, Speech Communication
33 pp 153-174.
2000
- Black, A. and Lenzo, K. (2000) Limited Domain Synthesis,
ICSLP2000, Beijing, China.
(postscript, html)
- Lenzo, K. and Black, A. (2000) Diphone collection and Synthesis,
ICSLP2000, Beijing, China.
(postscript, html)
- Chotimongkol, A. and Black, A. (2000)
Statistically trained orthographic to sound models for Thai,
ICSLP2000, Beijing, China.
(postscript, html)
- Olinsky, C. and Black, A. (2000)
Non-Standard Word and Homograph Resolution for Asian Language
Text Analysis,
ICSLP2000, Beijing, China.
(postscript, html)
- Shriver, S., Black, A. and Rosenfeld, R. (2000)
Audio Signals in Speech Interfaces,
ICSLP2000, Beijing, China.
(postscript, html)
- Rudnicky, A., Bennet, T., Black, A., Chotimongkol, A., Lenzo, K., Oh, A.
and Singh, R. (2000)
Task and Domain Specific Modelling in the Carnegie Mellon
Communicator System,
ICSLP2000, Beijing, China.
(postscript, html)
- Rosenfeld, R., Zhu, X., Toth, A., Shriver, S., Lenzo, K. and Black, A.
(2000)
Towards a Universal Speech Interface,
ICSLP2000, Beijing, China.
(postscript, html)
-
Black, A. and Lenzo, K. (2000) Building Voices in the Festival
Speech Synthesis System, DRAFT (updated 2003)
(postscript)
(html)
1999
-
Paul Taylor and Alan W Black (1999). Speech Synthesis by Phonological Structure Matching, in Eurospeech99 postscript
-
Janet Hitzeman, Alan W. Black, Chris Mellish, Jon Oberlander, Massimo Poesio and
Paul Taylor (1999). An Annotation Scheme for Concept-to-Speech Synthesis,
in Proceedings of the European Workshop on
Natural Language Generation, pp. 59-66.
postscript
-
Kurt E. Dusterhoff, Alan W. Black and Paul A. Taylor (1999).
Using Decision Trees within the Tilt
Intonation Model to Predict F0 Contours, in Eurospeech 99
postscript
1998
-
Black, A., Lenzo, K. and Pagel, V. (1998) Issues in Building General Letter
to Sound Rules
(postscript, html)
3rd ESCA Workshop on Speech Synthesis, pp. 77-80, Jenolan
Caves, Australia,
-
Syrdal, A., Moehler, G., Dusterhoff, K., Conkie, A, and Black, A. (1998)
Three Methods of Intonation Modeling, 3rd ESCA Workshop on Speech
Synthesis, pp. 305-310, Jenolan Caves, Australia,
postscript
-
Taylor, P., Black, A. and Caley, R. (1998) The architecture of the
Festival Speech Synthesis System,
(postscript, html)
3rd ESCA Workshop
on Speech Synthesis, pp. 147-151, Jenolan Caves, Australia,
-
Hitzeman, J., Black, A., Mellish, C., Oberlander, J. and Taylor, P. (1998)
On the Use of Automatically Generated Discourse-level Information in a
Concept-to-Speech Synthesis System
(postscript)
ICSLP98 vol 6 pp 2763-2768, Sydney, Australia.
-
Pagel, V., Lenzo, K. and Black, A. (1998) Letter to sound rules for
accented lexicon compression
(postscript)
ICSLP98, vol 5 pp 2015-2020, Sydney, Australia
-
Sproat, R., Hunt, A., Ostendorf, M., Taylor, P., Black, A., Lenzo, K.
and Edgington, M. (1998) SABLE: A standard for TTS markup
(postscript)
ICSLP98, vol 5, pp 1719-1724, Sydney, Australia, also in
3rd ESCA Workshop
on Speech Synthesis, pp. 27-30, Jenolan Caves, Australia,
-
Taylor, P. and Black, A. (1998).
Assigning Phrase Breaks from part-of-speech Sequences
(postscript, html)
Computer Speech and Language 12, 99-117.
1997
-
Black, A. and Taylor, P. (1997).
Assigning Phrase Breaks from Part-of-Speech Sequences
(pdf, html)
Proceedings of Eurospeech 97, vol2 pp 995-998, Rhodes, Greece.
-
Black, A. and Taylor, P. (1997).
Automatically clustering
similar units for unit selection in speech synthesis
(postscript, html)
Proceedings of Eurospeech 97, vol2 pp 601-604, Rhodes, Greece.
-
Dusterhoff, K. and Black, A. (1997).
Generating F0 contours for speech synthesis using the Tilt intonation theory
(postscript,
html)
Proceedings of ESCA Workshop on Intonation, pp 107-110, September,
Athens, Greece.
-
Black, A. and Taylor, P. (1997).
Festival Speech Synthesis System:
system documentation (1.1.1)
Human Communication Research Centre Technical Report HCRC/TR-83.
1996
-
Black, A. and Hunt, A. (1996).
Generating F0 contours from ToBI labels using linear regression
Proceedings of ICSLP 96, vol 3, pp 1385-1388, Philadelphia, Penn.
-
Campbell, N. and Black, A. (1996).
CHATR: a multi-lingual speech re-sequencing synthesis system
(In Japanese) Institute of Electronics, Information and Communication
Engineers, Spring Meeting, Tokyo SP-96-07.
-
Hunt, A. and Black, A. (1996).
Unit selection in a concatenative speech
synthesis system using a large speech database Proceedings of
ICASSP 96, vol 1, pp 373-376, Atlanta, Georgia.
-
Campbell, N. and Black, A. (1996)
Prosody and the Selection of Source Units for Concatenative Synthesis,
in "Progress in speech synthesis", eds
J. van Santen, R Sproat, J Olive and J. Hirschberg, pp 279-282,
Springer Verlag.
1995
-
Black, A. and Campbell, N. (1995).
Optimising selection of units from speech databases for concatenative
synthesis Eurospeech 95 vol 1, pp 581-584, Madrid, Spain.
- Black, A. and Campbell, N. (1995)
Predicting the intonation of discourse segments from examples in dialogue
speech, (Short version) ESCA workshop on spoken dialogue systems,
Denmark.
- Black, A. (1995) Predicting the
intonation of discourse segments from examples in dialogue speech,
ATR Workshop on Computational modeling of prosody for spontaneous speech
processing. ATR, Japan. Republished in "Computing Prosody," eds. Y.
Sagisaka, N. Campbell and N. Higuchi, Springer Verlag, 1997.
- Black, A. (1995) Comparison of
algorithms for predicting accent placement in English speech synthesis
Spring meeting of the Acoustical Society of Japan.
1994
- Black, A. and Taylor, P. (1994) Assigning
intonation elements and prosodic phrasing for English speech synthesis
from high level linguistic input, ICSLP94, Yokohama, Japan.
- Taylor, P. and Black, A. (1994)
Synthesizing Conversational Intonation
from a Linguistically Rich Input, Proc. ESCA Workshop
on Speech Synthesis, Mohonk, NY.
- Black, A. and Taylor, P. (1994) CHATR:
a generic speech synthesis system, COLING94, II pp 983-986,
Kyoto, Japan.
- Black, A. and Taylor, P. (1994) A
framework for generating prosody from high level linguistic
descriptions, Spring meeting of the
Acoustical Society of Japan.
- Black, A. (1993) Some different
approaches to DRT, DYANA-II deliverable, R3.2.
- Black, A. (1993), Using Situation
Theory in a computational language for natural language processing,
4th Natural Language Understanding and Logic Programming
Conference, Nara, Japan.
- Black, A. (1993) A situation theoretic
approach to computational semantics, PhD Thesis, Dept of AI,
University of Edinburgh.
- Black, A. (1993), Using a
computational situation theoretic language to investigate
contemporary semantic formalisms, Schloss Dagstuhl Seminar
IBFI, report 57.
- Black, A. (1992) Embedding DRT in
a Situation Theoretic Framework,
pp 1116-1120, COLING92, Nantes, France.
Language Modelling in Speech Recognition
- Foster J, Matheson C, and Black A. (1990)
Modelling Linguistic Constraints for Continuous Speech Recognition
Using Context Free Phrase Structure Grammar, VERBA 90,
International Conference on Speech Technologies, Rome.
- Black, A. (1989) Finite State
Machines from Feature Grammars,
pp 277-285,
International Workshop on Parsing Technologies, Carnegie Mellon University,
Pittsburgh, PA.
Lexicons and Morphology
- Ritchie G, Russell G, Black A and Pulman S. (1992)
Computational Morphology:
practical mechanisms for the English Lexicon,
MIT Press, Cambridge, Mass.
- Black A, van de Plassche J, Williams B. (1991)
Analysis of Unknown Words
through Morphological Decomposition,
pp 101-106,
5th Conference of the European Chapter of the Association for
Computational Linguistics, Berlin, Germany.
- Black A. (1990)
A computational description of
Japanese morphology,
unpublished manuscript, Dept of AI, University of Edinburgh.
- Pulman S, Russell G, Ritchie G and Black A. (1988)
Computational Morphology of English,
Linguistics Volume 26-4:545-560.
- Black A, Ritchie G, Pulman S, and Russell G. (1987)
Formalisms for Morphographemic
Description,
pp 11-18, Proceedings of 3rd Conference
of the European Chapter of the Association for Computational
Linguistics. Copenhagen, Denmark.
- Ritchie G, Pulman S, Black A and Russell G. (1987)
A Computational Framework for
Lexical Description,
Journal of Computational Linguistics,
13,3-4:290-307.
- Russell G, Pulman S, Ritchie G, and Black A. (1986)
Dictionary and Morphological Analyser for English, pp 277-279,
Proceedings of the 11th International Conference on Computational
Linguistics. Bonn, West Germany.
Others
- Black A. (1986)
Formal properties of feature grammars,
unpublished paper, Dept of AI, University of Edinburgh.
- Black A. (1986)
VLSI Design for Context-free Grammar Parsing, Master's Thesis,
Dept of AI, University of Edinburgh.
- Black A (1984)
A Knowledge Based System for Wood Anatomy and
Usage Correlation, Final year dissertation, Dept of
Computer Science, Coventry (Lanchester) Polytechnic.
- Black A (1984)
Complexity Theory and NP-Completeness
Dept of Computer Science, Coventry (Lanchester) Polytechnic.
Related publications: Laura Mayfield Tomokiyo
1993-2001