Project publications and presentations

We ask that all publications using the AusTalk corpus include the following acknowledgement:

'The AusTalk corpus was collected as part of the Big ASC project (Burnham et al. 2009; Wagner et al. 2010; Burnham et al. 2011), funded by the Australian Research Council (LE100100211). See: for details.'


Project publications

Cassidy, S., Estival, D., Cox, F. (2017). ‘Case Study: the AusTalk Corpus’. Handbook of Linguistic Annotation, Nancy Ide and James Pustejovsky, eds. Springer.

Sui, Chao, Togneri, Roberto, and Bennamoun, Mohammed (2015). 'Extracting Deep Bottleneck Features For Visual Speech Recognition', in Proceedings of ICASSP 2015, pp. 1518-22.

Sui, Chao, Bennamoun, Mohammed, and Togneri, Roberto (2015). 'Listening With Your Eyes: Towards a Practical Visual Speech Recognition System Using Deep Boltzmann Machines', in Proceedings of ICCV 2015, pp. 154-62.

Estival, D. (2015). “AusTalk and Alveo: An Australian Corpus and Human Communication Science Collaboration Down Under”. In Language Production, Cognition, and the Lexicon. Núria Gala, Reinhard Rapp and Gemma Bel-Enguix, eds. Series: Text, Speech and Language Technology, Vol. 48. Springer. pp.545-560.

Cassidy, S., Estival, D., Cox, F. (2014). "AusTalk Annotation report".

Togneri, R., Bennamoun, M. and Sui, C. (2014). "Multimodal Speech Recognition with the AusTalk 3D Audio-Visual Corpus". Tutorial at Interspeech 2014.

Burnham, D. (2014). Big Data and Resource Sharing: A Speech Corpus and a Virtual Laboratory – Facilitating Research in Human Communication Science. Keynote address at Oriental COCOSDA (International Committee for the Co-ordination and Standardization of Speech Databases and Assessment Techniques) and concurrent meeting of the Conference on Asian Spoken Language Research and Evaluation, September 10-12, Phuket, Thailand.

Estival, D., Cassidy, S., Cox, F., Burnham, D. (2014). “AusTalk: an audio-visual corpus of Australian English”. 9th Language Resources and Evaluation Conference (LREC 2014), Reykjavik, Iceland.

Sui, C., Haque, S., Togneri, R., & Bennamoun, M. (2012). "A 3D Audio-Visual Corpus for Speech Recognition". Paper presented at the SST2012, Sydney, Australia.

Sui, C., Haque, S., Togneri, R., & Bennamoun, M. (2012). "Discrimination Comparison Between Audio and Visual Features". Paper presented at the Asilomar 2012, Pacific Grove, USA. 

Burnham, Denis, Dominique Estival, Steven Fazio, Felicity Cox, Robert Dale, Jette Viethen, Steve Cassidy, Julien Epps, Roberto Togneri, Yuko Kinoshita, Roland Göcke, Joanne Arciuli, Marc Onslow, Trent Lewis, Andy Butcher, John Hajek and Michael Wagner. (2011). "Building an Audio-Visual Corpus of Australian English: Large Corpus Collection with an Economical Portable and Replicable Black Box". In Interspeech 2011. Florence, Italy, 2011.

Christou, Maria. (2011). "Isn't it Romantic": Discerning the Phonetic Properties of Speech Directed at Lovers and Strangers. Honours Thesis. Dept. of Psychology. University of Western Sydney.

Wagner, M., D. Tran, R. Togneri, P. Rose, D. Powers, M. Onslow, D. Loakes, T. Lewis, T. Kuratate, Y. Kinoshita, N. Kemp, S. Ishihara, J. Ingram, J. Hajek, D.B. Grayden, R. Göcke, J. Fletcher, D. Estival, J. Epps, R. Dale, A. Cutler, F. Cox, G. Chetty, S. Cassidy, A. Butcher, D. Burnham, S. Bird, C. Best, M. Bennamoun, J. Arciuli, and E. Ambikairajah. (2010). "The Big Australian Speech Corpus (the Big ASC)". In 13th Australasian International Conference on Speech Science and Technology, edited by M. Tabain, J. Fletcher, D. Grayden, J. Hajek and A. Butcher, pp.166-70. Melbourne: ASSTA, 2010.

Burnham, D., E. Ambikairajah, J. Arciuli, M. Bennamoun, C.T. Best, S. Bird, A.B. Butcher, S. Cassidy, G. Chetty, F.M. Cox, A. Cutler, R. Dale, J.R. Epps, J.M. Fletcher, R. Goecke, D.B. Grayden, J.T. Hajek, J.C. Ingram, S. Ishihara, N. Kemp, Y. Kinoshita, T. Kuratate, T.W. Lewis, D.E. Loakes, M. Onslow, D.M. Powers, P. Rose, R. Togneri, D. Tran, and M. Wagner. (2009). "A Blueprint for a Comprehensive Australian English Auditory-Visual Speech Corpus". In The 2008 HCSNet Workshop on Designing the Australian National Corpus, pp.96-107. Sydney: Somerville, MA, USA: Cascadilla Proceedings Project.


Public presentations

Presentation at the ACSRF Supercomputing Workshop, Hunan University, Changsha, China, June 2013.

Presentation at the HAIL Seminar, CSIRO, Sydney, April 2012.

Presentation at the New Zealand Institute of Language, Brain and Behaviour, University of Canterbury, Christchurch, NZ, November 2011.

Presentation at Interspeech 2011, August 2011.

Presentation at the Thirteenth Australasian International Conference on Speech Science and Technology 2010, Melbourne, Australia, December 2010.


Selected references

