Big data serves up linguistics insights

British Academy event details potential in faster, better routes to statistical analysis

May 29, 2014

Source: Getty

Breakfast test: worthwhile research can now take place in the time it takes to eat

Meaningful research into linguistics can now be conducted in the time it takes to have breakfast, thanks to the “transformative” impact of “big data” on the field.

That is the view of Mark Liberman, Christopher H. Browne distinguished professor of linguistics at the University of Pennsylvania, who told a panel discussion that “datasets are no longer the exclusive preserve of the scientific hierarchy” and that “any bright undergraduate with an internet connection can access and interpret the primary data”.

To illustrate his point during a recent event at the British Academy, he detailed how he had conducted his own “breakfast experiment” to ascertain whether there was any truth in the received wisdom that men and older people tend to be more “dysfluent” in their speech.

Professor Liberman performed a rapid statistical analysis over coffee and cornflakes of the number of “ums” and “uhs” in 2,500 hours of recorded and transcribed telephone conversations, classified by age and gender, that are available online.

While “uhs” performed as expected, “ums” seemed to buck the expected trend, leading Professor Liberman to speculate: “Are we seeing a substitution of ‘um’ for ‘uh’, with women leading the way?” Although such quick scans were “not a substitute for serious research”, it took him a mere 60 seconds to access the data, 5 minutes to create the graphs and 45 minutes to post a blog about it on the Language Log website.

Just as the microscope and telescope had opened up whole new worlds to investigate, he argued, thanks to big data “we can now observe linguistic patterns in space, time and cultural context, on a scale three to six orders of magnitude greater than in the past”.

Also speaking at the Language, Linguistics and the Data Explosion discussion, held earlier this month in conjunction with the Philological Society, were Sali Tagliamonte, professor of linguistics at the University of Toronto, and Philip Durkin, principal etymologist and deputy chief editor of the Oxford English Dictionary.

Professor Tagliamonte considered how different kinds of datasets can track patterns in language variation by sex, age, education and place, and what it reveals about the norms and practices of social groups.

Dr Durkin pointed to the immense value of “huge new digital resources, such as Early English Books Online” to scholars compiling historical dictionaries. However, he said, it remained to be seen how future scholars would strike a balance between “traditional reading, human combing of databases, and automated trawling and sketches”.

matthew.reisz@tsleducation.com

Times Higher Education free 30-day trial

You've reached your article limit

Register to continue

Registration is free and only takes a moment. Once registered you can read a total of 6 articles each month, plus:

  • Sign up for the editor's highlights
  • Receive World University Rankings news first
  • Get job alerts, shortlist jobs and save job searches
  • Participate in reader discussions and post comments
Register

Have your say

Log in or register to post comments

Featured Jobs

International Student Support Assistant YORK ST JOHN UNIVERSITY
Senior Lecturer: Architecture (Cultural Content) NORWICH UNIVERSITY OF THE ARTS
Head of Department of Physics ZHEJIANG UNIVERSITY
Research Assistant LONDON SCHOOL OF ECONOMICS & POLITICAL SCIENCE LSE
Lecturer in University Study Skills UNIVERSITY OF HAFR AL BATIN

Most Commented

question marks PhD study

Selecting the right doctorate is crucial for success. Robert MacIntosh and Kevin O'Gorman share top 10 tips on how to pick a PhD

India, UK, flag

Sir Keith Burnett reflects on what he learned about international students while in India with the UK prime minister

Pencil lying on open diary

Requesting a log of daily activity means that trust between the institution and the scholar has broken down, says Toby Miller

Application for graduate job
Universities producing the most employable graduates have been ranked by companies around the world in the Global University Employability Ranking 2016
Construction workers erecting barriers

Directly linking non-EU recruitment to award levels in teaching assessment has also been under consideration, sources suggest