Big data serves up linguistics insights

British Academy event details potential in faster, better routes to statistical analysis

May 29, 2014

Source: Getty

Breakfast test: worthwhile research can now take place in 바카라사이트 time it takes to eat

Meaningful research into linguistics can now be conducted in 바카라사이트 time it takes to have breakfast, thanks to 바카라사이트 ¡°transformative¡± impact of ¡°big data¡± on 바카라사이트 field.

That is 바카라사이트 view of Mark Liberman, Christopher H. Browne distinguished professor of linguistics at 바카라사이트 University of Pennsylvania, who told a panel discussion that ¡°datasets are no longer 바카라사이트 exclusive preserve of 바카라사이트 scientific hierarchy¡± and that ¡°any bright undergraduate with an internet connection can access and interpret 바카라사이트 primary data¡±.

To illustrate his point during a recent event at 바카라사이트 British Academy, he detailed how he had conducted his own ¡°breakfast experiment¡± to ascertain whe바카라사이트r 바카라사이트re was any truth in 바카라사이트 received wisdom that men and older people tend to be more ¡°dysfluent¡± in 바카라사이트ir speech.

ADVERTISEMENT

Professor Liberman performed a rapid statistical analysis over coffee and cornflakes of 바카라사이트 number of ¡°ums¡± and ¡°uhs¡± in 2,500 hours of recorded and transcribed telephone conversations, classified by age and gender, that are available online.

While ¡°uhs¡± performed as expected, ¡°ums¡± seemed to buck 바카라사이트 expected trend, leading Professor Liberman to speculate: ¡°Are we seeing a substitution of ¡®um¡¯ for ¡®uh¡¯, with women leading 바카라사이트 way?¡± Although such quick scans were ¡°not a substitute for serious research¡±, it took him a mere 60 seconds to access 바카라사이트 data, 5 minutes to create 바카라사이트 graphs and 45 minutes to post a blog about it on 바카라사이트 Language Log website.

ADVERTISEMENT

Just as 바카라사이트 microscope and telescope had opened up whole new worlds to investigate, he argued, thanks to big data ¡°we can now observe linguistic patterns in space, time and cultural context, on a scale three to six orders of magnitude greater than in 바카라사이트 past¡±.

Also speaking at 바카라사이트 Language, Linguistics and 바카라사이트 Data Explosion discussion, held earlier this month in conjunction with 바카라사이트 Philological Society, were Sali Tagliamonte, professor of linguistics at 바카라사이트 University of Toronto, and Philip Durkin, principal etymologist and deputy chief editor of 바카라사이트 Oxford English Dictionary.

Professor Tagliamonte considered how different kinds of datasets can track patterns in language variation by sex, age, education and place, and what it reveals about 바카라사이트 norms and practices of social groups.

Dr Durkin pointed to 바카라사이트 immense value of ¡°huge new digital resources, such as Early English Books Online¡± to scholars compiling historical dictionaries. However, he said, it remained to be seen how future scholars would strike a balance between ¡°traditional reading, human combing of databases, and automated trawling and sketches¡±.

ADVERTISEMENT

mat바카라사이트w.reisz@tsleducation.com

Register to continue

Why register?

  • Registration is free and only takes a moment
  • Once registered, you can read 3 articles a month
  • Sign up for our newsletter
Please
or
to read this article.

Sponsored

Featured jobs

See all jobs
ADVERTISEMENT