9781351330114
9781351330114
Understanding Linguistic
Fieldwork
This book is essential reading for students studying modules relating to linguistic
fieldwork or those looking to embark upon field research.
Series Editors:
Understanding
Linguistic
Fieldwork
Felicity Meakins
Jennifer Green
Myfany Turpin
iv
Contents
1 Introduction 1
1.1 Overview 2
1.2 Definitions 3
1.2.1 What makes a good field linguist? 3
1.2.2 ‘Insider’ and ‘outsider’ linguists 4
1.2.3 What is meant by ‘fieldwork’? 6
1.2.4 Linguistic description versus language documentation 8
1.2.5 Language vitality 10
1.2.6 Who is a speaker? 11
1.3 Your project 12
1.4 Workflow from go to woe 16
1.5 About us (first person plural exclusive) 21
1.6 Summary 22
1.7 Further reading 22
References 22
vi Contents
Contents vii
6 Morpho-syntax 121
6.1 The value of formal elicitation for grammatical description 122
6.2 What language to perform elicitation in? 123
6.3 What equipment to use? 124
6.4 Establishing parts of speech 124
6.5 Getting started with clause-level elicitation 126
vi
viii Contents
Contents ix
x Contents
Contents xi
Appendices 289
Appendix 1: Map of major languages referred to in this book 290
Appendix 2: Answers to exercises 291
Appendix 3: Glossary 315
Appendix 4: Acronyms 320
Index 321
xi
Figures
1.1 Field methods workflow used for the Gurindji history project 17
1.2 Recording stories near Ngangi 18
1.3 A CLAN excerpt of Ronnie Wavehill’s story transcribed in Gurindji and
translated into English 19
1.4 Excerpt from Bilinarra, Gurindji and Malngin Plants and Animals 20
2.1 An example of a participation statement developed by Caroline Jones
for a project on Kriol 36
3.1 Horace Watson recording Mrs Fanny Cochrane Smith. Sandy Bay,
10 October, 1903 49
3.2 View of an audio waveform in Audacity that shows a well-recorded
segment of speech contrasted with a segment where the levels were
set way too high and there is extreme clipping 63
3.3 Recording Kaytetye sand stories. Carol Thompson, Myfany Turpin and
Tommy Thompson 67
4.1 Metadata attached to a recording made on a cassette tape 78
4.2 A simplified view of the structure of a relational database for keeping
track of metadata 78
4.3 Representation of a screenshot from an ELAN annotation file with
a video clip and an audio file imported 81
4.4 Bush potatoes as an illustration of a “hub and spokes” model of archives 87
5.1 Drawing on native speaker intuitions about tone in Tai Phake 108
5.2 An example of one of the 35 Kaytetye target words and corresponding
images (here ‘in the scrub’ [ɐ'ʈɳeŋ]) in a carrier phrase ‘Say X!’,
pre-recorded by a native speaker. 111
5.3 Limbardo Payaguaje, a Shiwiar speaker, demonstrates the
use of the earbuds. The updated methodology positions just
one earbud under (rather than in) the nose and one next to the
mouth. Limbardo was keen to demonstrate the original
positioning which he found humorous 114
6.1 Groupings of A (transitive subject), S (intransitive subject) and O (object)
in an accusative system (non-dotted line) and ergative
(dotted line) system 131
6.2 Silverstein’s hierarchy and ergative/accusative splits 133
xi
Tables
Acknowledgements
Our heartfelt gratitude is first and foremost to all of the speakers of endangered
minority languages who have mentored, trained and hosted us during our extensive
times in the field. Although there are too many to mention in person we extend
particular thanks to Cassandra Algy, Topsy Dodd, Lizzie Marrkilyi Ellis, Clarrie
Long Kemarr, April Campbell Pengart, Veronica Dobson Perrurle (AM), Hilda
Price Pwerl, Eileen Campbell Pwerrerl, Alison Ross, Tommy Thompson, Margaret
Kemarre Turner (OAM), Violet Wadrill, and Biddy Wavehill. We would not have
written this book without the lessons they have taught us. We are also immensely
grateful to those who provided personal reflections on their own experiences as
fieldworkers, or those who gave us access to field materials: Grev Corbett, Nicholas
Evans, Murray Garde, Diana Guillemin, Al Harvey, Stephen Levinson, Miriam
Meyerhoff, Lev Michael, Marianne Mithun, Jane Simpson and Jesse Stewart. We are
also enormously grateful to the many linguists and others who read and commented
on drafts of the chapters in this book. Thomas Allen gave us direction on the legal
content of Chapter 2, and Lesley Woods kindly gave us access to her thesis on ethics in
linguistic research with Indigenous communities. Nick Thieberger, Margaret Carew
and Ben Foley provided feedback on Chapters 3 & 4, and Katie Jepson gave advice
about audio recording settings in Chapter 3. Thanks to Michael Proctor, Andrew
Nevin, Tom Ennever, and Isabel O’Keeffe for suggestions for Chapter 5 on phonol-
ogy. Rachel Nordlinger provided valuable input and ideas on Chapter 6 on gram-
matical elicitation. David Osgarby, Ivan Kapitonov, David Nash, Nay San and Mary
Laughren provided helpful language-specific advice on aspects of Chapter 7. We
thank David Wilkins, Gabrielle Hodge, Connie de Vos, Lizzie Marrkilyi Ellis, Inge
Kral, Onno Crasborn, Han Sloetjes, Lauren Reed and Lauren Gawne for their feed-
back and assistance on Chapter 8, and Evan Kidd, Sabine Stoll, Carmel O’Shannessy
and Bill Forshaw for commenting on Chapter 9 on child language acquisition.
Evangelia Adamou provided feedback on Chapter 10 on contact languages. We
thank Linda Barwick and Stephen Morey for commenting on Chapter 11, Brenda
Thornley for the map that appears in this publication, and Maxine Addinsall for
many of the original figures. Other photos and illustrations have been reproduced
with the permission of the Tasmanian Museum and Art Gallery, IAD Press, Onno
Crasborn, Carmel O’Shannessy, Caroline Jones and Jenny Taylor. Thanks also to
Mignon Turpin for assistance with copyediting and Vivien Dunn for producing the
index. We also thank the series editors, Bernard Comrie and Grev Corbett, for their
thorough and thoughtful feedback on the penultimate version of the manuscript.
The time to write this textbook would not have been possible without our various
xvi
Acknowledgements xvii
Linguistic abbreviations
abs absolutive
acc accusative
all allative
antipass antipassive
atel atelic
aug augmented
cat catalyst
comp complementiser
cont continuous
contr contrast
dat dative
der derivational
dir directional
dub dubitative
dur durative
emph emphatic
erg ergative
exc exclusive
f feminine
fem feminine
foc focus
fut future
imp imperative
impf imperfect
inc inclusive
infl inflectional
inst instrumental
loc locative
m masculine
masc masculine
mid middle voice
min minimal
n neuter
neut neuter
nom nominative
o object
newgenprepdf
xi
obl oblique
only restrictive
pauc paucal
pl plural
prog progressive
prop proprietive
prs present
pst past
red reduplication
rel relativiser
s subject
seq sequential
sg singular
smsg same patrimoiety, same generation moiety
subj subject
tel telic
top topic
tr transitive
ua unit augmented
v vegetable
veg vegetable
1 first person
2 second person
3 third person
- morpheme break
= clitic break
x
1
Introduction
DOI: 10.4324/9780203701294-1
This chapter has been made available under a CC-BY-NC-ND license.
2
2 Introduction
1.1 OVERVIEW
In a world where increasing numbers of languages are losing speakers (§1.2.5), the
place of fieldwork in the discipline of linguistics and its relevance for speech com-
munities is gaining importance. The term ‘fieldwork’ has been used to describe lin-
guistic research in many different contexts (§1.2.3), but the focus of this textbook is
on the methods, ethical practices and procedures for researching under-described
languages in endangered language settings. We pay particular attention to fieldwork
in countries and continents which have undergone European or other expansion
(e.g. Australia and the Pacific, the United States, Canada, South America, Africa)
and the roles and responsibilities of linguists in engaging with communities in these
settings (cf. Austin & Sallabank, 2011).
If we are to understand more about the rich diversity of the world’s remaining
languages, then fieldwork is essential. Taking stock of the inventory of linguis-
tic structures across the world’s languages and their parameters of use is basic to
an understanding of the human language faculty. Capturing this diversity means
undertaking research in remote areas of the world and getting training in how to
work in an ethical and responsible manner (Chapter 2) and in how to collect and
curate quality linguistic data (Chapters 3–4). This data is important to all of the
standard subfields of linguistics, e.g. phonology, morpho-syntax and semantics
(Chapters 5–7), and also to other areas of linguistics, for example typology and
historical linguistics. Some of these sub-disciplines require particular methods
and ethical considerations. As such, we devote chapters to discussion of gesture and
sign (Chapter 8), language acquisition (Chapter 9), language contact (Chapter 10),
and poetry and song (Chapter 11). In doing so, we take a broad view of language,
including not only speech, but also multimodal forms of communication, and the
performative and visual dimensions of language. Many of the discussions and sce-
narios presented in this book also include other disciplines that involve the collec-
tion of primary linguistic data of under-described languages such as ethnobiology
and anthropology.
Not all of the things you will learn in this book can be practised in your field
methods class where the language, the speaker and ethics processes have all been
organised by your lecturer. You will also probably work with only one speaker
in class, which tends to give consistent data, whereas in the field you will work
with multiple speakers. As a result, you are likely to encounter inter-speaker vari-
ation and other complexities of language-in-use. Finding the systematicity in this
seemingly ‘messy’ data can be challenging (but fun!). Another difference is that
in a field methods class the speaker is usually the outsider, whereas in the actual
field you will be the outsider unless you are an ‘insider’ linguist (§1.2.2). Finally,
previous documentation of the language is usually out of bounds for field meth-
ods students as you are expected to work things out from scratch. In the field you
should be on top of all that has been written on the language before you go there
(§2.7.4 & §6.6.1).
3
Introduction 3
1.2 DEFINITIONS
Before we launch into the details of field methods, it is important to think about what
we mean by ‘field linguist’ and ‘fieldwork’. Another distinction –between ‘linguistic
description’ and ‘language documentation’ –has also become very important in
recent decades to the way that fieldwork is conceptualised and planned. How you
undertake fieldwork may vary depending on how endangered the language you
plan to work on is and the varying skills of the language team.
4 Introduction
conversation and laughter, people talking all at once but always in collabo-
ration. At one point I noticed a ring of faces around the doorway at the
back of our room, each registering wide-eyed wonder. They belonged to
members of a colleague’s group who, drawn by the noise, had been observ-
ing in silence. As my colleague later explained, in their culture people nor-
mally do not talk unless they have something to say. This friend, who is
naturally taciturn, conducted class by simply starting to write on the black-
board. Participants would drift in, watch for a while, then eventually try
it themselves. Sometimes when I would pass by their room I would think
at first that everyone had left, only to discover that the room was full of
people working. My friend explained that this is a traditional way of learn-
ing: children watch how things are done, then gradually imitate what they
have observed. The two of us are extremely grateful that we each landed with
the group we did. Now many years later, I still feel extraordinarily fortunate
to have these amazing Mohawks and their language in my life. And it started
with a happy accident.
What makes a good field linguist is someone who has an interest in understand-
ing other languages and cultures and enjoys working with people. The qualities of
patience, humility, humour and the ability to think laterally also help to make a
good fieldworker. A good ear, a knack for seeing patterns and a respect for data –
even obsession –without losing sight of the big picture, are other abilities that also
serve the field linguist well.
Introduction 5
identify potential ethical dilemmas quickly. There are some pitfalls, however. As
James Crippen (2009), a Tlingit linguist from British Columbia points out, because
‘home’ and the ‘field’ are one and the same, there can be many demands made
on ‘insider’ linguists:
(W)hile at home, ‘real life’ may interfere with research; consultant patience … may be
lower; restricted data may not be publishable … [the insider linguist] may become a polit-
ical proxy for non-linguistic issues, may be blamed for language policy failures, and may
be expected to be a language teacher and not a researcher … The expert role the [insider]
linguist assumes may also alienate them from their peer group.
‘Outsider’ linguists are external to the speech community and are often mem-
bers of the dominant group in the very country that may be responsible for
the marginalisation of the speech community. All ‘outsider’ linguists need to
be mindful of the impact of their research projects in communities, understand
fully their responsibility to the community, communicate openly with the lead-
ers in those communities, and negotiate projects with speakers (§2.4). While
many ‘outsider’ linguists may have a political or ideological investment in their
project, and may be motivated by notions of ‘saving’ languages and support-
ing speakers of endangered languages, it is also important to be sensitive to
the notion that “the line between empowerment and imperialism is very thin”
(Kusters, 2012, p. 42).
A final category of field linguist is the ‘insider-outsider’ linguist. These are peo-
ple who are from a marginalised group themselves, but not from the speech com-
munity which is the focus of their research. Sometimes they come from another
minority language group in the same country and can identify with many local
issues of disempowerment. For example, Annelies Kusters is a deaf linguist who has
conducted research on sign languages in Ghana.
As a deaf person, I understand certain deaf-related experiences from the inside out,
for example, being primarily visually oriented and experiencing barriers … There were
other obvious limitations in my understanding: I am deaf, but I am not Adamorobee, not
Ghanaian and not black. I am deaf, but I did not grow up with sign language and I experi-
enced oralism. I am deaf and I can read; I am educated, while they are not. I am deaf and
I married a deaf person without any problem, a right that they cannot enjoy. In short: I am
an insider in terms of being ‘biologically deaf ’ and certain socio-cultural experiences that
come with it but I am an outsider in many other domains.
(Kusters, 2012, pp. 39, 40)
For both the ‘outsider’ and ‘insider-outsider’ linguist, a sensitivity to the pain of
language loss may be required. People who have lost their language due to the colo-
niser society (which you may be from) may feel upset that an outsider has the ability
to learn their ancestral language, has access to funds and is paid to work on their
ancestral language with their elders. Giving thought to ways of ensuring that com-
munity members have key roles in the field project and building opportunities for
language learning and training into the project can help alleviate some of these
concerns.
6
6 Introduction
Although partly in jest, Crowley’s definition does explicitly reject ‘urban fieldwork’
as fieldwork. Hyman (2001) has a similar perspective on what constitutes ‘the field’.
Urban fieldwork is a more recent phenomenon where languages of diaspora groups
(often refugees or economic migrants) are studied in Western settings, often in so-
called ‘super diverse’ cities such London or Auckland. Urban fieldwork often refers
to research in the city or town where you, as a linguist, live. This type of fieldwork
works well for linguists who are unable to leave home for extended periods due to
a lack of research funds, caring or family commitments, or heavy teaching loads. It
also may not be possible to travel to a remote field site due to political unrest and
other circumstances. Urban fieldwork is also a great opportunity to build diverse
networks in your own city. It can also be a chance for a linguist to make connections
with a diaspora group in a pilot study with a view to conducting fieldwork in the
original speech community if funding becomes available.
We are mostly concerned with methods for documenting fragile languages
in contexts of language endangerment and shift (§1.2.5). For the most part,
this means fieldwork is often undertaken in remote locations, but this does
not exclude research in urban contexts. For example, some lesser- studied
languages, particularly in India, South America and Africa, have been spo-
ken in large urban centres for a long time, by speakers who are embedded in
well-developed social networks and cultural practices. In other cities, under-
described languages are spoken by newer speech communities, for example
7
Introduction 7
Exercise 1 Discuss why you think the following are or aren’t examples
of fieldwork?
• You speak an undescribed, endangered language. You are writing a gram-
mar of your language and you decide which sentences are grammatical
and ungrammatical yourself.
• You are working with a refugee group in the city where you live, docu-
menting their language.
• You and some other students are in a field methods class at your univer-
sity working with a speaker of a small language spoken elsewhere in your
country.
• You work on your own language in the field which you call home.
• You live in Kuala Lumpur and record yourself outside pronouncing words
to figure out the vowel space of your English variety and are infected with
malaria in the process.
• You are a phonetician and accompany another linguist to the field and
spend a week making enough recordings for your thesis.
• You hear that speakers of an endangered language from western China
are visiting the anthropology department and you conduct a number of
psycholinguistic tasks in the lab while they are there.
In this book we discuss the most challenging types of fieldwork situations, and
give strategies that may help in a whole range of fieldwork contexts. Some will
be of value more generally, and some may not apply in urban settings or for the
‘insider’ linguist. Whether work is carried out in remote areas or urban contexts, by
an ‘insider’ or ‘outsider’, the field linguist enters into a partnership with the speech
community; and data is created and analysed in the context of, and as a result of,
these partnerships.
In this textbook we provide steps in how to develop an audio-visual corpus
through the collection of a range of linguistic data including texts (narrative,
procedural, conversation etc.) and elicitation. From this corpus, the field linguist
seeks to elucidate the lexical and constructional resources within a language, their
formal properties and expressive scope. We also consider linguistic fieldwork to
be research that is embedded within its cultural and social context. Indeed Hill
(2006, p. 113) suggests that “linguists need to be ethnographers, because they
venture into communities that may have very different forms of language use
from those of the communities in which they were socialised as human beings or
trained as scholars”.
8
8 Introduction
This definition of fieldwork comes with particular contexts that require specific
methodologies. For example, endangered languages often have small numbers
of speakers, and so the quantitative methods standard to phonetics (§5.9), lan-
guage acquisition (and psycholinguistics) (§9.4.2) and contact linguistics (§10.4)
often require some adjustment and compromises. For example, sociolinguistic
experimental tasks such as matched-guise tests require anonymity to work prop-
erly. Matched-guise tests involve participants listening to recordings of speak-
ers reading a passage of text and making judgements about speaker attributes
(e.g. class, generosity, intelligence) based on their speech. In small communities
where everyone knows each other these methods are compromised. Participants
in small communities tend to rate the speaker based on what they know about
the person rather than on their speech. Similarly, grammaticality judgements of
semantically weird sentences such as “colourless green ideas sleep furiously” may
be problematic in communities whose theories of language are highly cultur-
ally embedded (§6.6.1). Finally, all remote communities in endangered language
contexts involve members of what ethics committees often refer to as ‘vulnerable
populations’. In these situations very specific thought about how well partici-
pants understand the project and their options for archiving and future use of
recordings is required in order to obtain truly informed consent (§2.5). This is
even more the case if your work involves deaf communities (§8.5.4) or children
(§9.3.4), or languages held in low regard by the speech community such as creole
languages (§10.3.4).
Introduction 9
(T)exts offer access to questions of grammar, style, history, and culture; furthermore, they
provide crucial insights into indigenous perspectives or ‘fundamental ethnic ideas’ …
The text-based Boasian framework prioritizes the study of language on its own terms and
within its wider cultural context, and in so doing gestures to a more collaborative rela-
tionship among community outsiders and insiders in producing research.
(Epps, Webster, & Woodbury, 2017, p. 42)
10 Introduction
Documentary linguistics arose in the context of heightened concern over ‘language endan-
germent,’ that is, shifts, often radical, in the ecology of speaking, including the fall into disuse
of ancestral lexico-grammatical codes among the only people who knew them. This concern
developed among activist community members themselves as well as outside linguists.
(Epps, Webster & Woodbury, 2017, p. 57)
Theories of language, which have diversity at their core, have been one of the driv-
ers behind the push to document as many of the world’s languages as is possible.
Colonisation, globalisation, climate change and technological change have had,
and continue to have devastating effects on many areas of diversity. Of the 7,099
languages listed in Ethnologue,1 an online resource of endangered languages pro-
duced by SIL International, 457 are identified as no longer spoken (see also Lewis
& Simons, 2009). The heaviest areas of language loss have occurred in areas such as
North and South America, Africa, and the Pacific, including Australia, Papua New
Guinea and Melanesia.
Language vitality is often measured by Fishman’s (1991, pp. 87–109) eight-stage
Graded Intergenerational Disruption Scale (GIDS) which has been adopted by
UNESCO2 and since expanded by Ethnologue (see Table 1.1).3
GIDS measures both active knowledge of languages, as well as comprehension,
along a number of axes including generation (are there child learners?), numbers
of speakers, language practices and literacy (is the language used in the school
curriculum?) and official recognition by regional governments. These factors are
discussed in more detail in the UNESCO Language Vitality and Endangerment
(2003) report. A good overview of the various instruments for measuring language
Table 1.1
Fishman’s Graded Intergenerational Disruption Scale (GIDS)
Stage 1 Some use of the language in higher education, occupational, national governmental
and media efforts
Stage 2 The language is used in regional governmental services and mass media
Stage 3 The language is used in work environments (outside of the core speech community)
involving interaction between native and non-native speakers of the language
Stage 4 The language is used in primary education that meets the requirements of compulsory
education laws
Stage 5 There is literacy in language in home, school and community life, but without taking
on extra-communal reinforcement of such literacy
Stage 6 Intergenerational oral transmission takes place and is reinforced in the community and
some institutions in the community e.g. church
Stage 7 Users of the language are a socially integrated and ethnolinguistically active
population but they are beyond child-bearing age
Stage 8 Most remaining users of the language are socially isolated old folk and knowledge of
the language is reliant on older sources
1
Introduction 11
endangerment and vitality is given by Obiero (2010) and more extended discus-
sions can be found in Tsunoda (2005).
The field methods you use need to take into account these different kinds of lan-
guage ecologies. You may be working in a monolingual context and ‘monolingual
field methods’ will be necessary (§6.2). Alternatively, you may find yourself in a
bilingual context, often a situation of language shift in the case of endangered lan-
guages. In these cases, the target language is often used by a minority group or an
older generation. In this scenario, you will probably find yourself working ‘through’
the lingua franca of the community, often a contact language such as a creole. Some
caution is necessary when conducting fieldwork in these contexts because delineat-
ing the linguistic entity to describe is trickier than in contexts where there is a clear
difference between the target language and the lingua franca. But comparisons with
earlier descriptive work and a good working knowledge of the contact language can
help sort out where specific language structures originate.
One important point to bear in mind is that the language vitality conversation has
been largely dominated by those who work on endangered heritage languages. Yet in
the place of endangered languages there is often a dynamic array of contact varieties of
English, Spanish and French, creole languages and other contact varieties of the heritage
languages (Chapter 10). This diversity and these radically changed language ecologies
all need attention and documentation. Focusing on heritage languages at the expense of
new languages can be disempowering to the younger emergent language leaders.
12 Introduction
actively participate themselves, but provide an attentive audience for active speak-
ers during recording sessions. Where possible, it is a good idea to work with teams
of people to draw on their different interests and abilities. It also helps iron out what
features are the result of inter-speaker variation, what categories are shared in a
speech community and what parts of the language might be undergoing language
change, which is common in language endangerment situations (Chapter 10). You
may be lucky and find a speaker who can distinguish intuitions about their own
speech from that of other people in their speech community (e.g. “older people
pronounce it X, while younger people pronounce it Y”; or “my father uses it to
mean X, but my brother uses it to mean Y”). It is important to document which
speakers have provided the intuitions on which your analysis is based. Make sure to
include the language background of speakers who provide intuitions, for example
which other languages they speak (§4.2.3). Tsunoda (2005, pp. 117–134) provides a
detailed discussion of different speaker types and Morey (2010, p. 93) has a detailed
discussion about differences between native speaker intuitions.
All fieldwork begins with formulating a research project. Because fieldwork requires
a high level of investment (time, money, emotion), you should not undertake it
without careful thought to your research questions. Before embarking on fieldwork,
you need to negotiate and refine the ways that your research aligns with, and hope-
fully extends and interrogates the theoretical frameworks most relevant to your
task. Long ago, in a letter to a friend in 1861, Charles Darwin commented on this
relationship between theory and practice.4
About 30 years ago there was much talk that Geologists ought only to observe and not
theorise; and I well remember someone saying that at this rate a man might as well go
into a gravel-pit & count the pebbles & describe their colours. How odd it is that every
one should not see that all observation must be for or against some view, if it is to be of
any service!
Without a theoretical framework, as Darwin implies, there would be no basis for the
inclusion or exclusion of anything. Theorising, data gathering and interpretation go
hand-in-hand. Terry Crowley (2007, p. 13), who pioneered fieldwork in Melanesia,
also cautions that “no descriptive linguist can carry out linguistic analysis of a
13
Introduction 13
Anyone who is engaged in fieldwork should also consider the ways that the needs
of endangered speech communities may be recognised, articulated and supported.
As Evans (2008, p. 345) points out, “anticipating the needs of language maintenance
and revival programs might point to certain types of data-gathering what would
otherwise be overlooked”. Below Evans describes his own first encounters with a
Nen-speaking community in Papua New Guinea who had been anticipating the
arrival of a missionary linguist. As Evans notes, the Bible translation work proved
invaluable for many aspects of his grammatical descriptive work, showing how
community work and linguistic research are inextricably connected.
14 Introduction
Introduction 15
16 Introduction
We now ‘unpack’ a real example from our own fieldwork to show all of the aspects
of documentation workflow process. The example comes from the Gurindji com-
munity in northern Australia who, for three decades, have been documenting their
language with ‘outsider’ linguists, including missionaries, academics and linguists
from a regional Indigenous-run language centre. This workflow comes from a
recent project which documents the early colonial history of the region through
the Gurindji language. The project brought together archival recordings, and
made new recordings to contribute to growing the Gurindji corpus. We now put
17
Introduction 17
Figure 1.1
Field methods workflow used for the Gurindji history project
an imaginary GPS tracker on an excerpt of language and follow it from the corpus
to various outputs to give you a sense of the different paths your documentation
work can take. This particular example comes from extensive experience in one
particular field site and might not be a place to begin but it gives you a sense of the
directions your research can ultimately take. Figure 1.1 outlines the workflow and
information about each step is detailed below and illustrated.
(i) Th e first stage was the development of the project with the Gurindji
community. Over three decades, many Gurindji narratives (Dreamtime
or creation period, historical accounts and everyday stories) have been
recorded. Many speakers have since passed away and much of the knowl-
edge is increasingly fragile. Bringing together legacy recordings and mak-
ing new ones is important to the community. Meetings between linguists
and the Gurindji community ascertained that narratives associated with
the Dreamtime were not appropriate for dissemination beyond the com-
munity, but the historical accounts had broader interest to the Australian
community. The meetings took place at Karungkarni Art and Culture
Aboriginal Corporation which represents the language and culture inter-
ests of Gurindji people. With the assistance of this organisation, fund-
ing was sought from a non-academic government agency to enrich the
historical narratives with information about sites and artistic responses
to the stories. A year later, after successfully obtaining funding, further
meetings were held with stakeholders including elders, rangers from the
Central Land Council Murnkurrumurnkurru ranger group and artists
from Karungkarni. Elders decided which stories to include and which
places to visit to add visual documentation to the stories and to pass on
these stories to the younger generations, including the ranger group who
include in their work the documentation of cultural and historical knowl-
edge about their land, as well as practical tasks such as weed and feral
animal control.
(ii) We then begin the process of recording and enriching the stories (see
Figure 1.2). The primary ‘object’ of the documentation process is the
audio(-visual) recording (§3.3.3). In this case, Gurindji speaker Ronnie
Wavehill tells of the massacre of Gurindji people by colonists in the early
twentieth century near the site where it occurred. Other elders act as ‘wit-
nesses’ for the story, confirming or finessing detail. Also present are local
rangers, photographers documenting the sites, visual artists who later
paint responses to stories relating to their country, totems, family and
histories; and linguists who are interested in documenting the words and
grammar of Gurindji.
18
18 Introduction
Figure 1.2
Recording stories near Ngangi (Photo: Brenda L. Croft 2014)
The whitefellas came and surrounded the Gurindji. There was no hope; they were
only going to shoot. The Gurindji started hearing shots. South of there, at a
considerable distance, is a hill called Ngangi. It lies on the southern side down
from Number 17 Bore and on top of Ngangi is an Emu Dreaming; it’s where the
emu spirit stood and stopped.
ext the transcription and annotation of the story takes place. Ronnie
(iii) N
Wavehill’s story is media- linked in the software CLAN, transcribed in
Gurindji, and translated into English (Figure 1.3). Note how the speakers’
names are reduced to initials. These initials reflect the participant’s real first
name and surname. It is an example of partial anonymity where the speaker
is identifiable within the community but not to outside researchers §4.4.5.
For more discussion about transcription software including CLAN, see
§4.3.2. Annotations include morphological glossing, useful for grammatical
analysis (§6.7) and dictionary compilation (§7.6). Additional annotation
might include gesture (§8.7). Attention is also given to file naming to help
keep track of all media and annotation ‘objects’ associated with the recording
session. They are given unique identifiers, stored in the same folder and the
metadata is clearly recorded (§4.2.2).
19
Introduction 19
Figure 1.3
A CLAN excerpt of Ronnie Wavehill’s story transcribed in Gurindji and translated into English
(Illustration: Maxine Addinsall)
1 @Begin
2 @Languages: gu
3 @Participants: RWH Ronnie Wavehill Speaker, ECH Erika Charola Investigator
4 @ID: gu||RWH|73;00.00|male||||Speaker|||
5 @Media: EC98_a027, audio
6 @Date: 30-MAY-1998
7 *RWH: Ngumpit kartipa-ma walilik na walilik-ma kartipa-ma. *
8 %eng: The kartiya came and surrounded the ngumpin.
9 *RWH: Najing, turlwak-kulu-rni, kuyangka-ma-lu turlwak-ma kurru nyanya. *
10 %eng: There was no hope; they were only going to shoot. The ngumpin
11 started hearing shots.
12 *RWH: Kurlarnimpa kula kajuparik nyila-ma kurlarnimpa ngarlaka-ma
13 nganayirla-ma Ngangi-ma, Ngangi na ngarlaka Number Seventeen-ta
14 kaarnimpa kuya, kankulupal ngarlaka Ngangi nyawa Yiparrartu nyamu
15 nyawa kurlarnimpa yani. Yiparrartu na — Dreaming karrinya
16 kutitijkarra. *
17 %eng: South of there, at a considerable distance, is a hill called Ngangi.
18 It lies on the southern side down from Number 17 Bore and on top of
19 Ngangi is the Emu Dreaming; it’s where the emu stood and stopped.
20
12oct 15[E|CHAT] ˙ 17 : W7.663-14.048: [Link].000: C al OMs
+H
•S
–H –∙ –∙ –∙ –∙ –∙ –∙ –∙ –∙ –∙ –∙ –∙ –∙ –∙ –∙ –∙ –∙ –∙
(iv) Afterwards consent for archiving and use of story was discussed. The story
was played back to a small group which included Ronnie Wavehill, other
custodians of knowledge and interested younger people. Details were dis-
cussed and anything deemed not suitable for broader dissemination was
noted. Consent was given by Ronnie Wavehill orally, as well as on a signed
form. All consent was witnessed by the group. The transcribed CLAN file
of Ronnie’s story and accompanying media were then archived according to
the conditions set by the Gurindji community during the consent process.
Read more about consent processes in §2.5 and archiving in §4.4.
(v) The documentation event had a number of research outcomes:
• Ronnie Wavehill’s (2016) story was published in a bilingual collec-
tion of Gurindji historical narratives for the Gurindji community
and broader Australian public called Yijarni: True Stories from
Gurindji Country.
20
20 Introduction
Figure 1.4
Excerpt from Bilinarra, Gurindji and Malngin Plants and Animals
Source: Hector et al., 2012, p. 95
Introduction 21
Finally, a word about the authors of this textbook. We bring together a combined
experience of many decades of fieldwork on Indigenous languages in Australia.
Our pathways to long-term linguistic work have been diverse: Green became
interested in the Alyawarr and Anmatyerr languages while working on art and
craft programs in a remote Indigenous community called Utopia in Central
Australia in the mid-1970s. Turpin became fascinated by Australian languages
while studying music at Melbourne University in the early 1990s, and went on
to work with Kaytetye people and other groups in Central Australia. Meakins
took a more traditional route, cutting her teeth on formal linguistic theory at the
University of Queensland in the mid-1990s before working with Ngarinyman,
Bilinarra and Gurindji people in northern Australia on school programs. We have
all worked in Indigenous-run language and arts centres, facilitating language
support programs and producing bilingual language resources such as diction-
aries, text collections and posters. We have always tried to bring a community-
orientated and collaborative approach to research as a part of our practice as
linguists. Our collective linguistic interests span many aspects of language: pho-
nology, semantics, morpho-syntax, sign and gesture, multimodality and verbal
arts, as well as language acquisition and contact languages. Although some of
the examples used in this textbook are from our own fieldwork in Australia (as
these are the ones we know best), we have attempted to reach out to colleagues
from all corners of the world and to represent as best we can the diversity of the
world’s languages. First-hand accounts by field linguists (in boxes in the text)
and case-studies (in shaded boxes) add to the breadth of the languages covered.
Our approach is inter-disciplinary, drawing on themes and techniques from
anthropology and musicology as well as more standard linguistic methods. The
widespread use of audio-visual recording, and the increasing value attached to
understanding how language is used in naturalistic everyday contexts also draws
linguistic and anthropological approaches closer together (Duranti, 1997, p. 98).
We also encourage research that is collaborative, reflexive and grounded in the
twin expectations of both language communities and research academies.
2
22 Introduction
1.6 SUMMARY
Knowing how to get started with fieldwork can be difficult. This textbook walks
you through the process. We begin with the preliminaries of fieldwork in Chapter 2
(identifying a field site, preparation, procedures, ethics) and proceed to chapters
that discuss recording equipment and how to use it (Chapter 3) and then workflows
through the processes involved in archiving your work (Chapter 4). We then look at
fieldwork on different aspects of language. As well as the classics of grammar writing
(Chapters 5–6) and semantic description and dictionary compilation (Chapter 7),
we take a broader approach to the descriptive object. We discuss methodologies
for conducting field research on anthropological and socio-cultural aspects of lan-
guage, sign language and gesture (Chapter 8), child language (Chapter 9), contact
languages, such as creoles (Chapter 10), and verbal art (Chapter 11). These chapters
cover the methodologies, ethics and procedures specific to these topics.
For a good overview of different views on what constitutes fieldwork, read Sakel &
Everett (2012, pp. 2–8). For a detailed history of linguistic fieldwork, see Chelliah
& de Reuse (2011). Read Evans (2011) for a beautifully written and wide-ranging
account of linguistic diversity and why we should care about endangered languages.
For more on endangered languages, see Austin & Sallabank (2011) and Sallabank
(2013). For accounts of fieldwork, read Crowley (2007), Dixon (1983), Sarvasy &
Forker (2018) and Everett (2008).
NOTES
REFERENCES
Austin, P., & Sallabank, J. (2011). The Cambridge handbook of endangered languages. Cambridge:
Cambridge University Press.
Bird, S., & Simons, G. (2003). Seven dimensions of portability for language documentation
and description. Language, 79(3), 557–582.
Boas, F. (1917). Introduction. International Journal of American Linguistics, 1(1), 1–8.
23
Introduction 23
24 Introduction
2.1 INTRODUCTION
In this chapter we first look at how you might go about choosing a community to
work with. We then discuss your responsibilities as a fieldworker to the language
consultants and their community; and to the institutions with whom you are affili-
ated. We discuss how to manage day-to-day concerns such as how to pay people,
living in the field, transport and safety, both for yourself and the people you work
with. Thinking about these issues is an essential part of planning for fieldwork. How
well these aspects of fieldwork are understood and managed can make all the dif-
ference between whether fieldwork is an enjoyable and successful experience or a
difficult one.
There are a number of well-worn paths to the field. If you are an ‘insider’ linguist
(§1.2.2) one option is to work with your own language. If you are an ‘outsider’
linguist you might respond to a community’s request for a linguist or consider
working on a language spoken by a community where you have pre-existing con-
tacts. Another option is to choose a place you have always wanted to go. Through
your own travels and experience, you may have a place or language in mind which
you might pitch to a potential supervisor/advisor. In this case, it is worth inves-
tigating whether there is an organisation in the region that supports language
activities, such as a language, culture or arts centre, a museum, an Indigenous
council, or health and education organisations. Volunteering or working for these
organisations is a good opportunity to get to know a speech community and their
concerns and interests, and, of course, the language itself. Another path is to join
an existing project at a university either as a research assistant, an intern or a PhD
student. It is ideal to apprentice yourself to a more experienced fieldworker who
can lead you through the process of ethics, funding applications, consent, travel
and methods.
The degree of endangerment of the language is another factor to consider.
Given that there are many languages on the brink of disappearing and only a
small number of fieldworkers, there are good reasons to prioritise highly endan-
gered languages over those with lots of speakers. Some people might want to
work on a language that is undocumented, and so the degree of previous docu-
mentation might be a factor in the decision. In contrast, you may want to focus
DOI: 10.4324/9780203701294-2
This chapter has been made available under a CC-BY-NC-ND license.
26
It goes without saying that it is not a good idea to go to a place where linguists
are not welcome. As Rice (2012, p. 414) notes, “(s)ome communities are pleased to
have researchers, and embrace their presence. In other communities, there is much
27
suspicion of researchers, their goals, and their values”. Fortunately, there are many
under-described languages with speakers who are happy to have a linguist come to
work with them. For obvious reasons, it is also a bad idea to go into war zones or
areas of civil unrest and areas where there are scarce resources that you would be
taking up.
Once you have decided on where you are going to do fieldwork start reading about
the area –the people, their language, culture and history, as well as the geogra-
phy of the region. Talk to other people who have worked in the area and ask for
tips on who to contact and the availability of power, housing, food, transport etc.
Think beyond linguists, as other researchers, teachers, health professionals and
28
government workers may have important local knowledge too. Start organising all
the research that has been done on the region and prepare a broad range of tasks
or questions that you plan to address in your own fieldwork. Create a bibliography
and a summary of the sociolinguistic context. The Ethnologue website is a good
place to start, although note that the information there may not be up to date.1
A useful set of areas of enquiry, outlined by Van der Veen & Medjo Mvé (2010), are
summarised here:
• Name(s) of the language and people who speak it. Distinguish between exo-
nyms (names attributed by others) and endonyms (self-attributed names).
• Classification of the language: phylum, group, subgroup etc.
• The estimated number of speakers, any variation in fluency and whether
there is multilingualism and, if so, with which languages.
• Attitudes towards their own, and neighbouring or dominant languages, as
well as the coloniser or dominant society.
• The estimated vitality of the language. See, for example, the UNESCO
criteria (§1.2.5).2
• Location of speakers (towns, villages, distances between localities, accessi-
bility of localities); and their mobility.
• Extent of internal diversification (dialects, sociolects).
• Known linguistic features, e.g. sound inventories, tone, noun classes. If nothing
is known, note those of neighbouring or closely related languages.
• Social and economic organisation.
• Information about socio-cultural constraints and taboos, special speech
registers, singing and signing practices, and their possible impact on lan-
guage use, vocabulary etc.
• Professional activities on language and language use (e.g. interpreting,
schooling, Bible translations, literacy).
• Access to Western education systems.
• History of the group (often oral traditions), e.g. migration stories, origin(s)
of the group.
Concern for social relationships and the impact of the research and the researcher
on the speech community should be a primary consideration in linguistic field-
work (Ahlers & Wertheim, 2009; Kroskrity, 2015, p. 154). Working in Indigenous
and minority communities requires particular sensitivity to issues of power and
control, especially if you are part of a majority culture or a more powerful sec-
tor of society. Minority groups often have a history of outsiders who presume to
know what is best for them. It is important to be sensitive to these issues and have
a collaborative approach to research (Rice, 2010, p. 29). There are two overarch-
ing basic principles of ethical research: respect for a person’s autonomy and rights
to make decisions about research participation, and beneficence, or how research
29
should maximise benefits and minimise harm (Singleton, Martin, & Morgan,
2015, p. 12). Put another way, “ethical behavior can be said to be based in a num-
ber of ‘r’ words –respect, relationships, reciprocity, and responsibility” (Rice,
2012, p. 427).
Linguistic fieldwork often depends upon a continuing relationship with speakers,
and the depth of the insights into particular features of a language improves over
time. The expectations of collaborators and the language community thus help shape
the results of the work. If you are not from the community, it is important to try to
understand the norms and lifestyles of the community, for example norms of polite-
ness and friendship. Be prepared to be flexible as there may be changing needs over
the course of your engagement with the speech community. You should consider
whether it would be fruitful collaborating with other organisations. Look at it from
a language speaker’s point of view: if the school or church and you are all developing
an orthography, it is better to do it together once rather than several times (§5.11)!
record, document, transcribe and translate (Seyfeddinipur, 2012; Wilkins, 1992). For
example, the National Breath of Life Archival Institute for Indigenous Languages in
the United States runs two-week workshops for community researchers, providing
training in revitalisation techniques and archive access;3 the Endangered Languages
Documentation Program (ELDP) runs in-country training programs in many
African countries;4 and the Documenting and Revitalising Indigenous Languages
(DRIL) program in Australia runs certified training courses for language communi-
ties.5 Opportunities for language speakers to take courses, attend conferences, per-
form their verbal arts and visit archives are of immense value to everyone and may
enrich the research project.
of culture than is dance, song and art. Where language is specifically connected to
particular geographical places (as is frequently the case) it often forms an important
aspect of a speaker’s cultural heritage and identity. This contrasts with international
languages such as English. While the social value of linguistic research on minority
and endangered languages is often framed in terms of ‘saving’ languages for human-
ity as a whole by recording them and writing them down, it is also crucial that con-
sideration is given as to how the results of research will be accessible to community
members (Wilkins, 1992, p. 173).
A community may have very definite views about how their language should
and should not be used. In some cases, Indigenous languages and their words have
been used commercially or by governments without a community’s consent. This
appropriation may have led to speakers deciding not to write their language down
or share it with outsiders. A community may not want anyone using their language
and the knowledge it encodes for commercial reasons without their permission, yet
in making recordings and writing down a language you may inadvertently make
this knowledge accessible to commercial interests.
vary as to what other aspects of culture are protected by law. For example aspects
of cultures that are orally transmitted and performance-based may not be pro-
tected by copyright laws, even though these are perhaps some of the most val-
ued aspects of language from the community point of view (Janke, 2009). Most
legal systems fail to recognise rights in communally held knowledge or cultural
expressions that are part of shared traditions. Knowledge of medicines, herbal
cures and other products can be protected under various patent acts. However,
this may not be the case if they are based on traditional knowledge handed down
over many generations. Even though ‘language’ does not get a specific mention,
some countries are signatories to various international declarations, such as the
Declaration on the Rights of Indigenous People (United Nations, 2007). Article
31 states:
Indigenous peoples have the right to maintain, control, protect and develop their cul-
tural heritage, traditional knowledge and traditional cultural expressions, as well as
the manifestations of their sciences, technologies and cultures, including human and
genetic resources, seeds, medicines, knowledge of the properties of fauna and flora, oral
traditions, literatures, designs, sports and traditional games and visual and performing
arts. They also have the right to maintain, control, protect and develop their intellec-
tual property over such cultural heritage, traditional knowledge, and traditional cultural
expressions.6
To address the fact that copyright laws do not always align with Indigenous peo-
ples’ expectations, a range of licensing frameworks have been developed, includ-
ing Creative Commons, Traditional Knowledge Licences, and software licences
such as MIT, Apache and GPL. Creative Commons (CC) licences are written to
conform to international legal treaties, are intended to be effective worldwide,
and are legal tools for licensing content. A resource can be released under one of
six main CC licences, allowing it to be copied, modified and/or redistributed, for
commercial or non-commercial use, according to a creator’s needs.7
Traditional Knowledge (TK) licences and labels can be used separately or in
combination with CC licences to add conditions of use and information about how
material should be respectfully and ethically used, reproduced or copied according
to community expectations and obligations. Under such licences material may be
labelled as open to all users, as restricted to women or men, as requiring attribution,
or as being for either community or commercial use. Although these licences are
not legally binding, they are informative and useful for developing rights awareness
for a resource’s users.8
in a good position to complete the formal ethics applications that most institu-
tions require before you can commence fieldwork. Some of these involve being
respectful and observant of particular community requirements about access;
moderation of content and the handling of secret or sacred materials; document-
ing informed consent; and understanding moral rights, including the right of
attribution. Custodians of knowledge may want to set boundaries about access to
particular types of content for particular groups of people. Apart from issues that
may be part of the standard processes of informed consent, such as document-
ing instructions for archiving, another way to assert a community’s language
rights in publications and research materials is to include specific wording. For
example:
(T)he language and information contained in this book includes the traditional knowl-
edge and cultural expression of the X people. The information is published for the pur-
poses of knowledge preservation, education and language maintenance.
Some publications state specifically what uses are not permitted. For example
Walmajarri Plants & Animals (Doonday et al., 2013) expands on this:
This information should not be used commercially in any way including in tour-
ism, food technology including bush tucker applications, medicines, pharmaceuti-
cal products, health and beauty products, storytelling or as trade marks [sic], patents
and designs, without observing the Aboriginal cultural protocols of prior informed
consent, attribution to traditional Aboriginal communities, cultural integrity and the
sharing of benefits.
As your research involves people, most universities require you to submit an appli-
cation to a human ethics committee before you can start your research. This is often
a statutory requirement that must conform to national standards, and it may be
a pre-condition set by the funding body. An ethics application usually includes a
number of documents that you will use in the field, including a project informa-
tion sheet and a consent form. Some researchers include examples of these docu-
ments in their theses. Start your ethics application early, as some universities can
take many months to give final approval.
Ethics applications require you to demonstrate an understanding of the ethical
issues of working with people in your disciplinary area. For a field linguist, this
means respecting the rights of the language speakers (‘research participants’) and
demonstrating that your research is of benefit to them or to society at large, or,
at the very least, won’t harm anybody. As Woods (2017, p. 88) notes, you should
35
try to “find ways in [your] own work practices, that take into account that con-
trol of language and cultural knowledge is a high priority for Indigenous people
and must be factored into any ethics applications and agreements processes”. For
example, you might explain that the community values their language yet it is
endangered and that your research will provide resources and opportunities to
participate in efforts to maintain the language. Bear in mind that most human
ethics applications have medical research in mind and are designed primarily to
minimise any physical risk to research participants and to ensure that they have
given informed consent.
You may need to explain why the identity of the research participants needs
to be maintained (not identifying names may be seen as disrespectful), rather
than anonymised (which is the norm for many areas of research). Furthermore,
in endangered language communities, which are often small and close-knit, ano-
nymity may be impossible within the community (Singleton, Martin & Morgan,
2015, p. 11). Although audio recordings are personal records, adding film to the
documentation mix adds another dimension (§8.5.5). You may need to identify any
potential risks (e.g. infringement of cultural and intellectual copyright) and how
you plan to mitigate these. When working with vulnerable groups such as children
(§9.3.4) and minority groups, you will need to show how you plan to ensure lan-
guage speakers have chosen to be research participants, rather than been coerced.
If you are going to provide financial compensation to participants, you will need to
state how much you will pay people and justify the rate (e.g. by citing rates used by
other local researchers or organisations).
You will also need to demonstrate that participants have the option to with-
draw from the research project, decide on the future of their recordings, including
whether to archive or not, and determine access conditions and even destroy any
recordings should they wish.
• contact details for the researcher and their university’s ethics committee
• who to contact if there is a complaint (for example a supervisor/advisor or
an ombudsman at the researcher’s university).
When formulating these statements, avoid technical language and jargon, and
use common words, active voice and short sentences. Put yourself in the shoes of
the interpreter who may have to translate your document! Ask your supervisor/
advisor or recently graduated students for examples. There should be no need to
reinvent the wheel! Although the information should not be too detailed, be pre-
pared to explain the types of linguistic data you hope to collect (how many peo-
ple, how long for, what topics), what it will be used for (e.g. PhD, publications in
book, journal, audio, film or web formats) and what will happen to it (where
will it be kept, who will access it and how?). Remember that some people will not
know what a PhD or journal article is, so you may need to think hard about how
to translate these concepts. Bringing an example to show may be a good idea.
Figure 2.1 shows a participant information sheet for a project involving
Australian Indigenous people. This ‘card’ is translated for the research
participant into north Australian Kriol by a local interpreter. Note that informa-
tion about short- term and long- term benefits to the community is clearly
spelt out.
Figure 2.1
An example of a participation statement developed by Caroline Jones for a project on Kriol
(N.B. identifying information has been deleted)
You are invited to take part in a research study. Our study is about how children learn to
talk, and what helps children.
You can make a choice. You don’t have to agree to take part.
You would join a group conversation, and fun activities like retelling a
story. We’re interested in normal everyday talk, what you think.
You can pull out whenever you want. We won’t ask why.
30–60 mins If you get upset we will help. Remember we have to
report serious criminal activity.
Long-term benefits: community, school, XXXXXXXX we’re learning info to help
families and teachers. No benefit for your family right now.
You can choose who sees your video and where we store it. We will write about
the study, but we will check drafts with you and the community first.
if we document broadly what research materials cannot be used for. Being as specific
as possible is important because, as Woods (2017, p. 87) cautions, “(a)greements
that include clauses about wide ranging non-specific, ongoing secondary uses of
language data is problematic [because] it denies Indigenous people the right to claim
their knowledge as their own and to protect it in any way whatsoever [and] the use
of such clauses shortcuts the need for ongoing consultation with the Indigenous
community”.
Similarly, when working with archival recordings, you may need to consult the
speaker (or their next of kin if they have passed away) and get their permission to
publish or reuse the recordings. Another issue that is often specified in consent
forms is the speaker’s right to ask the researcher to delete parts of the recordings
that may contain sensitive or unusable content. If such a request is made it is impor-
tant that the researcher keeps a record of the reasons why a particular recording, or
part of a recording, has been deleted.
Depending on the nature of the recordings, it may be a good idea to get consent
to publish the recording. For example, if it is a story that could be published in a
book or be made publicly available online, you should document the speaker’s or
the broader community’s views about this.
Fieldwork … requires the cooperation of several players to succeed. The tendency, none-
theless, has been to attribute all success to only one person –the lead researcher, and
failure to acknowledge the ‘other’ invisible players.
(Wasamba, 2015, p. 129)
Language speakers may invest large amounts of time and energy in teaching you
and may wish to be acknowledged for their contribution. Speakers and partici-
pants in a project have the right to be acknowledged for work they do on a project.
For example, if someone has undertaken consultation which ensures that the com-
munity understands the project, finds and arranges speakers to work with you, or
arranges a performance to be recorded, this contribution should be acknowledged.
Below are some examples of how to acknowledge different contributions that peo-
ple and organisations have played in your research. Acknowledgements may go in
the front matter of a book (e.g. grammar) or as a footnote on the first or last page of
a journal article or a paper in an edited volume (depending on the publisher’s style):
• The X language material in this study was produced by X in 2017, who lived
in Z; where they are recognised as a cultural leader.
• I thank the community of X for enabling me to conduct research on lan-
guage Y and am grateful to Z who welcomed me into their home and taught
me with patience and encouragement.
• This research has been endorsed/approved by X organisation, which repre-
sents Indigenous interests in the region where X language is spoken.
• I thank my colleagues and collaborators in the field X, X, X etc.
39
[T]he very idea of plural authorship challenges a deep Western identification of any text’s
order with the intention of a single author … Nonetheless, there are signs of movement
in this domain, and we may anticipate a gradual increase in experiments with multiple
authorship. Anthropologists will increasingly have to share their texts, and sometimes
their title pages, with those indigenous collaborators for whom the term ‘informants’ is
no longer adequate, if it ever was.
(Clifford, 1983, p. 140)
Woods (2017. p. 87) also advocates a model of authorship which acknowledges signif-
icant contributions from members of a speech community, noting that “co-authoring
with Indigenous co-researchers is beginning to be considered as a practical way of
managing copyright … [and] is already a practice in the hard sciences but is not yet
seen as an option in the Arts and Humanities”.
If there are speakers who have contributed written definitions to dictionaries this
is a good basis for co-authorship (note that the maker(s) of dictionaries are more
often referred to as ‘compilers’ rather than ‘authors’). Even where speakers contrib-
ute oral definitions which are transcribed by linguists, prominent acknowledge-
ment may be appropriate, for example by listing all of the major players on the front
cover and separating roles into ‘Contributor’, ‘Compiler’ etc. A language speaker
who has played a significant role in translating, transcribing, checking and provid-
ing language material that is crucial to a publication may also be a co-author.
Other rules of thumb apply to different publication types. In the case of vol-
umes of collected texts, one model might be to name speakers as authors. Another
model might be to name narrators of individual stories in the book as authors and
the linguist(s) as the editor(s) of the volume. In the case of grammars, attributing
authorship to speakers makes less sense, as a grammar is an analysis of a language
rather than simply formatted data. In this respect, it ‘belongs’ to the linguist. Even
so proper acknowledgement of speakers’ contributions is crucial. Where a language
only has a small number of speakers remaining or if you have worked with a small
number of recognised language experts, you might consider including the speaker
in the title of the grammar. For example, instead of “Grammar of X, a language
40
The information included in the reference ensures that the example sentence links
back to the corpus and is ‘discoverable’. It also acknowledges the ‘creators’ (speakers,
other linguists). The bracketed information is as follows:
• Bilinarra language
• AN speaker’s initials
• Narrative genre type
• RN recordist’s initials
• 1990 year (or date) of recording
• -002 sequentially numbered recording of that year or date
• 09:45 min time into the recording where the example sentence begins
In a published work, you might also include a footnote after the first example sen-
tence outlining the abbreviations in the sentence reference, for example:
All examples will be referenced with the following information: Language (Bilinarra),
speaker (initials), genre and recording. The Bilinarra speakers were: AN etc. The gen-
res are procedural, description, narrative and conversation. The name of the recording
includes the linguist’s initials. The linguists were: RN etc. The recording label also includes
the year the recording was made and the start time of the utterance in a recording, for
example 09:45 min.
This information might also occur in the introduction of a grammar (Chapter 6). If
you are making a dictionary database this information can be documented in the
source information (Chapter 7). Alternatively, you may include it in the metadata
for each recording and the source field simply refers to this.
41
Start preparing for your field trip well in advance. The more planning you do, the
more likely it is that the fieldwork will run smoothly. Fieldwork may take unex-
pected turns, as seen, for example, in Stephen Levinson’s story about planned
work in Papua New Guinea described in §2.2. Although you cannot decide in
advance exactly how your fieldwork will run you can plan for contingencies. Thus,
greater planning gives you greater flexibility in the field, as you can have a num-
ber of tasks and aims at your fingertips that you can work with if circumstances
suddenly alter.
If you will be driving in the field make sure your licence and vehicle registra-
tion is up to date. Depending on where you are going, you may need to do a 4WD
course. Consider whether you are better off hiring a car or encouraging your
university to buy one. For long-term or repeated field trips buying a vehicle may
be a more economic option than hiring a vehicle. In some countries it is difficult
to hire a 4WD vehicle, or even an ordinary car, if you are under 25 years of age.
If you are using local transport, study the options and consult the latest travel
guides and websites.
You should consider whether you need vaccinations or other medications spe-
cific to the field site. Find out if there are any illnesses endemic to the area where
you are going and where the nearest hospital is (see also §9.3.6). Even if you are not
going overseas it may be a good idea to get a flu shot or rabies shot, depending on
what health services are available at the field site. Before travelling your university
may require you to do a formal risk assessment for your fieldwork. Don’t leave this
until the last minute, as you may need to purchase additional safety equipment or
undertake first aid or other training. Note that first aid kits do not usually include
paracetamol and other medications.
Assemble your research materials, field guides and recording equipment
(which we discuss in the next chapter), and pack and weigh them. If you are
flying, you may need to purchase extra luggage. Pay particular attention to
any customs restrictions, either on your journey to the field or on the way back
home.
what is common practice amongst linguists or other social science colleagues who
work in the region where you plan to go.
If you are planning to compensate speakers, consider the practicalities of this.
If it is with food or other resources, how will you obtain them (see §1.3 for Evans’
story about travelling to the field laden with knives and ladies’ underwear)? Is
money more appropriate and will it take the form of cash, cheques or a bank
transfer? If cash is involved, investigate the logistics of changing money in the
country where you are going. In some areas, people may not have bank accounts
or access to the internet. Find out what is going to be practical in the field site
and discuss it with your institution’s administration section to ensure that the
method you propose is acceptable to the university and that you have all the
necessary paperwork. If you are paying people cash, you will need some way to
document their receipt of payment. Note that in some cultures it is very offensive
to ask for a signature so you may need to have oral documentation of the receipt
of payment or gifts. You will also need to think in advance about how much
cash you will need. Is your project a grammar, working with few people for long
lengths of time; or is your project experimental, working with lots of people for
short periods? These types of considerations will determine how much cash you
might need and what kinds of denominations you need to organise. Your institu-
tion may also require you to fill in specific forms as regulated by your country’s
taxation system (if working in your own country). Also think about whether you
will be paying participants after each session, at the end of the week or at the end
of the field trip.
2.8 SUMMARY
Fieldwork can be a unique, exciting and formative experience, and even though
there may be logistic hurdles where things don’t go entirely to plan, many sea-
soned fieldworkers regard their time in the field as a highlight in their train-
ing and experience as linguists. The personal and academic rewards of
fieldwork –discovering new, exciting things about a language, having intercul-
tural experiences that no travel agent could ever imagine, forming new friend-
ships, learning new skills and discovering talents you thought you did not
have – are many.
Although this book makes many suggestions for methods in the field, it is
important that you maintain a critical and open mind, as some tried and tested
methods that work in one situation will turn out to be a failure in others. It is
OK to abandon, or adapt and innovate. Also keep in mind that, no matter what
academic goals motivate the research, the rights of the host community and the
personal ethics, integrity and sanity of the fieldworker must always take priority.
In the next chapters we turn to techniques for recording, analysis and manage-
ment of data.
45
For discussions about choosing a field site and ethics see Chapters 10 and 11 in
Bowern (2015) and Chapter 1 in Newman & Ratliff (2001). For discussions of ethics
and rights and responsibilities in linguistic fieldwork see Austin (2010), Rice (2006,
2012) and Wilkins (1992). Newman (2007, 2012) gives a detailed overview of copy-
right and legal issues in relation to linguistic fieldwork. For a discussion of ethics
and research practice in sign language research see Crasborn (2010) and Singleton,
Martin, & Morgan (2015). For a good ‘how to’ resource, see the frequently asked
questions section on the Research Network for Linguistic Diversity (RNLD) web-
site.9 Join the mailing list! For a very extensive checklist of things to take to the field,
see James Fox’s list.10
NOTES
REFERENCES
Ahlers, J. C., & Wertheim, S. A. (2009). Introduction: Reflecting on language and culture
fieldwork in the early 21st century. Language and Communication, 29(3), 193–198.
Austin, P. K. (2010). Communities, ethics and rights in language documentation. In P. K.
Austin (Ed.), Language documentation and description (Vol. 7, pp. 34–54). London: SOAS
University of London.
Benedicto, E., Antolín, D., Dolores, M., Feliciano, M. C., Fendly, G., Gómez, T., Baudilio, M.,
& Salomón, E. (2007). A model of participatory action research: The Mayangna linguists
team of Nicaragua. In Proceedings of the XI FEL Conference on ‘Working Together for
Endangered Languages-Research Challenges and Social Impacts’ (pp. 29–35).
46
Van der Veen, L., & Medjo Mvé, P. (2010). Theory and practice of data collection for phonologi-
cal analysis. [Link]-[Link]/fulltext/Van%20Der%20Veen/2_Steps_phonologi-
cal_[Link]
Wasamba, P. (2015). Contemporary oral literature fieldwork: A researcher’s guide. Nairobi:
University of Nairobi Press.
Wilkins, D. (1992). Linguistic research under aboriginal control: A personal account of field-
work in central Australia. Australian Journal of Linguistics, 12(1), 171–200.
Woods, L. (2017). Ethics in linguistic research and working with Indigenous communities:
Redefining collaborative linguistic research, Indigenous and non-Indigenous perspectives
(Master of Applied Linguistics). Monash University, Melbourne.
48
3.1 INTRODUCTION
It goes without saying that technologies are changing all the time. Without a time
machine or a linguists’ version of the Tardis that would enable time-travel to dis-
tant futures, it is not possible to future-proof every aspect of data collected in
DOI: 10.4324/9780203701294-3
This chapter has been made available under a CC-BY-NC-ND license.
49
Figure 3.1
Horace Watson recording Mrs Fanny Cochrane Smith. Sandy Bay, 10 October, 1903. Photographer:
Howard & Rollings, Hobart. Collection: Tasmanian Museum and Art Gallery Q7709
fieldwork. Options for recording many aspects of language have changed dra-
matically since Fanny Cochrane Smith committed her voice to a wax cylinder.
Whereas sound was once recorded on wax cylinders, reel-to-reel tape or analogue
tapes, nowadays most items are born-digital. Even so, it is important to approach
technological innovations with a healthy degree of scepticism. For example,
whereas some clay tablets have survived for many thousands of years, the life
expectancy of an analogue cassette tape is several decades at the very best. Along
with the affordances of technological advances come new practical and technical
issues as well as ethical ones. Because technologies are changing so rapidly, in
the following sections we limit our discussion to general principles, with little
information that recommends particular brands of devices or equipment. It is a
good idea to ask around, see what your colleagues or supervisor/advisors do, and
refer to online discussions or websites where reviews of technical equipment are
posted.
Before making decisions about what kinds of equipment are suitable for your
task, try to find out about the kinds of recording locations you will have access to in
the field. Is it likely that recordings will be made outdoors or are there community
facilities that can be turned into a temporary makeshift recording studio? Will you
have access to electricity, or do you need to take lots of batteries or a portable solar
50
panel? Identify the type of recording that will best suit your research needs, and the
needs of the community. Your modes of transport for the field trip will also deter-
mine what is practical in terms of the size and weight of your equipment collection.
If you have access to a 4WD vehicle that is semi-permanently stationed near the
field site then your choices will be very different from those you will make if you are
travelling, at least part of the way, on bicycle or on foot. Budgets also limit choices.
While in general terms you get what you pay for, careful attention to recording
techniques enables you to make very good recordings with equipment on the less
expensive end of the range.
3.2.2 Microphones
Several microphone types are regularly used in language documentation (for a
detailed discussion of the pros and cons of different types see Boyd & Hardy, 2012;
Margetts & Margetts, 2012, pp. 24–32). No single microphone will be suitable for all
recording situations, so you should have a small range available to suit the variety of
situations you expect to encounter. A stereo vocal microphone is good for recording
singing and oratory; a head-mounted microphone for phonetic recordings, and a
wireless lapel microphone for recording events where people might move around a
lot (see §[Link] on language acquisition studies).
52
accommodates many inputs. This can be very complex to set up, but is good
for recording conversational data.
• Head-mounted microphones work well when there is background noise and
where you are targeting only one speaker. They are particularly good for
phonetic studies as the distance between the speaker and the microphone
remains consistent throughout and comparable across different recordings
(see §5.10).
• Shotgun microphones are mounted on a video camera, and they are ideal
for capturing interactions of large groups of people, as well as soundscapes
e.g. birds, and general outdoor ambience. It is possible to place two shotgun
microphones in a crossed pattern to create audio depth of field.
3.2.3 Windshields
No matter how good your microphone is, it is important to include a wind-
shield in your kit. Shock mounted or suspension windshields are designed to
protect your microphone from wind noise (the sound made by the wind buf-
feting the microphone) and from the shock and vibration caused by handling
the recording device. Other windshields come in many shapes and sizes, and
may be known colloquially as ‘fluffies’, or ‘dead cats’ because some of them are
made from artificial fur. For effective wind protection there should be an inner
foam layer and an exterior layer of fluff, and many windshields integrate these
two layers. It is important to make sure the windshield is designed to fit your
microphone –use a short fitted windshield for a short shotgun (i.e. a ‘stubby’),
and a small ‘minifur’ or ‘dead kitten’ one for a lapel microphone. When filming
with a microphone and a windshield that is mounted on the camera make sure
the furry bits of the windshield don’t intrude into the camera shot. At times it is
hard to see this happening in full sunlight conditions and with a small camera
monitor, but a furry-framed film can be a very disappointing outcome. One way
to combat this is to use hairspray or gel to control the fluff. Sprinkling pepper on
a ‘fluffy’ is a good way to keep dogs from running off with your microphone if it
is positioned on the ground.
54
advantageous. For example in the BSL (British Sign Language) documentation pro-
ject, four cameras were used to film sign (Schembri, 2010, p. 131). Although having
more than one camera may be beyond your budget, and coordinating a multi-shoot
event may seem a daunting prospect, it is worth considering the possibilities. You
can always recruit others to help! If you do use more than one camera, to make edit-
ing simpler make sure that they have similar recording capacities and you use the
same frame rates (see §3.3.3).
All of that said, remember that it is certainly possible to use a high-end expensive
camera badly, and a lower range one well. There are three main criteria for choosing
a video camera:
• There should be an output format of either AVI or AVCHD, which can be
edited using professional software such as Adobe Premiere or Final Cut Pro,
rather than consumer-grade editing software. Note that AVI and AVCHD
are both compressed formats. This is why FCP and Premiere can use them –
otherwise there would be too much data in the video stream for the software
to cope! Some cameras can record both uncompressed and AVCHD, but
they’re not generally in the budget range of a field linguist.
• Your camera must have inputs for an external microphone(s) or radio mic
receiver (see §3.2.2). It should be possible to control the recording level of
the external microphone(s). Linear PCM input is recommended. This is an
uncompressed audio data format and closest to the original analogue signal.
Make sure your microphone outputs and camera inputs are compatible (i.e.
either both mini-jack or XLR) and, if they are not, that you have adaptors.
Some cameras have two or more microphone inputs, which makes it very
easy to record two people on separate tracks, or to record close-up sound
with one and general over-all sound with the other.
• There must be a headphone socket for monitoring sound levels. Headphones
should be ‘monitoring headphones’ which are specifically designed for the
range of the human voice.
Although hand-held film recording has its place, there are good reasons to purchase
a quality tripod for your camera(s). Apart from dramatically improving the quality
of the film by avoiding the jitters, if your camera is fixed to a tripod this frees up your
hands to step away from the event, be less intrusive, and concentrate on other things.
Another factor is that holding a camera steady for long periods of time is very tiring.
57
If you don’t have a tripod, a sandbag or a bucket can be a good substitute, or try sitting
cross-legged with your elbows on your knees as a make-shift tripod. Small desktop or
mini tripods are also very useful for mounting lapel microphones if placing them on a
person’s body is not an option (§3.2.2). If you anticipate that you’ll be panning and tilt-
ing (moving the camera sideways and pointing it up or down) while the camera is on
the tripod, ensure the tripod has a ‘fluid head’. This is a lubricated system that makes for
smooth horizontal and vertical motion of the camera. As a note of caution –unless you
look after your tripods, the fluid heads can quickly get clogged up with sand and grit!
A lens hood to protect the camera lens and to prevent light from hitting the lens
from the sides and creating flare is also a good idea. Pictures taken with a lens hood
tend to have richer colours and deeper saturation. Don’t forget to think about bat-
teries. Many cameras come with a standard size battery that will record for a couple
of hours. Most brands provide another larger battery that will fit in the camera, so
buy two and develop good battery charging habits. Before a recording session the
in-camera battery should be fully charged, and there should be another charged
battery ready in the camera case.
fact that hard drives fail, and this, along with human error, loss, theft and software cor-
ruption are all reasons to back up data regularly and meticulously. If you have multiple
backups you will feel vindicated if someone sends you a message like, “lucky we did
that backup the other day –the other hard drive seems to have crashed!”
When thinking about how much space you might need to back up data, do the
maths. The amount of storage you will require depends on the camera you use
and the recording settings. For example, a video camera that records at a bitrate of
15Mbps (megabit/second) would require under 7GB for one hour of footage. Bear
in mind that the storage space required to edit the footage will be greater –an hour
of ProRes 422 25fps material edited in Final Cut Pro will require up to 50GB just for
the media, with additional space needed for previews and edit rendering.
Some memory cards are not suitable for the high data rates required for high
quality video recording. Check the speed rating of your cards, and their com-
patibility with your camera. For an SD-type camera, use SDHC or SDXC cards.
Consumer-quality video cameras may not be compatible with high-speed cards.
Be sure to take spare memory cards in case your camera or audio recorder fills up
mid-shoot. Some cameras have two slots that allow for a smooth transition between
cards when one card fills up. Check the recording time of your camera batteries and
make sure you have a spare, and the battery charger.
Check that you have power adaptors and chargers for all your electronic equip-
ment, and the appropriate country-specific power plugs. Charge everything before
you head to the field. You may want to start recording as soon as you arrive! Take
extra audio plugs/adaptors and male/female joiners –XLR to audio-jack, RCA to
audio-jack, audio extension cables, headphone jack adaptors, microphone and head-
phone splitters (to allow the use of multiple mics/headphones on a single channel).
Always turn equipment off before changing batteries. The electronic circuits in cam-
eras and recorders can be damaged by the sudden power surges of switching batteries
while turned on. Repairs can be very expensive, not to mention the inconvenience of
doing fieldwork without a recording device! Consider bringing spare drives or USB
sticks to leave copies of the recordings behind for the community to use.
room for two video cameras, an extra microphone and other bits and pieces weighs
around six kilograms). Inside these cases dedicated foam nests can be fashioned to
protect the equipment. Other strategies to protect equipment include waterproof
bags for rain, river crossings or boat trips. It is very important to keep gear clean,
and in some field sites, dust can be a hazard to cameras as well as to other equip-
ment. Buy a puffer brush and a can of compressed air to clean your camera lens.
Extreme high or low temperatures can affect equipment adversely, and working in a
humid environment requires extra care. Moving from cool, air-conditioned environ-
ments such as a vehicle, to a warm, moist outdoor environment can cause condensa-
tion to form inside the camera or on the lenses and this may affect the clarity of the
image, or even prevent the camera from operating. If this happens repeatedly, it can
lead to mould growing in the camera. To prevent condensation from forming, wrap
the camera in something like a dry towel or keep it in a cloth camera bag before mov-
ing from a cool environment to warm. This prevents the warm, moist air from coming
into contact with the camera. Keep the camera covered until it has had a chance to
warm to a temperature above the ‘Dew Point’, the temperature below which conden-
sation will form. Silica gel crystals, or even roasted uncooked rice, can be added to
airtight containers to keep equipment dry and protect it from humidity. Plugs can also
be purchased to protect your computer ports from ants and other unwanted intruders.
The weakest part of our planning turned out to be the most critical of all.
The flexible solar panel we had purchased, which we had assumed to be the
most robust of all equipment, turned out to simply melt in the Papua New
Guinean midday sun. It failed in the first week of use, leaving us with no
power for the first month. With an almost entirely digital set up, this was just
about the worst thing that could have happened … with a multimeter, tin foil
from a chocolate wrapper and some gaffer-tape, I was able to chain enough
of these batteries together to produce enough voltage to run or charge other
equipment.
(Honeyman, 2006, p. 20)
batteries in checked-in luggage, so your journey will be less stressful if you think of
this ahead rather than having to deconstruct all of your carefully packed luggage and
locate the batteries at the airport check-in counter.
to these sounds that you hardly notice them (until you listen to your recordings
later). It is often harder than you think to find a quiet place where human voices can
be recorded with clarity. It is also a fact that the more naturalistic the field session,
the less likely it is that your recordings will be ‘studio quality’. For much linguistic
work this does not really matter. That said, there are certain steps that can be taken
to ensure the best quality audio in the circumstances, such as thoughtful selection
and placement of microphones (§3.2.2).
When recording sound, the aim is to maximise the amplitude of the speech signal,
while at the same time avoiding clipping or signal distortion. As shown in Figure 3.2,
clipping occurs when the amplitude of the signal exceeds the maximum range, and
it introduces high frequency components. This is particularly problematic if you’re
63
Figure 3.2
View of an audio waveform in Audacity that shows a well-recorded segment of speech contrasted with
a segment where the levels were set way too high and there is extreme clipping (Illustration: Maxine
Addinsall)
settings 1.0
Mono, 48000Hz Extreme clipping
32-bit float 0.5
Mute Solo
+ 0.0
L R -0.5
-1.0 Good sound levels
Don’t forget to record important aspects of the context of the recording session.
If the session involves multiple conversations, or even if the session is a more tradi-
tional question and answer routine, make sure the audio signal from all participants
is recorded with clarity, and, if necessary, adjust the placement of the microphone(s)
accordingly. For instance, if somebody asks questions and the recording does not
pick these up, recordings of answers such as “Yes. No. Yes” without further context
may be next to useless. It is a good idea to record a few minutes of ordinary conver-
sation before you start the session proper, as this will enable you to adjust the sound
input to appropriate levels. Some people speak louder than others, and be aware that
their voice may get even louder as the session continues and they relax! Record the
linguist/elicitor at the beginning of the recording, giving some general facts about
the session, the date and time of day, the participants, and the location of the record-
ing. Also record a few minutes of the ambient environment with the same micro-
phones that you’ll be using for the main session. If there is general background hum
or noise, this ‘noise-bed’ can make it a lot easier to clean up the audio later.
At that time, 1973, I bought one of the first stereo Nagra recorders imported
to the United States … In order to demystify the whole thing I ran a lead out
of the Nagra into a small cassette recorder so that everything I was record-
ing on reel-to-reel generated a simultaneous cassette copy. I would later sit
down with the reference cassette and a bunch of headphones and play the
recordings back to people. It was important to play back over headphones
rather than over loud-speakers because I didn’t want the sound to dissipate
into the world. I wanted people to listen to the recording as I had been lis-
tening when I originally recorded it. So we would sit down around the cas-
sette recorder with one, two, three, four, five people wearing headphones
and they would spontaneously start talking about the recordings as they
listened. I soon realised this was the beginning of a whole methodological
programme for recording and feedback and more recording. I came to call
this method ‘dialogic editing’ and was inspired by Rouch’s similar experi-
mental methods in film feedback. Recording wasn’t just about gathering
things but it was the invitation to a conversation about what was going on
in the world as recorded, about what we were listening to, how we knew
and questioned the world by listening to it, how we edited and arranged
its meanings like a composition. This became a method through which
I could really engage people, so that’s how it became the foundation of a
dialogic recording process and the basis of all the research I did in New
Guinea across the lines of language, music, song, and rainforest ecology.
(Feld, 2013)
65
Check the horizon line and make the camera level –some tripods have a spirit
level which helps with this, but note that a tripod resting on sand or soft ground
may get out of kilter very easily. Take time to frame your camera shot and make
sure it accomplishes the aims of your research project. The importance of ensur-
ing that the camera angle established is wide enough to capture all of the action
is akin to the need to get the settings right when making audio recordings. If the
camera is too close to the ‘subject’ this can lead to the loss of valuable data. If
you are studying signs or gestures that employ a large signing or ‘gesture space’,
it is frustrating if some of the details of how the signs or gestures formed are off-
camera. Keep in mind that dynamic human interaction is by its very nature unpre-
dictable. It is better to take a wide view that will encompass all of the ‘action’ than
to zoom in and out, focusing on details but then probably missing subsequent
relevant ones. Zooming and panning is an art that is usually poorly executed by
amateurs (most field linguists!) and should be avoided. The use of film in language
documentation is very different from making a movie, and it is best not to imag-
ine that your framing problems will be solved by some magical process in post-
production. Using several cameras that take different views is one way to achieve
both a close-up and a distant perspective. This may be of particular importance
if you are recording songs or performances with lots of people involved. When in
doubt take a wide view!
Consider how the people you are filming look, and, if possible, seek local advice
about any community-specific considerations. For example, are there ways of sitting
that would be seen to be culturally inappropriate? In Arabic cultures, revealing the
soles of your feet, or the bottom of your shoes, is regarded as being disrespectful. In
other communities women will only sit with their legs to the side, or only sit cross-
legged if their groin is well covered by billowing skirts. Is the clothing worn by the
consultants ‘presentable’, or more to the point, will people be happy about how they
look when they review the films later? You don’t want to spend many hours filming
only for the participants to decide in retrospect that they wished they had different
T-shirts on or didn’t have their petticoat showing! Particular patterns in clothing,
for example fine stripes, are to be avoided (if possible), as they can lead to unwanted
film effects. In some filming situations, the dazzling effect called the ‘moiré effect’
stems from interference between fine detail in the scene, and the sensors in the
camera. When filming sign language, the colour and pattern of the clothing can also
make a difference as to how easy it is to see the fine details of the articulation of the
hands (see Chapter 8).
around the narrator, as well as the use of the ground in front of them.
To achieve this sand stories were filmed with two cameras, thus capturing
two viewpoints simultaneously. One camera, attached to a lighting
stand, pointed vertically down and filmed the sand story space on the
ground from above. The height of the boom was sufficient for the camera
to capture the breadth of this space. This simple device is stable, yet it can
be moved easily to adjust for movements of shadows and minor relocations
of the storytelling space. The other camera, attached to a tripod and
placed at a distance, captured a front view of the narrator, including the
gestural extent of their arms (plus any augmentation provided by story-
wires or sticks). Audio is recorded using a separate audio recorder (see
Figure 3.3).
Figure 3.3
Recording Kaytetye sand stories. Carol Thompson, Myfany Turpin and Tommy Thompson
(Photo: Jennifer Green). Gear (i) 2 HD digital Panasonic video recorders, (ii) Fostex
FR2 digital audio recorder, (iii) Rode NT4 microphone in a Rode Blimp windshield
on the microphone stand, (iv) Sennheiser wireless microphone pack, (v) two tripods,
(vi) lighting stand, (vii) reflector, (viii) two sets of headphones, (ix) pelican case for the
video gear
Source: Green & Turpin, 2013
68
become a substitute for more traditional methods (Duranti, 1997, pp. 115–116).
Keeping daily notebooks helps you to keep track of people, places and dates, field
recordings, the tasks you have completed and what is still left to do, and all man-
ner of other information that will eventually become part of your metadata (§4.2).
Notebooks are also a good place for sketch maps and diagrams, either made by
yourself or others in the research team.
Hardcover stitch-bound notebooks made of acid-free archival quality paper are
best. Some prefer to use smaller format A5 size notebooks that are easier to carry
in backpacks or handbags than are some larger A4 size ones. It is a good idea to
use waterproof pens, and to write on one side of the page to avoid ink bleeding
through and making the reverse page illegible. Some linguists only use the right side
of the page for fieldwork sessions and keep the left page for later comments. Cross-
references should be made in the notebooks to all field recordings. Notebooks can
be sorted by date or by topic, depending on the diversity of your fieldwork locations
and types of recordings. Make a note of all abbreviations and symbols that you use
(Bowern, 2015, p. 52). Using a diary to keep track of the different tasks in each field-
work session, and to note emerging hypotheses and to plan future work may be a
good idea. Other linguists use the pages at the back of their field notebooks for this
sort of information.
If you are using notebooks extensively or checking printed drafts of dictionaries
or other resources, it is not a bad idea to use different coloured pens for subsequent
checking sessions. The different colours can help you sort out when you made a cor-
rection to a previous annotation. For example, if you check something three times,
it might be unclear which annotation was the final correction. But if you used red
for corrections in the second session and green in the third session, you know to
take notice of the green corrections. Coloured pens can also be useful for coding
different kinds of information, for example which speaker told you what in a single
recording session, which pronunciation of a word came from whom and so on.
Of course you need to archive the code for your colour scheme, and hope that the
colours don’t fade over time!
If you are new to the language and the speech community, your first fieldwork ses-
sion should involve eliciting words within a simple semantic domain, as we outline
in Chapter 7. In this section we discuss the process of recording and conducting the
first fieldwork session. It is a good idea to try to record one speaker in a quiet loca-
tion. Other people may want to be present, but multi-party conversational interac-
tions are too complicated to manage and transcribe, at least as a first step. Keep this
first session very simple and well-planned. For this first session it is a good idea
not to demand too much concentration from the speaker (and yourself) and at the
same time reinforce the fact that they are the experts in their own language. Use
this first session to establish a good rapport with the speaker and put a meticulous
recording and management routine in place.
70
Before you begin have a brief conversation with the speaker to help you both
relax and feel comfortable together. The sorts of things that constitute such ‘small
talk’ in cultures differ. If you are not sure what this is, you can always begin by say-
ing how happy you are to be starting this work and how you are looking forward
to working on the language. Following this you will need to discuss the project in
general terms. Official documentation of consent might be done later after you have
recorded the session and participants have a clearer idea of what they are consent-
ing to (§2.5).
You will need to set up and test your equipment. Explain to the speaker what you
would like to do in this session and run through one or two questions so that you
both know what to expect. Write down basic metadata (date, time, place, speaker,
topic) (§4.2.4). Do a dry run and check your equipment again. Make sure your
recording device is in view so that you can see if the audio levels are right, if the
batteries are getting low, or if the card is nearly full. Being able to focus on your
elicitation while monitoring your equipment is a necessary skill that will develop
the more you do it. When you are both ready, turn on the recording device and
record some simple metadata, including the name of your field notebook and page
number. Begin the elicitation. At a suitable point, after a few minutes of work, stop
the session and play it back for the speaker to hear (you won’t need to do this in all
subsequent sessions). Continue the session.
It is important to avoid being proscriptive about what the consultant should and
shouldn’t say. Crowley (2007, p. 96) suggests that one of the golden rules of elicita-
tion is to never tell the speaker that they have misunderstood. Even though you may
want to hear only the target word with two repetitions; such requests may make
people feel bored and unwilling to work with you in the future. If the speaker offers
the word in an example sentence, even better. Don’t worry if you can’t understand
what people are saying straight away. Rather focus your attention on writing down
what you think the target word is as accurately as you can. In the initial stages of
fieldwork it is good to learn a handful of useful expressions that will help you to
work with people in their language, such as “Can you say that again please?” or
“Could you talk a bit slower”.
Once you think you have heard a target word accurately and written it down,
it is time to check your pronunciation with the speaker. Take careful note of their
response. You want to avoid a situation where the speaker is giving you positive
feedback simply because they perceive that you are trying hard and they don’t want
to hurt your feelings, or because your pronunciation is close (§6.6.1). Hesitation,
facial expressions or tone of voice will often tell you more than their words about
how well you have pronounced the word. Always write down what they say, even
if you know it’s not the answer to the question you asked. Try asking your question
a different way and if that doesn’t work move on and come back to it another time.
At the end of the session thank the speaker and everybody who has participated.
Play the whole session back to the speaker. The speaker may make some corrections
or additional comments and will now be in a good position to discuss archive and
access conditions; and you should now document consent (§2.5.2) and complete
any other speaker-related metadata. If there is some local way of leave-taking or
71
3.6 SUMMARY
This chapter has outlined procedures for effectively using audio, video, photogra-
phy and notebooks in the field. We discussed requirements for selecting recording
equipment, including the ideal specifications for microphones, cameras and audio
recorders for use in particular field conditions. The chapter emphasised the impor-
tance of practising how to use your equipment before setting out. We ran through
ways to conduct your first recording session in the field, and methods to make these
first sessions as enjoyable and successful as possible. We concluded with the end
of the recording session when the day’s recordings and field notes are transferred,
checked and backed up.
NOTE
1 [Link]/radionational/programs/drive/fanny-cochrane-smith-and-horace-
watson/8256806 Accessed 1 August 2017.
REFERENCES
4.1 INTRODUCTION
Now that you have made some recordings and the data has been transferred from
the recording devices and backed up, the next step is to learn how to manage all
of this material. This chapter leads you through aspects of data management,
and ways to transcribe and annotate the recordings. It is a good idea to begin to
deal with the important aspects of data management as soon as is practical, or
else matters can quickly get out of hand. The first aspect of this is deciding on
systematic ways to name the files you create (§4.2.1). Associated with this is the
process of recording and storing metadata relating to the recordings (§4.2.3). We
then move on to discuss the different ways that linguistic data can be transcribed
and annotated (§4.3), thus setting the foundation for various types of analysis.
Finally, we discuss some issues involved in archiving linguistic data, maximis-
ing the chances that your data will be useful in both the immediate and the far-
distant futures (§4.4). Whether or not some of these tasks can be undertaken in
the field or not will depend on many factors –the length of time you are able
to spend there, internet access and so on, but establishing good practices early
on saves a lot of time in the long run. Also keep in mind that fieldwork can be
viewed as an iterative process, with refinements of tasks building up over time.
DOI: 10.4324/9780203701294-4
This chapter has been made available under a CC-BY-NC-ND license.
74
Table 4.1
Examples of file naming
JD2008_MUD052_[Link]
Table 4.2
Some metadata types and categories
Recording as Language; Session number; Content summary; Recorded by; Speaker(s); Others
a whole present; Date; Time; Place; Location, direction; Prompts, tasks and context
Participant Name(s), gender, date and place of birth, ethnic affiliation or clan; languages
metadata spoken or understood; languages spoken by family; residential history; profession,
education or social position; skills: storyteller, interpreter, recognised cultural elder
Figure 4.1
Metadata attached to a recording made on a cassette tape
Figure 4.2
A simplified view of the structure of a relational database for keeping track of metadata
See also Thieberger & Berez, 2012, p. 107 (Illustration: Jennifer Green)
Transcript Archive
Transcript ID Archived?
Transcript by Where archived
Date created Date archived
Date updated Archived by
Access conditions
79
Transcripts are a tool of the linguistic trade, and the term usually refers to some
type of graphic representation of an utterance. To make one you may employ a
dedicated notation system such as the International Phonetic Alphabet (IPA) (for
phonetic or phonological transcription), or a script such as the Latin, Chinese or
Arabic one. Although sign languages are not generally written down, some tran-
scription systems that have been developed for them are discussed briefly in §8.7
(see also Johnston, 2010, p. 110).
The choices you make about what you transcribe depends on your research ques-
tions, and what the transcript is to be used for. Ochs (1979, p. 44) writes that “a
more useful transcript is a more selective one” reflecting the fact that only certain
dimensions of language can be represented in any particular transcript at one time
without it becoming overloaded with detail. The extent to which a transcript can
and should accommodate multiple research interests and goals varies. What every-
one agrees about, however, is that transcription is very time consuming. A very gen-
eral rule is that one hour of audio recording can take up to ten hours to transcribe.
Of course this varies depending on how well you know the language, how com-
plex the language on the recording is (e.g. multiple speaker conversation vs single
speaker word elicitation) and what level of transcription you require. Transcribing
at the word level will be less time consuming than making a phonetic transcription.
Transcriptions using Conversation Analysis methodologies, where, for example,
the length of pauses are indicated, will also take longer.
Broadly speaking, annotation refers to the addition of other layers of interpre-
tation to primary data. This may include a standard written form, but in addition
many other aspects of language including phonological structure, morphological
glosses, parts-of-speech, and information structure categories. The list goes on and,
again, depends on your research interests (see Schultze-Berndt, 2006).
Text-Speaker 1
> Nr Annotation
1 Artwarle atye awerleweye! [Link].470 [Link].880 [Link].410
2 Artwarle atye arerlewe [Link].700 [Link].760 [Link].060
3 Ampelarte re apenhe [Link].840 [Link].207 [Link].367
4 Artwarlatyarerlewe reye [Link].560 [Link].530 [Link].970
5 Ampelarte re apenhengerne [Link].680 [Link].820 [Link].140
6 Ampelarte re apenhengerne [Link].230 [Link].270 [Link].040
7 Twepe-twepe akenhe [Link].490 [Link].500 [Link].010
Video viewer
[Link].400 Selection: [Link].000 - [Link].000 0
Time codes
S S → ← → ↓ ↑ Selection: Mode Loop Mode
Annotations
82
CLAN6 is an annotation and statistical analysis tool that has a large c ommunity
of users in language acquisition (§9.4.1) and Conversation Analysis. The coding
and search capabilities of CLAN also make it a powerful tool for quantitative stud-
ies of variation and language contact (§10.4.2). It was developed as a part of the
CHILDES (Child Language Data Exchange System)7 project whose aim was to
build a large database of homogenously formatted, accessible and analysable child
language corpora from a variety of languages (MacWhinney, 2000). Currently there
are around 130 corpora in the CHILDES database, covering a range of languages
including English, German, Afrikaans, Catalan, French, Japanese and Cantonese. It
consists of two main components: the annotation software, CHAT, and the associ-
ated statistical package, CLAN. CHAT can be used without CLAN; however, CLAN
is dependent on well-formatted CHAT transcripts. An example of a CLAN file can
be seen in §1.4.
CLAN has sophisticated search capabilities. The %mor tier allows simultaneous
coding of part-of-speech, language (if it is a multilingual setting), morphological
gloss and other features such as animacy, information structure category, gram-
matical relation etc. This means that large-scale quantitative searches can be done
instantly using either CLAN functions or Python scripts. Because CLAN is format-
ted as UTF-08, it is possible to edit multiple files in a text-editing program such as
TextWrangler.
Despite these advantages over ELAN, the CHAT file itself allows less complex-
ity in relating different types of annotation in comparison with ELAN files. Unlike
ELAN, CLAN tiers have a direct one-to-one correspondence with the speaker
tier, which allows less flexibility in relating the tiers to the media file. A difference
between the two is that whereas ELAN can deal with multiple video files, CLAN
only has the capacity to import one.8
Other less complicated options for transcription include Simple-ELAN, which
allows for transcription of audio or video on a single layer (i.e. on a single tier), and
F4transkript. The latter has the advantage of having an optional plug-in foot pedal
for fast typing (see review by Jones & German, 2016).9 Transcriber is another free
and easy to use tool.10 All of these are good for doing a first parse of orthographic
transcription.
4.3.3 Segmentation
In the ELAN example shown in Figure 4.3 you can see three distinct sets of annota-
tions that have been added to the time line. You can also see that there are seven
items visible in the pane on the right-hand side of the screen. The process of decid-
ing how long to make each annotation is called segmentation. Decisions about the
segmentation of data, where it is sound (speech and song) or forms of action (ges-
ture and sign), are fundamental to the annotation workflow. Segmentation of data in
each modality raises its own specific issues and the choices you make, and the level
of granularity you require will depend on your research questions. It is generally
accepted that some sort of utterance level unit is fundamental. In speech, intonation
is recognised as performing a “basic delimiting function across languages”; even as
83
the reliable coding of speech into intonation units (IUs) is a complex exercise in
itself (Himmelmann, 2006, pp. 258–270). Although it is generally agreed that there
is no single criterion sufficient for isolating IUs, significant pauses, pitch movement
and vowel lengthening have all been suggested as criterion for the delineation of IU
boundaries.
In the example, segments have been broadly selected on the basis of extended
pauses between instances of speech. This is one good reason to include the wave
form in your ELAN file, even though you can hear any sound recorded by the video
camera – the inclusion of the wave form means that stretches of sound and silence
are easy to see. For phonetic analysis of speech the units of segmentation are going to
be much shorter, and the program Praat is designed specifically for this purpose.11
4.4 ARCHIVING
record in the field anyway? It is also helpful to make a clear distinction between
the short-term care of data, which begins as soon as the data is generated, and the
notion of long-term archiving where research materials are kept for posterity.
Why archive?
For myself and many colleagues, a major incentive to engage with digital
archiving was to seek a strategy for coping with an ever-increasing private
collection of audio and videotapes, originating from various research projects
over the years, materials for which it became ever more difficult to find a
machine that would allow the data to be used in the future. Increasingly, there
are also recordings, usually audiotapes, produced and kept by members of the
speech community, but they frequently get recorded over after a while or are
lost in closed collections.
(Widlok, 2013, p. 4)
Before the end of the twentieth century, physical objects such as notebooks,
photographs and analogue recordings on wax cylinders or magnetic tapes were
generally donated or bequeathed to museums, libraries, universities and to dedi-
cated archival institutions. Gaining access to these records now often involves a
lengthy processes of negotiation, and frequently entails journeys to view material
on site. The fieldworker often mediates such journeys, and these links between
communities and archives are upheld by real or imagined expectations of on-
going personal connections. The inevitability of the fieldworker’s passing, a lack
of resources (or desire) to return to field sites, and many changes in personal
and political circumstances all add up to a perception that some archives are
not really for the people whose cultural materials they house (Ingram, in press;
Seeger, 2004).
The question of who uses an archive and for what purposes is a very impor-
tant one, and the answers will vary from community to community, and over
time. Modern archives may engage diverse audiences, including community mem-
bers, scientists of language and other related disciplines, and the general public
(Conathan, 2011; Woodbury, 2014, p. 21). Whereas linguists may access archives
of primary data in order to test out particular theoretical questions, speakers of
endangered languages may look to an archive for very different reasons. Of most
value may be the sound of a long-lost relative’s voice, a fragment of a song that
almost everybody has forgotten, an account of an historical event or a photograph
(see Holton, 2012; Wasson, Holton, & Roth, 2016, p. 649).
Since about 2010, developments towards participatory models for linguistic
archiving have attempted to break down traditional boundaries between depositors,
users and archivists and to expand the audiences and uses for archives. The impor-
tance of the involvement of speaker communities in archival processes has gained
increasing recognition (Henke & Berez-Kroeker, 2016, p. 428). This may entail
85
devolution away from the large archival institutions and the formation of smaller,
locally based derivative archives. Such models may enable more direct interac-
tions between the archive and community of origin (Ingram, in press; Seeger, 2004;
Treloyn & Emberly, 2013). This reflects an increasing desire and expectation that
collecting institutions will strive to empower speech communities and allow them
a greater role in the direction and management of cultural collections. Nowadays
there are various options to access materials online, and these bring great opportu-
nities as well as new challenges.
Figure 4.4
Bush potatoes as an illustration of a “hub and spokes” model of archives (Drawing: Jennifer
Green, 1978)
8
this as a requirement of your research proposal! All of this contextual information can
help others interpret your data in the future, and provide a user-friendly way into the
heart of the collections. An example of a comprehensive description of a documen-
tation project is one featuring the Ikaan language, spoken in two villages in south-
western Nigeria (see Salffner, 2015).19 This research was designed to look at phonetic
and phonological variation among speakers, but was based on a documentary corpus
about knowledge of food, food production and farming amongst the Ikaan. A land-
ing page for the Daly languages of northern Australia includes an interactive map,
lists of language resources and links to archival deposits.20
he had made some 40 years earlier to find out what the descendants of those
featuring in the film wanted to do with it. The film was about traditional meth-
ods of preparing gambarr, a tar-like substance used for manufacturing spears
and spear throwers. The descendants of those who made the film decided not to
include the film in their local archive.
Some of these concerns can be addressed by discussing access to the fieldwork
data very thoroughly during the consent process (§2.5) and by fostering ongoing
relationships between the archive and the community (Ingram, in press; Seeger,
2004). Reviewing your data by playing it back to people after they have had time
to consider the implications of making their language materials available to others
is also important. It is also a very good idea to seize any opportunity to explore
archives with the people you are working with. Many people are genuinely delighted
and proud to see that their language can be viewed and appreciated the world over,
and most see archives as a safeguard against the loss of valuable cultural knowledge.
Others may, however, decide that they want to place some restrictions on access,
and these requests must be honoured.
4.5 SUMMARY
Travelling to a new field site, meeting people to work with, and making the first
recordings is a very exciting experience. What may seem to be less glamorous are
the careful processes of data management that will in the long run ensure that your
work practices are efficient, that you don’t lose or misplace recordings and, finally,
that your recordings are archived in appropriate ways so that future generations
will benefit from your research. This chapter has dealt with the nuts and bolts of
some of these processes –file naming, metadata and techniques for transcription
and annotation. We also discussed some of the issues involved in archiving, and
accessing archives.
NOTES
REFERENCES
Barwick, L. (2012). Including music and the temporal arts in language documentation. In
N. Thieberger (Ed.), The Oxford handbook of linguistic fieldwork (pp. 166–179). Oxford:
Oxford University Press.
Berez, A., Finnesand, T., & Linnell, K. (2012). C’ek’aedi Hwnax, the Ahtna Regional
Linguistic and Ethnographic Archive. Language Documentation & Conservation, 6,
237–252.
Berez-Kroeker, A. L., Gawne, L., Kung, S. S., Kelly, B. F., Heston, T., Holton, G., Pulsifer,
P., Beaver, D.I., Chelliah, S., Dubinsky, S., Meier, R. P., Thieberger, N., Rice, K. &
Woodbury, A. C. (2018). Reproducible research in linguistics: A position statement
on data citation and attribution in our field. Linguistics, 56(1), 1–18. [Link]
org/10.1515/ling-2017-0032
94
Bird, S., & Simons, G. (2003). Seven dimensions of portability for language documentation
and description. Language, 79(3), 557–582.
Bowden, J., & Hajek, J. (2006). When best practice isn’t necessarily the best thing to do:
Dealing with capacity limits in a developing country. In L. Barwick & N. Thieberger
(Eds.), Sustainable data from digital fieldwork (pp. 45–55). Sydney: Sydney University
Press.
Bowern, C. (2015). Linguistic fieldwork: A practical guide. Basingstoke (England): Palgrave
Macmillan.
Conathan, L. (2011). Archiving and language documentation. In P. Austin & J. Sallabank (Eds.),
The Cambridge handbook of endangered languages (pp. 235–254). Cambridge: Cambridge
University Press.
Crasborn, O. A. (2015). Transcription and Notation Methods. In E. Orfanidou, B. Woll, &
G. Morgan (Eds.), Research methods in sign language studies: A practical guide (pp. 74–88).
London: Wiley Blackwell.
de Souza, P., Edmonds, F., McQuire, S., Evans, M., & Chenall, R. (2016). Aboriginal knowl-
edge, digital technologies and cultural collections: Policy, protocols, practice. Melbourne
Networked Society Institute Research Paper 4.
Gippert, J., Himmelmann, N., & Mosel, U. (Eds.), (2006). Essentials of language documenta-
tion. Berlin & New York, NY: Walter de Gruyter.
Haviland, J. B. (2016). Maing gambarr: It belongs to me, I belong to it. In J.-C. Verstraete &
D. Hafner (Eds.), Land and language in Cape York Peninsula and the Gulf Country
(pp. 455–579). Amsterdam & Philadelphia, PA: John Benjamins Publishing.
Henke, R., & Berez-Kroeker, A. L. (2016). A brief history of archiving in language documen-
tation, with an annotated bibliography. Language Documentation & Conservation, 10,
411–457.
Himmelmann, N. P. (2006). The challenges of segmenting spoken language. In U. Mosel,
N. P. Himmelmann, & J. Gippert (Eds.), Essentials of language documentation
(pp. 253–274). Berlin & New York, NY: Mouton De Gruyter.
Holton, G. (2012). Language archives: They’re not just for linguists any more. In F. Seifart,
G. Haig, N. P. Himmelmann, D. Jung, A. Margetts, & P. Trilsbeek (Eds.), Potentials of
language documentation: Methods, analyses, and utilization (pp. 105–110). Hawai‘i: LDC
Special Publication.
Ingram, C. (in press). “Each in our own village”: Insights from a fieldworker/archivist on sus-
tainable interactions with communities. In B. Woods, F. Gunderson, R. Lancefield (Eds.),
The Oxford Handbook of Musical Repatriation: Oxford University Press.
IRCA. (2015). Audio and metadata usage guidelines: Physical and digitised media (pp. 1–14).
IRCA. Indigenous Remote Communications Association. [Link]/projects/
archiving/reference-group
Johnson, H. (2004). Language documentation and archiving, or how to build a better corpus.
Language Documentation and Description, 2, 140–153.
Johnston, T. (2010). From archive to corpus: Transcription and annotation in the crea-
tion of signed language corpora. International Journal of Corpus Linguistics, 15(1),
104–129.
Jones, C., & German, A. (2016). Review of F4transkript, a simple interface for efficient anno-
tation. Language Conservation and Documentation, 10, 347–355.
MacWhinney, B. (2000). The CHILDES project: Tools for analyzing talk. Mahwah, NJ:
Lawrence Erlbaum Associates.
95
Margetts, A., & Margetts, A. (2012). Audio and video recording techniques for linguistic
research. In N. Thieberger (Ed.), The Oxford handbook of linguistic fieldwork (pp. 13–53).
Oxford: Oxford University Press.
Meakins, F. (2007). Computerized Language Analysis (CLAN) from the CHILDES project.
Language Documentation & Conservation, 1(1), 107–112.
Ochs, E. (1979). Transcription as theory. Developmental Pragmatics, 10(1), 43–72.
Perniss, P. (2015). Collecting and analyzing sign language data: Video requirements and use
of annotation software. In E. Orfanidou, B. Woll, & G. Morgan (Eds.), Research methods
in sign language studies: A practical guide (pp. 55–73). London: Wiley Blackwell.
Salffner, S. (2015). Guide to the Ikaan language and culture documentation. Language
Documentation & Conservation, 9, 237–267.
Schultze-Berndt, E. (2006). Linguistic annotation. In J. Gippert, N. Himmelmann, & U. Mosel
(Eds.), Essentials of language documentation (pp. 213–251). Berlin: Mouton de Gruyter.
Seeger, A. (2004). New technology requires new collaborations: Changing ourselves to better
shape the future. Musicology Australia, 27(1), 94–110.
Singleton, J. L., Martin, A. J., & Morgan, G. (2015). Ethics, Deaf-friendly research, and good
practice when studying sign languages. In E. Orfanidou, B. Woll, & G. Morgan (Eds.),
Research methods in sign language studies: A practical guide (pp. 7–20). London: Wiley
Blackwell.
Thieberger, N., & Berez, A. L. (2012). Linguistic data management. In N. Thieberger (Ed.), The
Oxford handbook of linguistic fieldwork (pp. 90–118). Oxford: Oxford University Press.
Thieberger, N., Margetts, A., Morey, S., & Musgrave, S. (2016). Assessing annotated corpora
as research output. Australian Journal of Linguistics, 36(1), 1–21.
Treloyn, S., & Emberly, A. (2013). Sustaining traditions: Ethnomusicological collections,
access and sustainability in Australia. Musicology Australia, 35(2), 159–177.
Wasson, C., Holton, G., & Roth, H. S. (2016). Bringing user-centered design to the field of
language archives. In M. A. Shepard, G. Holton, & R. Henke (Eds.), Emergent use and
conceptualization of language archives (pp. 641–681). Hawai‘i: LDC Special Publication.
Widlok, T. (2013). 1. The archive strikes back: Effects of online digital language archiving on
research relations and property rights. In M. Turin, C. Wheeler, & E. Wilkinson (Eds.),
Oral literature in the digital age: Archiving orality and connecting with communities (Vol. 2,
pp. 3–19). Cambridge: Open Book Publishers.
Wittenburg, P., Brugman, H., Russel, A., Klassmann, A., & Sloetjes, H. (2006). ELAN:
A professional framework for multimodality research. In Proceedings of LREC 2006, Fifth
International Conference on Language Resources and Evaluation. Genoa, Italy.
Woodbury, A. (2014). Toward making endangered language documentations people can
read, use, understand, and admire. Language Documentation and Description, 12, 19–36.
96
5.1 INTRODUCTION
Phonetics is the representation of human speech sounds, which are defined inde-
pendently of any language. Each phonetic sound or phone is represented by a
unique symbol defined by the International Phonetic Association (IPA). This means
that you can transcribe sounds in any language unambiguously. The IPA has a web-
site explaining all these symbols with corresponding sound files.1 Phonology, on
the other hand, is the interpretation of speech sounds in a particular language: the
structure of sounds that produce meaningful contrasts. The segments of phonol-
ogy are called phonemes. As field linguists, we want to record and analyse as much
speech as possible in order to understand how speech sounds pattern in a given
language.
A good phonological analysis rests on quality phonetic transcriptions of a siz-
able corpus of recordings from a range of language speakers. As Hayes (2011, p. 55)
notes, “a really solid phonemicisation is often the result of years of hard work, car-
ried out by linguists with good ears and extensive experience with the target lan-
guage”. In this chapter we discuss the issues, methods and preparation needed for
working out the phonology of a language to ensure you are getting the most out
of your fieldwork and not wasting the time (and patience!) of language consult-
ants. For methodology in phonological analysis, we refer to Gussenhoven & Jacobs
(2011) and Hayes (2011). These include exercises to improve your skills in pho-
nological analyses, which is essential preparation for fieldwork on the sounds of a
language.
Before you begin fieldwork, you should have clear objectives about what you
wish to achieve in your phonological analysis. For example, do you want to identify
all the contrastive sounds of a language (the phonological segmental inventory),
determine the syllable structure, or describe the degree of variation associated with
each contrastive sound? You should also be explicit about how much detail you are
representing in transcription of sounds. In your early transcriptions of an unfamil-
iar language, you will want to include as much detail as possible, but once you have
a grasp of the language you may choose to ignore sounds that are not meaningful or
those whose patterns are now understood. As an example, English ‘cat’ often has a
short burst of air after the ‘c’ and a narrow phonetic transcription would be [khæt].
However, once we have understood the aspiration rule, [kæt] might suffice and
indeed be preferable (see Gussenhoven & Jacobs 2011, p. 16). (In English, aspiration
occurs when the stop is the only segment in an onset of a stressed syllable.) Thus, it is
DOI: 10.4324/9780203701294-5
This chapter has been made available under a CC-BY-NC-ND license.
97
syllable structure, stress) can cause sound changes such as aspiration in English ‘cat’
[khæt]. Other environments to pay attention to are word and morpheme bounda-
ries. For those primarily interested in syntax or semantics, transcription of anything
longer than a word will be impossible until you have a good grasp of the phonology,
and for this you need to begin with words in isolation.
One of the questions often asked is how many words you need to record to work
out the phoneme inventory of a language. Crowley (2007, p. 97) estimates, on aver-
age, 300–400 words are needed while Chelliah & de Reuse (2011, p. 252) estimate
500–700. However, really it depends on the complexity of the sound system. In
some languages there may be rare phonemes that are only encountered once a large
vocabulary has been collected. Furthermore, having the ideal number of vocabulary
items is only one part of the equation. In languages that use phonological contrasts
that do not exist in the languages spoken by the linguist, a major challenge is being
able to identify and ‘hear’ these contrasts. Some common sounds English speakers
have difficulties hearing when working on a new language include aspiration, ejec-
tives and nasalisation, palatal stops, initial velar nasals and glottal stops (for further
examples see Bowern (2015, pp. 44–45)).
The simplest tools to help with this task are a pair of good ears and eyes to pay
careful attention to the speaker’s articulation. You should watch a speaker’s mouth,
as some articulations can be seen more easily than others, and may provide impor-
tant information about phonological contrasts, for example lip-rounding and artic-
ulation of the tongue tip. You should also observe a speaker’s responses to your
attempt to articulate the sound and consider any native speaker’s intuitions on con-
trasts. After the elicitation session you might use phonetic analysis tools to look at
the acoustic properties (e.g. spectral and waveform analysis) or you might have col-
leagues or students who can put their ears to the task too. You might also consider
making instrumental recordings, some of which we discuss in §5.10.
Your first fieldwork session might involve eliciting vocabulary items from within
a simple semantic domain, as discussed in §3.4. In this section we discuss how to
analyse these words to understand the sound system of a language. Note too that
these words will inform the beginning of a dictionary (§7.5). Simple elicitation
provides comparable data for phonological analysis as it consists of words said in
isolation by the same speaker and is likely to consist of words that are of the same
parts of speech, often nouns. When conducting the first fieldwork session you will
be getting to know your speaker, watching your recording equipment, listening
and trying to write down the words. You should also be thinking about the sounds
you are hearing and asking yourself questions about these. For example:
• Is there a voicing or geminate contrast in the stops?
• Is there a contrast between nasalised and oral vowels, short and long vowels?
• Are there ingressive, ejective, glottal or click sounds?
• How many coronal contrasts are there?
9
For teachers of a field methods class, the entire list of semantic domains can
be divided among the students. Following each student’s elicitation and tran-
scription, a combined wordlist can be created. Do any of the same words
appear in different elicitation sessions? What are the differences between stu-
dents’ transcriptions? Might the variation reflect a different first language?
In this section we suggest how to organise and analyse your data to work out the
phonemes and identify variation. Now that you have done some transcription you
will need to take stock of all the sounds you have and their distribution.
Compare each sound in all its different contexts (e.g. word-initial, word-medial,
word-final; stressed, unstressed), as contrasts may not occur in all available posi-
tions. For example, chart all instances of nasalised vowels preceded by a stop. Keep
track of the charts of sounds you make. Keep your analysis in a diary or at least
separate from your fieldwork notes, as it is important to keep a track of your analy-
sis. As you produce your charts and analysis, you should ask yourself if there are any
gaps in the inventories. For example, does the velar nasal appear in all positions in
a word? Are there pairs of voiced and unvoiced stops for all places of articulation?
Are all homorganic nasal stop clusters attested, or is one place of articulation miss-
ing? Identify the phonological categories with the most segmental contrasts and
10
those with the least. Are there any rare sounds? If so, consider whether these could
be onomatopoeic words, ideophones, words from another speech register, or even
loanwords. For example, perhaps your data has both voiced and unvoiced stops at
all places of articulation except for alveolar, or perhaps stops are represented in all
positions of the word, while taps and laterals are only attested intervocalically. We
will now consider some data and walk through the process of phonemic analysis,
drawing on Hayes (2011) and Gussenhoven & Jacobs (2011).
Table 5.1
Ten Kaytetye words from an elicitation session, 20151029MT
Table 5.2
Elicited words assembled by vowel sounds (underlining marks the relevant vowel)
[ɪˈʎemə] (9)
[eˈnɐpə] (10)
Table 5.3
Chart of local environments of vowel sounds
#__ɾ (4) #__ɾ (2) #__ʎ (9) p__# (1), (3), (10)
#__ʎ (1) #__n (10) ˈk__c (7) m__# (2), (4), (9)
#__ n (8)
t
j__l (8)
c__ɾ (7)
ˈc__k (5)
ˈn__p (10)
104
At this stage, inspect the columns to see if there are any general patterns. We can see
that [ɐ]in column one occurs in the most environments and that there is no general
pattern to the environment it occurs in. Looking at column two, we can see that [e]
has fewer environments. It occurs in word-initial and only stressed medial positions.
Comparing columns one and two we can see a minimal pair environment in the first
row (2) and (4) and so [ɐ] and [e] must be separate phonemes. Looking at column
three we can see that [ɪ] only occurs before a palatal sound [ʎ], [c], [j] and [ɲ], whereas
[e] in column two never does. Therefore [ɪ] and [e] are in complementary distribution
and so are likely to be allophones. Looking at column four, [ə] occurs overwhelmingly
in word-final position but it also occurs in medial unstressed position in (8), as does [ɐ]
in (7). The sounds [ɐ] and [ə] contrast in medial unstressed position and must there-
fore belong to two separate phonemes. The schwa occurs in word-final and unstressed
positions, whereas [e] occurs in word-initial and stressed environments, thus they are
in complementary distribution; [ə] is thus an allophone of [e]. The environments of
the four vowel sounds are written as follows. Two vowel phonemes can be proposed:
/ɐ/ /e/
By comparing columns of sounds in this way, it should soon become clear whether the
sounds are in complementary distribution or not. While adjacent sounds are the
basis for much allophonic variation, other environments also give rise to it (e.g.
vowel harmony). Charting these environments is time-consuming, but you will be
at an advantage as you gain more experience with different l anguages because many
of the same rules crop up across languages.
Target sounds should also be arranged into phonetic charts based on place and
manner of articulation. That way you can see whether phonological rules apply to
all sounds that share a particular phonetic feature or set of features, which is usu-
ally the case. For example, voiced laterals and nasals are both sonorant sounds and
in some languages they can cause the adjacent segment to become voiced. When
making charts to see the distribution of voiced and unvoiced consonants, pay atten-
tion to the distribution of such sounds. Of course, languages make use of different
sounds and so the exact sounds that constitute a ‘natural class’ will vary from lan-
guage to language.
1. Make a row of each stop and tap sound and list the words in columns
as per Table 5.2 (we will not analyse vowels, nasals or word stress in this
exercise).
2. Construct a table of the local environments for each stop and tap sound,
as per Table 5.3.
3. Chart any complementary distribution and describe which sounds are in
complementary distribution.
4. Identify any allophony rules and propose a set of stop phonemes and allo-
phones. Explain your motivation for selecting which ones are phonemes
and which are allophones.
If the language has contrasts you are not used to, it may be difficult to decide if
two words are minimal pairs or homophones (see §7.6). In some cases you may
get a reliable response by asking speakers if they think the two words sound
the same or different, but it is not always clear what speakers are responding to
106
in their judgements. Some speakers may claim that two words sound different
simply because their semantics differ or because the two words usually have dif-
ferent intonation patterns (especially if the pair of words are from different parts
of speech).
A helpful technique to distinguish pairs of words is to play recordings of the two
potential homophones back to a speaker and ask which word has which meaning. If
the response is consistently ‘different’, then the words are probably a minimal pair.
In this case, check again in Praat for any phonetic differences. If the responses are
not consistent, the two words may well be homophones. Try to do this with as many
speakers as possible to get independent judgements.
A general technique used by Ladefoged (2003) to check pronunciation is to
ask, “(w)hich of the versions of the word sounds better” (rather than “correct”),
holding up one or two fingers each time, for the speaker to answer “first” or
“second”. If you are working with a speaker who is also highly literate in their
language, it can be helpful to ask them to spell the words. The best solution is
to collect multiple occurrences of the two words in natural speech and compare
these carefully.
Once you have phonetic transcriptions from multiple speakers, chart these,
as in Table 5.4. From these you can put forward a phonemic analysis, as in
the final column of Table 5.4. Phonemicisation is an iterative process of pro-
posing, testing and refining hypotheses about segmental and prosodic tonal
contrasts.
As your data increases and you engage in this process again and again, you should
come to a working phonemicisation from which you can move to a phonemic tran-
scription, as this will enable you to transcribe data more efficiently (unless your
fieldwork aims are solely phonetic). In ELAN, you can always go back and add a
107
Table 5.4
Comparing different Kaytetye speakers’ pronunciations of the same words –superscript ‘h’ represents
aspiration and the small circle devoicing
phonetic transcription tier to do more targeted analysis if your aim is phonetic anal-
ysis. Don’t move to a phonemic transcription until you are sure of the phonology,
because if what you represented as a phoneme actually turns out to be two contras-
tive sounds, this will require much more work to fix. Note that if you make subse-
quent changes to your representation system, this should be documented, including
the date in which the change is introduced. Use your notebook to keep track of your
data and analyses.
Identifying the number and the nature of tonal categories is a complex task that
may vary between dialects of a language and between different analysts. A use-
ful resource for fieldwork on a tonal language is Morey (2014). A tonal lan-
guage is one where pitch enters into the lexical realisation of at least some lexical
items. The minimum number of tonal categories in a language is two while the
maximum so far attested is twenty. It is important to first document uninflected
words, as the tonal category of a word can change when inflected. Morey recom-
mends identifying tonal contrasts in words before exploring the phonetics of
tone in detail, as the acoustic properties of tonal categories can be complex and
varied. For example, a category may be based on a combination of pitch, phona-
tion, duration and intensity and different speakers may realise pitch categories
differently.
Native speaker intuitions about a tonal category of a word can provide evidence
for your analysis. Beware, however, as different speakers may take the initial or
final pitch of a tone as the point of comparison. It is useful to distinguish the ana-
lyst’s perception of a tonal category from that of a native speaker’s perception of a
tonal category in a word. One way of checking the tonal category of a word with
a native speaker is to whistle it. Morey (2014, p. 647) describes another method
where he (SM) repeats a word, each time with a different tonal category, and asks
the speaker (Aije Let) to identify the correct category (Figure 5.1).
Tonal categories are not always carried across in related languages. However,
whether they are or aren’t tends to be consistent across all cognate words (Morey,
2014, p. 639), so make sure you compare many words that have a given tonal
108
Figure 5.1
Drawing on native speaker intuitions about tone in Tai Phake
Source: Morey 2014, p. 647
Aije Let: kin6 / kin1 kin2 kin3 kin4 kin5 kin6 kin6 / hok1
“Not eat? kin1 kin2 kin3 kin4 kin5 kin6, it is number six.”
category, not just one or two. Be aware that the functional load that tone plays can
vary between closely related languages and tonal categories may diminish if the
language is in contact with non-tonal languages.
Morey (2014) identifies and discusses four ways that tone can be represented in
analysis, noting that above-vowel diacritics are recommended by the IPA. Practical
orthographies mark tone in different ways, including underspecifying some con-
trasts or not marking tone at all. Some use diacritics, while others combine vowel
and tone into a single symbol.
within the words (e.g. open vs closed; simple vs complex; onset vs no onset) and the
relative frequency of these structures and types. Similarly, you will need to chart all
open syllables, complex consonant positions etc.
Prosody is the intonational system of a language. Until recently, such analyses
have been drawn directly from field data elicited for other purposes. Chelliah & de
Reuse (2011, p. 275) observe that you should make notes and try out the intonational
contours of different sentence types, such as commands, statements, interrogatives
and different focus constructions such as clefts. You should also investigate the rela-
tionship between contours and discourse type, such as arguments, teasing, baby-
talk and complaints. See Himmelmann & Ladd (2008) for a detailed discussion of
fieldwork on prosody and ways to obtain controlled data designed specifically for
intonational research.
After you have done an initial analysis of the sounds in a language, you may have
a list of variants for which you are not sure if there is a conditioning environment
or whether they are simply free variants. For example, you might not be sure
whether the tap [ɾ] and [t]are allophones or free variants produced by differ-
ent speakers; or you might have different vowel lengths and want to know what
factors condition their allophony. You might therefore design a task that aims to
get speakers to say words that have these sounds while controlling the linguis-
tic environment in which they are said. Like all experiments, these tasks have
a dependent variable and one or more predictors (or independent variables).
For example, the dependent variable might be vowel length and two predictors
might be the type of syllable (open or closed) and its position in the word (e.g.
stressed/unstressed, initial/medial/final). The experiment would aim to measure
the effects of the predictors (syllable type and word position) on the dependent
variable (vowel length). While these experiments require reasonable numbers of
speakers to produce statistical results, as discussed in §9.4.2 and §[Link], they
can also be done with just one or two speakers to gain an understanding of an
individual speaker’s sound system.
Your analysis of categories should first look to the stressed position to find the
maximal number of contrasts, as contrasts are often reduced in unstressed posi-
tions. Develop a hypothesis about what is triggering a particular variation; for
example, are geminates and singletons dependent on preceding vowel length? Do
nasals become prestopped in a stressed/unstressed environment? Do stops become
voiced following a sonorant consonant? Does a three-way tonal contrast reduce in
an unstressed position?
You will need to prepare a list of all words that contain the target sound which
can confirm or refute the hypothesis. As an example, take the hypothesis that
stops become voiced following a sonorant consonant. To test this, you will need
to record words that have a sequence of a sonorant plus stop, as well as non-
sonorant plus stop in them, including intervocalically and word-initially. You will
10
Table 5.5
Gap in the attested consonant clusters in a hypothetical language: is /ɾk/non-permissible or an
accidental gap?
labial velar
the experiment will also help to control for ‘list intonation’ –that is, a set intonation
pattern where the preceding and last word in a list have different phonetic features.
For example, the final word of a list or repetition may have a lower pitch and longer
duration than words in the other positions. This can be especially problematic if
working on a language with tonal or durational contrasts. Make sure the participant
knows that the word will come up again a number of times so that they do not think
they have made a mistake when it recurs.
You will need to devise some way to prompt the speaker to say the target
word in the carrier phrase. If the participant is literate, you could have them
read it. If they are not literate, you can make a recording of the phrase first
and then use this as a prompt. That is, ask the consultant to repeat after they
hear each utterance. The benefit of using a pre-recorded frame is that it will
be the same for every consultant and every repetition of the token throughout
the experiment. If possible, make the recording with a native speaker and add
visual prompts (see Figure 5.2). This will certainly make the task less boring
for the speakers.
You should consider whether you need a mix of different ages (for example in
cases of language shift), as well as men and women (if sociolinguistic variation
seems to be at play). This will depend on the research questions you are asking.
Figure 5.2
An example of one of the 35 Kaytetye target words and corresponding images (here ‘in the scrub’
[ɐˈʈɳeŋ]) in a carrier phrase ‘Say X!’, pre-recorded by a native speaker
Sources: Harvey, Davies et al., 2015; Harvey, Lin et al., 2015
artnenge
12
Unless the questions relate to gender and age, it is better to keep the participant
group as homogenous as possible to avoid additional variables in the analysis. If
you are investigating interdental consonants, you should avoid working with any-
one who is missing front teeth! Pay close attention to views within the community
about anyone who is said to have a ‘bad tongue’ or other comments that might sug-
gest someone with unusual speech. Ideally, the speakers would also be monolingual
or mother-tongue speakers, although this is not practical in many cultures where
multilingualism is the norm.
In explaining the task, you might want to say that different people say words a
bit differently, and that you want to make sure you record all the ways in which
people say a word. Even if people understand why you want to do these sorts
of experiments, they are not always easy for speakers to do. In free word order
languages, it can be very difficult for speakers to fix the position of the target
word. Asking speakers to use a test frame also adds to the many factors that make
such elicitation an unnatural activity and thus may lead to increased speech
errors. Therefore training speakers in using the carrier phrase is highly valuable.
To help put speakers at ease, it can be useful to turn the training activity into a
fun group activity.
In this section, we give a brief overview of basic acoustic recording techniques for
phonetic fieldwork. Instrumental phonetic fieldwork employs a range of specialist
equipment and software for processing data. These are beyond the scope of this
book and we refer readers to Ladefoged (2003).
When making phonetic recordings, it is crucial to avoid background noise. If you
are recording indoors, it will be necessary to reduce echo. This can be done by put-
ting foam (e.g. mattresses) or blankets on the wall and by minimising the distance
between the mouth and microphone. Butcher (2013, p. 65) recommends a head-
mounted microphone positioned approximately 5 cm from the lips and slightly to
the side (§3.2.2). If using a freestanding microphone, Butcher recommends these be
placed 10–15 cm from the lips and put on a shock mount fixed to a boom arm that
can be clamped to a table. A foam box or ‘Porta-Booth’ put around the microphone
can also reduce reverb. With the small distance between mouth and microphone,
you will need to do a ‘dry run’ with each speaker to set the recording level to capture
the loudest signal level without distortion. Butcher (2013, p. 65) also recommends
paying “particular attention to the presence of ‘50 Hz hum’ (60 Hz in the United
States) resulting from interference from the power supply”. Running the equipment
on battery power can resolve this, but if the problem persists, you may need to
invest in a ‘hum eliminator’ (otherwise known as a galvanic isolation transformer).
When recruiting participants, men may be better for some kinds of acoustic
research because their Fundamental Frequency, or F0, is lower than in women, thus
providing richer information about formants. If you want to pursue fieldwork on
13
specific aspects of articulation, you should be aware of the following tools of instru-
mental phonetics that can be taken into the field:
• Oral and nasal airflow masks can be used to record the amount of airflow
and/or pressure from the nose and mouth.
• Ultrasound and electropalatography are used for studying how the tongue
moves during articulation.
• Static palatography is used for studying where articulations are made and
with which part of the tongue.
• Layryngeography or glottography (EGG) is used for measuring the timing
of vocal fold vibration and distinguishing phonation types such as breathi-
ness and glottalisation (Butcher 2013, p. 75).
For the area of perceptual phonetic fieldwork, see Butcher (2013). We also rec-
ommend Ladefoged (2003), who discusses the use of these methods in the field.
As well as the tools mentioned above, it is also possible to innovate and use non-
specialised equipment to make measurements, as Jesse Stewart describes.
Figure 5.3
Limbardo Payaguaje, a Shiwiar speaker, demonstrates the use of the earbuds. The updated
methodology positions just one earbud under (rather than in) the nose and one next to the
mouth. Limbardo was keen to demonstrate the original positioning which he found humorous
(Photo: Eduardo Portilla, used with permission of Limbardo Payaguaje)
5.11 ORTHOGRAPHY
Some endangered languages have never been written down, while many that have
been are not written in any standardised way. This can be addressed by creating
an orthography, a set of conventions for writing a language. As well as stipulat-
ing the rules for spelling, an orthography can include rules about hyphenation,
capitalisation, word breaks and punctuation. An orthography can be useful for the
community for a number of reasons. The community may wish to read and write
their language, especially if they are literate in another language. An orthography
can also assist in language teaching, especially in revitalisation contexts and when
the language is being taught as a second language. Having a written form can also
enhance the prestige and value of a language, both within and outside of the com-
munity. Written forms of the language are often used proudly on signs, as names
for sports teams, rock bands and as tattoos, and can be found on the internet, social
media and in church. However, it may be that some people do not want their lan-
guage written down, as they may see it as having a standardising effect on their
oral language or lead to weakening of memory skills. Broncho suggests it can also
15
be a “path for outsiders to potentially exploit cultural knowledge intended only for
Native people”.3 If the community wants an orthography, there are socio-political
factors as well as phonological arguments to consider when developing one. If you
are starting from scratch, you will need to approach this differently than if you are
working with an existing but perhaps poor orthography.
fluency and be associated with errors such as metathesis. For homorganic con-
sonant sequences it is common to reduce any digraphs required, e.g. [ɲc] is often
written ‘nty’ rather than ‘nyty’, unless of course there is a contrast between [ɲc]
and [nc].
If you are working with an orthography that is not phonemic and under-
differentiates the phonetic form, this will make looking up a word in a dictionary
cumbersome (though not impossible). If this is the case, it is useful to have a field
in the dictionary for the phonemic or phonetic form (in the SIL Multi-Dictionary
Formatter (MDF) field marker \ph is used for this (§7.6)). An example of this can
be seen in §7.6.3 from the Kalam Dictionary. In communities where there are mul-
tiple orthographies, a possibility is to have a field in the dictionary for the alternate
orthography(s). If using SIL dictionary-making software such as Toolbox or FLEx
(§7.7) you will need to add a field for this.
Early Missionary (EM) New Missionary (NM) New Secular (NS) English
c. 1880s–1980s 1990/2006 1980s/2000 gloss
1 aldola alturla alturle west
2 Jabalpa Yaparlpa Yaperlpe Glen Helen
(place name)
3 kwara kwaarra kwarre girl
4 kwata kwaarta kwarte egg
5 knulja kngulya kngwelye dog
6 mankama maangkama mangkeme grow
word EM NM NS Difference
1 aldola alturla alturle d (EM), t(N)
aldola alturla alturle o (EM), u(N)
aldola alturla alturle l (EM), rl (N)
aldola alturla alturle e# (NS), a# (M)
2 etc.
18
2. Compare the orthographies and make a numbered list of (a) the differ-
ences in consonants (b) the differences in vowels as follows:
letter(s) EM NM NS word
5.12 SUMMARY
In this chapter, we have discussed how to transcribe and analyse words to work out
the sound system of a language. Listening and watching speakers closely will put you
in good stead to make the most out of the elicitation sessions. Organising sounds
in their different environments meticulously will help to find patterns in the data
and enable you to ask key questions and better design elicitation sessions. This is an
iterative process of adding data, charting sounds, identifying gaps, forming hypoth-
eses and eliciting more data. We also described how to design and run experiments
to ascertain the effects of neighbouring sounds, stress and phonotactics. We briefly
introduced a number of instrumental recording techniques and equipment for pho-
netic analysis. This chapter also outlined factors to keep in mind when developing an
orthography from scratch, including the phonemic principle, co-opting an existing
script and symbols, and taking into consideration factors influencing literacy fluency.
The success of an orthography was also discussed in terms of non-linguistic factors as
well as phonological ones.
There are excellent chapters on doing fieldwork on the sound systems of a language
in Chelliah & de Reuse (2011, Chapter 10) and Bowern (2015, Chapter 5); and a
good checklist in Sakel & Everett (2012, p. 159). For general guidelines for fieldwork
on phonology see Van de Veen & Medjo Mvé (2010) and for fieldwork on the pho-
nology of tonal languages see Morey (2014). A good introduction to phonological
analysis is Gussenhoven & Jacobs (2011) as well as Hayes (2011). For fieldwork in
19
phonetics see Ladefoged (2003), Himmelmann & Ladd (2008) and Butcher (2013).
For issues in developing an orthography see Bird (2001).
NOTES
REFERENCES
Bird, S. (2001). Orthography and identity in Cameroon. Written Language & Literacy, 4(2),
131–162.
Bowern, C. (2015). Linguistic fieldwork: A practical guide. Basingstoke (England): Palgrave
Macmillan.
Boersma, P. & Weenink, D. (2017). Praat: doing phonetics by computer [Computer pro-
gram]. Version 6.0.28, retrieved 23 March 2017 from [Link]/.
Butcher, A. (2013). Research methods in phonetic fieldwork. In M. Jones & R.-A. Knight
(Eds.), The Bloomsbury companion to phonetics (pp. 57–78). New York, NY: Bloomsbury
Academic.
Chelliah, S., & de Reuse, W. (2011). Handbook of descriptive linguistic fieldwork. Heidelberg:
Springer.
Crowley, T. (2007). Field linguistics: A beginner’s guide. Oxford: Oxford University Press.
Gallagher, G. (2013). Speaker awareness of non-local ejective phonotactics in Cochabamba
Quechua. Natural Language & Linguistic Theory, 31, 1067–1099.
Gussenhoven, C., & Jacobs, H. (2011). Understanding phonology. London: Routledge.
Halle, M., & Keyser, J. (1971). English stress: Its form, its growth and its role in verse. New York,
NY: Harper and Row.
Harvey, M., Davies, B., Lin, S., Turpin, M., Demuth, K., & Ross, A. (2015). Two types of
prestopping in Kaytetye. In Proceedings of the 49th Annual Meeting of the Chicago
Linguistic Society, 145–152.
Harvey, M., Lin, S., Davies, B., Turpin, M., Ross, A., & Demuth, K. (2015). Contrastive and
non-contrastive pre-stopping in Kaytetye. Australian Journal of Linguistics, 35(3), 1–19.
Hayes, B. (2011). Introductory phonology. Oxford: Wiley-Blackwell.
Himmelmann, N. P., & Ladd, D. R. (2008). Prosodic description: An introduction for field-
workers. Language Documentation & Conservation, 2(2), 244–274.
International Phonetic Association. (1999). Handbook of the International Phonetic
Association: A guide to the use of the International Phonetic Alphabet. Cambridge:
Cambridge University Press.
Jakobson, R. (1960). Closing statement: Linguistics and poetics. In T. A. Sebeok (Ed.), Style
in language (pp. 350–377). Cambridge, MA: MIT Press.
Kenny, A. (2017). Aranda, Arrernte or Arrarnta? The politics of orthography and identity on
the Upper Finke River. Oceania, 87(3).
Kiparsky, P. (1977). The rhythmic structure of English verse. Linguistic Inquiry, 8, 189–247.
120
Morpho-syntax
By now you will have done a lot of single word elicitations and have a grasp of the
phonology. The next step is to tackle phrases and clauses and get underway ana-
lysing the morpho-syntax. If you are a community member or have been working
on the language for many years, you may have a comprehensive corpus that you
can mine for an outline of a grammar, which will in turn provide the basis for
further morpho-syntactic interrogation. For linguists new to the language and
community, narratives, conversation etc. are often too complex a place to start.
Speakers often want to train you up in the language to an extent that they consider
you a worthy interlocutor or listener. So clause-based elicitation is often a good
starting point.
DOI: 10.4324/9780203701294-6
This chapter has been made available under a CC-BY-NC-ND license.
12
122 Morpho-syntax
There is an ongoing debate about the kind of data you need to do grammatical descrip-
tion. This debate is closely tied to the shift from descriptive linguistics, which is char-
acterised by the classic Boasian trilogy of grammar, dictionary and text collection, to
language documentation, which emphasises recording a broader range of language,
using methodologies which allow for different types of potential analyses, including
grammatical description (Himmelmann, 1998) (§1.2.4). The language documenta-
tion approach has formal elicitation at one extreme and free-ranging conversation at
the other. Both have advocates who claim one or the other is the preferred data for
grammatical description. Elicited data is often not considered ‘real’ language because
it is constructed and relatively context free. Language is, of course, a dynamic and
interactive system. On the other hand, conversational data is often fragmented and
not revealing of the potential grammatical range of a language because people often
don’t talk to each other in full sentences. The reasonable mid-position for grammati-
cal description is a combination of narrative, conversation, procedural and elicitation
data. It is also important to remember that you will never find out what is ungram-
matical without elicitation because the absence of a structure in a corpus might just
relate to its rarity rather than to its grammaticality.
Data from formal elicitation can prove highly insightful in trying to tease out
what parts of a clause are contributing to various meanings. But where possi-
ble, it should be backed up with naturally occurring data from conversation or
narrative. Note that this is not always possible. Some constructions are rare and
your corpus may not capture them. For example, Nash (1996) discusses the pos-
sible orders and combinations of pronoun clitics in Warlpiri (Pama-Nyungan,
Australia). He established through formal elicitation that it is possible to use a
sequence of four pronouns, even though he was unable to find this sequence in
spontaneous speech.
The approximately 60,000 lines of Warlpiri material available in machine-readable form
conform absolutely to the generalisation that -rla-jinta cannot follow any (non-zero)
123
Morpho-syntax 123
non-subject pronominal clitic [FM: i.e. four pronouns are not possible]. Yet several
Warlpiri speakers on different occasions have agreed that such a combination is well-
formed and interpretable.
(Nash, 1996, p. 132)
If you are an ‘insider’ linguist, the question of which language to perform elicita-
tion in requires no answer. Clearly you would operate in the language under inves-
tigation because you speak it. If you are an ‘outsider’ linguist, the same question
is less straightforward. Ultimately using the target language for elicitation is best
practice. This approach is called ‘monolingual fieldwork’ (cf. Everett, 2001) and
was long advocated for by Ken Hale (1983) who, in anticipation of a comparative
Arandic dictionary for Central Australia, gained mastery of Arandic languages
with his usual legendary speed and used one Arandic language to elicit other
related languages. Indeed in some communities who do not speak a regional lan-
guage such as English, French or Spanish, this is the only approach possible.
It might not be practical or possible to do monolingual fieldwork due to factors out
of your control, for example your prowess as a second language learner. It is wrong to
think that all linguists are good language learners. Not all of us are (and this does not
make you a bad analyst of language)! It also might be hard to learn the language if it is
highly endangered and is only spoken by older members of a speech community. For
example, if you are working on a language in Vanuatu and living with a family, the
language of interaction in the household might be Bislama (an English-based creole
language) rather than the target language. You have a better chance of becoming a
good Bislama speaker than learning the target language. If you do use an intermedi-
ary language such as English, Spanish or a contact language as the language of elicita-
tion, there are many checks you can perform to make sure you are understanding the
structures of the target language correctly, such as ‘back translation’ (§6.6.2).
124 Morpho-syntax
Formal elicitation produces data with the least contextual information (i.e. dis-
course context or real world information) and therefore requires the least equip-
ment. Some people advocate using video for all types of language recordings in
all contexts to make sure every recording is potentially of use to many different
research areas such as gesture studies (e.g. Seyfeddinipur, 2012). For most formal
elicitation, except a few areas of grammar such as spatial relations (§6.7.5) and for
all sign language elicitation (§8.4.4), a notebook and pen (§3.3.4) plus an audio
recorder (§3.2.1 and §3.3.2) and a lapel microphone (§3.2.2) will suffice. After the
recording session, you can also use a transcription program such as CLAN or ELAN
to transfer your notes into a sound-linked digital form (§4.3.2). Note that if you are
writing down sentences as the speaker says them, the speaker might talk slowly and
in a fragmented manner to accommodate your learner status in their language. It
might be best to wait until you are at the point you can discard the notebook as a
transcription tool and just try out sentences on speakers orally and more fluently.
That’s a good point to start using ELAN or CLAN to transcribe audio recordings
after elicitation sessions. The sound-linked utterances will then be a good source
of example sentences for a multi-media dictionary (§7.7.5) or even a sound-linked
grammar (§6.8).
Table 6.1
Tips for determining some common word classes
126 Morpho-syntax
The key to good grammatical elicitation is changing the prompt sentences incre-
mentally. This is a similar principle to the use of minimal pairs in phonology
to establish contrastive sounds, i.e. phonemes (§5.4.1). In the case of gram-
mar, minimally different clause structures establish meaningful units in the
morpho-syntax. The trick with grammatical elicitation is not to change more
than one category at a time because these categories might be interdependent.
For example, if you are interested in the way the ergative marker works in an
Indo-Iranian language, such as Hindi, you don’t want to change both the tran-
sitivity and perfectivity of the sentence because both of these affect the use of
ergative marking (ergative case is used on subjects of transitive verbs marked for
perfective aspect).
In the following English examples you can see how sentences are changed incre-
mentally to reveal grammatical relations, tense and number. The bolded parts show
the changes in the clauses:
At the end of each elicitation session, write up your current ideas on the gram-
matical categories of the language and devise follow-up sentences which explore
particular categories in more depth.
127
Morpho-syntax 127
128 Morpho-syntax
are they referring to the whole animal, part of it, its action or an exclamation
“Hey a rabbit!”? Put the word in a sentence, and try inflecting it as a noun
or verb.
• The use of a vernacular language, such as a colonial English or variety or
creole, can be problematic. As an L2 learner of the variety, you need to be
careful that words mean what you think they mean. For example, if some-
one exclaims “Oh here comes that cheeky dog!” in Australian Kriol –adopt
a defensive pose. Don’t try to pat the dog! Cheeky means ‘dangerous’ in
Kriol. And in general, make sure you record translations of these words
using proper translations not pseudo-English/French/Spanish etc. which
doesn’t distinguish the semantics of the standard and vernacular varieties.
Don’t assume that the vernacular language forms have the same semantics
as the standard form of the language.
• When you are fairly certain of your analysis and you start glossing exam-
ples, use the Leipzig Glossing Rules,1 and where morphemes do not have a
gloss, use the conventions developed by other linguists for languages in the
same language family or region. Do not invent new glosses (even if your
analysis of the category is slightly different from others, in which case you
should describe this in your grammar). It causes no end of headaches for
other linguists and community members reading these grammars trying to
figure out your category if you invent new terms.
Morpho-syntax 129
When you come to writing up your grammar, where possible, try to use exam-
ple sentences from your corpus that reflect that culture of the speech community.
Of course you will need to resort to elicited examples for rare constructions (§6.1).
130 Morpho-syntax
Human languages are similar in some ways, but incredibly diverse in others.
Making sense of this diversity is one factor that drives field linguists to docu-
ment language after language. The discovery of new linguistic categories and
the different instantiations of well-known categories contribute much to under-
standing the design space of language. This section covers many of the major
linguistic categories that most languages encode –core grammatical relations,
verb distinctions, pronoun distinctions, spatial relations, number and informa-
tion structure; as well as major language family-specific categories such as noun
classes and evidentiality. We also discuss major morphological categories such
as clitics vs affixes and derivation vs inflection. Throughout these sections, we
give references to key typological works which provide cross-linguistic surveys
of particular constructions and parts of speech. Other excellent resources are
the World Atlas of Language Structures (WALS)2 and the Atlas of Pidgin and
Creole Structures (APiCS).3 Also helpful is Tim Shopen’s (2007) classic series
‘Language typology and syntactic description’ and Thomas Payne’s (1997)
Describing morphosyntax: A guide for field linguists. Another great resource is
the ‘Typological tools for linguistic research’ questionnaire series developed by
the MPI for Evolutionary Anthropology in Leipzig.4 Use these references, as well
13
Morpho-syntax 131
The main ways nouns distinguish grammatical roles are by word order (e.g. English),
case-marking (e.g. Finnish) or both word order and case-marking (e.g. German).
Pronouns, which are a closed class, often distinguish roles using different forms i.e.
suppletive case-marking (e.g. English) rather than regular case-markers (§6.7.4).
Usually either A or O is marked the same as S (through word order, morphology or
form). When A and S are grouped together (typically unmarked), and O is distin-
guished by case-marking or form, you have an accusative system. When O and S
are grouped together (typically unmarked), and A is distinguished by some kind of
marking or form, you have an ergative system (Figure 6.1).
For example, in German, grammatical relations are marked through case in the
determiner system. German is an accusative language because A and S determiners
have the same form e.g. der (masculine) 3(a) and (c). Word order also helps.
German is a verb second language and the subject usually comes before the verb but
it can also come after the verb 3(c). Follow the bolded determiners in (3) to see how
they group subjects and distinguish the object from subjects.
Figure 6.1
Groupings of A (transitive subject), S (intransitive subject) and O (object) in an accusative system
(non-dotted line) and ergative (dotted line) system
transitive clause A O
intransitive clause S
132
132 Morpho-syntax
English groups A and S through word order (both are preverbal). Other languages
such as some Berber, Cushitic, Omotic (East Africa), Yuman (California) and
Austronesian languages mark A and S morphologically and leave O unmarked.
These languages are called marked-s or marked-nominative languages
(Handschuh, 2015). In contrast, Pitjantjatjara shows an ergative pattern because S
and O are unmarked, and A is marked morphologically.
Some languages have internal splits between accusative systems and ergative sys-
tems (but see Goddard, 1982). Silverstein (1976) explains ergativity splits based on
the inherent lexical content (with relevant features being person and animacy) of a
referring expression. Pronouns are more likely to pattern as accusative and nouns
are more likely to pattern as ergative (Figure 6.2).
13
Morpho-syntax 133
Figure 6.2
Silverstein’s hierarchy and ergative/accusative splits
To find out whether the language marks these roles differently depending on the
Silverstein hierarchy, the following sequence of elicitation sentences can be useful:
Other argument relations systems to look out for are direct-inverse systems, com-
mon in Algonquian and Athabaskan languages (Klaiman, 1992). These systems
consist of two configurations –the direct construction, usually unmarked, is used
when the subject of the transitive clause outranks the object in the person hierarchy,
and the inverse is used when the object outranks the subject. Sentences (7)(d)–(g)
can help determine whether this system is found in the language.
Case may also be optionally marked in some languages, with the grammatical
relation of the referring expression unaffected. Optional ergativity is common in
Tibeto-Burman and Australian languages and is often driven by animacy, informa-
tion structure and perfectivity (McGregor, 2010b). Objects may also be optionally
marked (or receive different case-markers) and this is called differential object
marking, for example Hindi (Indo-Aryan, India) marks human objects with the
accusative case suffix –ko, but only optionally marks non-human objects. Where
non-human objects are marked they are definite (de Hoop & de Swart, 2008). You
will need to use different techniques other than elicitation to sort out this type of
variation (see §10.3.3 for some methods).
134 Morpho-syntax
Figure 6.3
Some Aktionsart categories
Durative
Stative Event
SITUATION TYPE Semelfactive
Dynamic
Telic
Process
Atelic
inflected verbs also often differ, forming different conjugations or verb classes.
Voice and directional categories such as hither, thither and associated motion
(see §6.7.5) are also often marked on verbs. Verbs can also combine with other
verbs, forming complex predicates such as serial verbs. This section deals with
each of these categories in turn.
Aktionsart is often categorised according to the kinds of situation types seen in
Figure 6.3. You can try eliciting the following types of sentences to determine the
Aktionsart of verbs (but note that not all tests will be available in all languages).
Unlike dynamic verbs e.g. drive, hit, sneeze, learn, write, stative verbs e.g. be,
have, remain, know, love, fear will not combine with a progressive marker (8)(c)
and generally can’t be used in imperatives (8)(d). Semelfactive verbs, as opposed
to durative verbs, refer to an event which takes an instant to accomplish and can
be elicited using punctual adverbials (8)(e). Time-span adverbials can be used to
distinguish atelic from telic verbs.
Aktionsart interacts with grammatical aspect which is used to pick out some tem-
porally specifiable part of a situation. Simplified aspectual contrasts can be repre-
sented as in Figure 6.4.
Habitual aspect expresses the occurrence of an event or state as characteristic
of a period of time e.g. I eat dinner. Perfective aspect describes the situation as a
whole with no reference to how it unfolds in time. It ‘telescopes’ a situation to a
point e.g. I ate dinner. Imperfective aspect attends to the internal temporal struc-
ture of a situation e.g. I am eating dinner.
135
Morpho-syntax 135
Figure 6.4
A simplified aspectual schema
Habitual
ASPECT Perfective
Non-Habitual
Imperfective
Some languages carve up time in more detail, making distinctions between remote
and recent past and future. Some languages make no tense distinctions, instead
distinguishing between realis/irrealis mood. They indicate that a certain situation
or action is (not) known to have happened at the time the speaker is talking. Often
languages have a combination of tense and realis distinctions and these can be quite
hard to tease out, for example future tense and potential mood often have a lot of
overlap, and often it is difficult to figure out whether a language has a future/non-
future tense or an irrealis/realis mood distinction. Other types of mood categories
which languages can encode are subjunctive, conditional, imperative, dubitative
and hortative. Payne (1997) provides good contexts and example sentences for sort-
ing out these distinctions.
Often the pattern of inflection in a language categorises verbs into groups,
referred to as conjugation classes. Like noun classes (§6.7.8), verbs of the same
conjugation class have a particular pattern of inflection i.e. members of a given class
typically inflect in the same way (take the same set of inflections) as one another.
Eliciting all of the conjugations for every verb can seem like a daunting task, but
often conjugations are minimally different and there is frequently one form which
distinguishes them. For example in Bilinarra, the potential or imperative inflections
are the best distinguished so you can categorise a verb on the basis of one example
(see Table 6.2).
136
136 Morpho-syntax
Table 6.2
Verb conjugations in Bilinarra, which has five major classes of verbs and a number of irregular verbs
Voice alternations which affect transitive clauses are also encoded on the verb.
Active voice e.g. I held the book is the unmarked configuration of the subject and
object. Passive voice promotes the patient to subjecthood, often demoting the agent
to an optional element and marking the verb in some way e.g. The book was held
by me. Middle voice is another way to express a transitive verb where there is a
sense that the agent is acting on itself e.g. The book flipped open. On the other hand,
the antipassive in ergative languages removes the object from core argument struc-
ture. See some examples from Choctaw (Muskogean, Mississippi/Oklahoma) and
Kabardian (Abkhaz–Adyghe, Caucasus) in (10) and (11).
(10) Choctaw (Foley & Van Valin Jr, 1985, pp. 322–323)
Bill at okhisa an tiwwih.
Bill SUBJ door OBL he_opens_it
Bill opens the door. (Active voice)
Okhisa at tinwah.
door SUBJ open-MID
The door opens. (Middle voice)
137
Morpho-syntax 137
Complex predicates are single verbal predicates such as light verb constructions
and serial verbs which are encoded in two or more types of verbs. The literature on
complex predicates is riddled with a confusion of constructions and terminology,
often specific to particular language families (and even plenty of variation within
the literature on families). If you come across complex predicates, read Butt (2003),
who provides some useful navigational tools through this “jungle” of terminology,
as she calls it. Several general volumes which are also useful are: Aikhenvald and
Dixon (2006) and Amberber, Baker & Harvey (2010).
6.7.3 Possession
Every language has a way of expressing possession. Some good overviews of the
way that possession is marked across the world’s languages include McGregor
(2010a) and Dixon & Aikhenvald (2015). Sometimes possession is also marked
through pronouns (see §6.7.4). Possession is often expressed by marking the
dependent of a noun phrase with genitive case (or often dative case) i.e. John’s
book; or by marking the head of the noun phrase with possessive suffix i.e. John
book-his.
Some languages divide nouns into different types based on the possessive con-
struction they participate in. For example a distinction is often made between
alienable and inalienable possessums (the possessed object). Cross-linguistically,
inalienable nouns tend to be body parts and close kin, and alienable nouns are
all nouns which do not participate in a part-whole relationship (Chappell &
McGregor, 1995).5 Try out these types of sentences and see how speakers express
possession:
138 Morpho-syntax
• Person
○ first, second and third person
○ Sometimes first person pronouns distinguish inclusive (including the
hearer) and exclusive (excluding the hearer) categories
• Gender
○ e.g. no gender distinctions
○ e.g. masculine and feminine in third person pronouns
○ e.g. animacy.
To elicit gender distinctions in pronouns, you will need to take advantage of their
anaphoric function (unless gender is encoded in the first and second person pro-
nouns also). Try a sequence like (13). This sequence changes gender, but you can
use it for other categories e.g. animacy.
(13) (a) The man went to the shop and pro bought an apple.
(b) The woman went to the shop and pro bought an apple.
(c) The bus went to the shop and pro brought lots of people.
139
Morpho-syntax 139
To elicit case distinctions in pronouns, use similar methods suggested for gram-
matical relations above. This sequence changes the transitivity and voice of verbs
and therefore the grammatical role but you can use it for other categories. Again
start with third person because it is less confusing:
140 Morpho-syntax
Table 6.3
Bilinarra pronouns assumed to be a singular –dual –plural system
Table 6.4
Bilinarra pronouns as a minimal-UA-augmented system with the crucial pronoun highlighted
MINIMAL UA AUGMENTED
Other relevant categories for pronouns are possession (§6.7.3), reciprocals (‘they
hugged each other’) and reflexives (‘he hugged himself ’). Two useful typological
overviews are Nedjalkov (2007) and Evans et al. (2011). If you are interested in
working in more depth on reciprocals, good stimulus materials can be found on the
MPI (Nijmegen) website.6
Morpho-syntax 141
ablative, inessive, elative, illative, perlative and terminative. See Blake (1994) for a
good discussion of different types of local cases and their functional range.
142 Morpho-syntax
6.7.6 Number
Languages mark number in different parts of the grammar, typically as numerals,
noun morphology or pronouns (see §6.7.3). A good overview of number systems
is given by Corbett (2000). Number is not a major category in all languages. Some
languages have no number morphology, such as Vietnamese.
Where a language marks number, it may make a singular/plural distinction as
in many Indo-European languages such as English. Other languages make a three
way singular/dual/plural number distinction, such as Jingulu (Mirndi, Australia)
(Pensalfini, 2003).
Morpho-syntax 143
a sense of what to expect if you begin to suspect that the language you’re working
on classifies nouns.
The distinction between gender and noun class systems is often talked about
as being based on the presence of agreement, although this distinction has been
problematised and gender can be thought of as a special kind of nominal classi-
fication (Corbett & Fedden, 2016). For example, Jingulu (Mirndi, Australia) dis-
tinguishes four genders: masculine, feminine, vegetable and neuter; and adjectives
and demonstratives show agreement in gender with the head noun (Meakins &
Pensalfini, 2016, p. 426):
Nouns are categorised into classes in different ways. Sometimes inherent seman-
tic dimensions, such as biological gender and animacy, are relevant. For example,
Tamil (Dravidian, Sri Lanka) has a purely semantic system with three classes that
divide nouns into male humans, female humans and other. For other languages,
semantics provides the basis for the system but other factors play a large role, for
example German phonology gives a lot of clues as to noun class assignment. For
instance, words ending in -e and -ung are more often than not feminine. One
method for trying to determine what principles underlie the classification is to
try borrowing words from the vernacular language into the system to see if the
phonology or semantics play a role.
14
144 Morpho-syntax
6.7.9 Evidentiality
Evidentiality expresses the nature of evidence for a given statement, i.e. whether
the speaker knows it as a fact, whether they heard it from someone else (hear-
say), whether they witnessed it etc. In English, evidential categories are expressed
through verbs and adverbs:
○ inferential
☐ information inferred by direct physical evidence
situations
☐ past deferred realisation.
• Reportative
○ hearsay
○ quotative e.g. suffix meaning ‘X said’
☐ Assumed.
Morpho-syntax 145
Table 6.5
Evidentiality and tense marking in Tariana
146 Morpho-syntax
These types of clitics are called simple clitics and they are the easiest to distinguish
from affixes because they have a corresponding full word form. Special clitics, such
as the genitive ’s in English, do not have a corresponding full word form, making
them harder to distinguish from affixes. Some general rules of thumb developed by
Anderson (2005) and Bauer (2003) are:
• Clitics can combine with many different word classes whereas affixes are
usually restricted by word class.
○ e.g. The man I was talking to’s got a cold (preposition), The cold you just
caught’s gotten worse (verb), The cure’s really easy (noun)
147
Morpho-syntax 147
• Affixes attach to lexical categories whereas clitics can attach to phrasal cat-
egories. This means they interact with syntax more than inflectional affixes.
○ e.g. Sophie’ll go to town (noun) vs The queen of England’ll go to town
(phrase)
• Clitics don’t tend to have irregular forms whereas irregular (and suppletive)
forms of inflectional affixes are common.
• There are no syntactic operations e.g. passivisation, which treat a word com-
bined with a clitic as a unit. e.g. The cat’s just eaten the mouse > *The mouse
just been eaten by the cat’s.
• Further clitics can attach to clitics; however, inflectional affixes cannot
attach to clitics i.e. clitics do not form new stems with their hosts. e.g. The
handle-er-s=ve forgotten to bring the animals.
Formal elicitation can form the basis of a number of fieldwork outputs, includ-
ing formal or comparative typological treatments of the morpho-syntax, or a full
descriptive grammar of the language. Many publishers are now open to innovative
ways of incorporating your audio-visual corpus. For instance, Mouton has pub-
lished the first sound-linked grammar of a language (Meakins & Nordlinger, 2014).
If you go down this path, think carefully about copyright agreements with publish-
ers. It is unlikely the speech community will want publishers to take copyright over
their voices and stories (§2.4.4). There are also a number of options for producing
a hypertext grammar which makes sense from the point of view of the user (gram-
mars are rarely read from cover to cover!) (Nordhoff, 2012).
Learner’s guides for non-linguists are also a useful by-product of grammatical
analysis. Learner’s guides are often requested in communities where the language is
endangered and there are many non-speakers; or where they have large numbers of
outsiders working in the community. Learner’s guides, or parts of them, also some-
times find their home in dictionaries (see §[Link]).
Learning Lakota
Lakota is a Siouan language spoken by Lakota people of the Sioux tribes in
the states of North and South Dakota in the United States. It is one of the few
First Nations languages of North America which continues to have a strong
speaker base. The language was first written down by missionaries around the
1840s. The recent Lakota Grammar (Figure 6.5) from the Lakota Language
Conservancy9 is a learner’s guide in that it has been written for pedagogical
purposes, but it still has quite a bit of grammatical detail. The learner’s gram-
mar is a part of the Lakota Language Conservancy’s efforts to standardise and
professionalise Lakota language teaching in tribal and neighbouring schools.
148
148 Morpho-syntax
Figure 6.5
Front cover of Lakota Grammar Handbook
Morpho-syntax 149
6.9 SUMMARY
Ameka, Dench & Evans (2006) and Nakayama & Rice (2014) contain a number
of useful articles on different aspects of grammar writing. Shopen’s (2007) classic
series Language typology and syntactic description and Payne’s (1997) Describing
morphosyntax: A guide for field linguists provide good direction for sorting out
morpho-syntactic categories. Bouquiaux & Thomas (1992) is a 700+ page English
translation of the original 1971 French field manual which has a 6,400 sentence
questionnaire – not for the faint hearted! Sebastian Nordhoff ’s (2009) grammar of
Sri Lankan Malay is model for a hypertext grammar. The ebook version of Meakins
& Nordlinger’s (2014) grammar of Bilinarra contains sound links to example
sentences.
NOTES
150 Morpho-syntax
REFERENCES
Morpho-syntax 151
Foley, W., & Van Valin Jr, R. D. (1985). Information packaging in the clause. In T. Shopen (Ed.),
Language typology and syntactic description (Vol. 1, pp. 282–354). Cambridge: Cambridge
University Press.
Fortescue, M., Evans, N., & Mithun, M. (2017). The Oxford handbook of polysynthesis. Oxford:
Oxford University Press.
Goddard, C. (1982). Case systems and case marking in Australian languages: A new interpre-
tation. Australian Journal of Linguistics, 2, 167–196.
Hale, K. (1983). A lexicographic study of some Australian languages: Project description.
In P. Austin (Ed.), Australian Aboriginal lexicography (pp. 71–107). Canberra: Pacific
Linguistics A-66.
Handschuh, C. (2015). A typology of marked-S languages. Berlin: Language Sciences Press.
Himmelmann, N. (1996). Demonstratives in narrative discourse: A taxonomy of uni-
versal uses. In B. Fox (Ed.), Studies in anaphora (pp. 203–252). Amsterdam: John
Benjamins.
Himmelmann, N. (1998). Documentary and descriptive linguistics. Linguistics, 36, 161–195.
Klaiman, M. (1992). Inverse systems. Lingua, 88, 227–261.
Laidig, W., & Laidig, C. (1990). Larike pronouns: Duals and trials in a central Moluccan
language. Oceanic Linguistics, 29, 87–109.
Levinson, S. (2003). Space in language and cognition: Explorations in cognitive diversity.
Cambridge: Cambridge University Press.
Levinson, S. (2006). The language of space in Yélî Dnye. In S. Levinson & D. Wilkins (Eds.),
Grammars of space: Explorations in cognitive diversity (pp. 157– 203). Cambridge:
Cambridge University Press.
Levinson, S., Cutfield, S., Dunn, M., Enfield, N., & Meira, S. (Eds.). (2017). Demonstratives in
cross-linguistic perspective. Cambridge: Cambridge University Press.
McGregor, W. (2010a). The expression of possession. Berlin: Mouton de Gruyter.
McGregor, W. B. (2010b). Optional ergative case marking systems in a typological-semiotic
perspective. Lingua, 120(7), 1610–1636.
Meakins, F., & Nordlinger, R. (2014). A grammar of Bilinarra: An Australian Aboriginal lan-
guage of the Northern Territory. Boston, MA: De Gruyter Mouton.
Meakins, F., & Pensalfini, R. (2016). Gender bender: Disagreement in Jingulu noun class
marking. In F. Meakins & C. O’Shannessy (Eds.), Loss and renewal: Australian languages
since colonisation (pp. 425–450). Berlin: Mouton de Gruyter.
Moravcsik, E. (1978). Case-marking of objects. In J. Greenberg, C. Ferguson, & E. Moravcsik
(Eds.), Universals of human language (pp. 249–289). Stanford, CA: Stanford University
Press.
Nakayama, T., & Rice, K. (2014). The art and practice of grammar writing. Honolulu, HI:
University of Hawai‘i Press.
Nash, D. (1996). Pronominal clitic variation in the Yapa languages. In W. McGregor (Ed.),
Studies in Kimberley languages in honour of Howard Coate (pp. 117–138). Munich:
Lincom Europa.
Nedjalkov, V. (2007). Reciprocal constructions. Amsterdam: John Benjamins.
Nordhoff, S. (2009). A grammar of Upcountry Sri Lanka Malay. Utrecht: LOT.
Nordhoff, S. (2012). Electronic grammaticography. Honolulu, HI: University of Hawai‘i Press.
Payne, T. (1997). Describing morphosyntax: A guide for field linguists. Cambridge: Cambridge
University Press.
Pensalfini, R. (2003). A grammar of Jingulu: An Aboriginal language of the Northern Territory.
Canberra: Pacific Linguistics.
Quine, W. (1960). Word and object. Cambridge, MA: MIT Press.
152
152 Morpho-syntax
Schultze-Berndt, E. (2000). Simple and complex verbs in Jaminjung: A study of event categori-
sation in an Australian language. Wageningan: Ponsen and Looijen.
Schultze-Berndt, E. (2006). Sketch of a Jaminjung grammar of space. In S. Levinson &
D. Wilkins (Eds.), Grammars of space: Explorations in cognitive diversity (pp. 63–114).
Cambridge: Cambridge University Press.
Seyfeddinipur, M. (2012). Reasons for documenting gestures and suggestions for how
to go about it. In N. Thieberger (Ed.), The Oxford handbook of linguistic fieldwork
(pp. 147–165). Oxford: Oxford University Press.
Shopen, T. (2007). Language typology and syntactic description (Vols. 1–3). Cambridge:
Cambridge University Press.
Silverstein, M. (1976). Hierarchy of features and ergativity. In R. M. W. Dixon (Ed.),
Grammatical categories in Australian languages (pp. 112–171). Atlantic Highlands, NJ:
Humanities Press.
Skopeteas, S. (2010). Word order and information structure: Empirical methods for linguistic
fieldwork. Unpublished Work, Potsdam.
Wilkins, D. (1989). Mparntwe Arrernte (Aranda): Studies in the structure and semantics of
grammar. PhD thesis: Australian National University.
153
7.1 INTRODUCTION
Once you have some grounding in the phonological and grammatical analyses of
the language, you will need to start investigating how the language conveys mean-
ings. This can be a very exciting process as you learn about different ways of describ-
ing, categorising and speaking that may seem very unfamiliar at first, at least if it’s
not your own language. In this chapter we consider methods and techniques to
elicit the meanings of words. Different types of words require different types of
investigation. For example, for kin terms, you will need to have an understanding
of the social categories in the culture to elucidate their meanings. For plants and
animals, it helps to have an understanding of the Linnaean system of classification
or at least a reference guide with this information.
We can approach the issue of identifying meaning from two angles. We can
start from a lexical form and explore its different meanings; or we can begin with a
domain of meaning and explore all the different ways such meanings are expressed.
For example, we can start with a body part term such as nuu ‘throat’ in Yélî Dnye
and explore its occurrences in different contexts. The word nuu ‘throat’ combines
with different verbs and nouns to express emotions in different ways as seen in
the example A nuu u tpile “A thing I really like” (literally, “My throat its thing”)
(Levinson, 2006, p. 237). Alternatively, we can take a broader approach and begin
with a semantic domain, such as ‘emotion’ where we find many lexical items (e.g.
‘sad’, ‘happy’). Both strategies are useful when delving into semantics.
One of the most useful outcomes of linguistic fieldwork is a dictionary. Not only
linguists, but other social scientists, the speech community and people working
in the community in a diverse range of employment contexts may all benefit from
access to a dictionary of the language. The basis of the dictionary is the lexical data-
base, which is also an excellent way of compiling and organising the results of your
fieldwork.
In this chapter, we first consider how to elicit vocabulary using a spoken language,
where we cover semantic domains and taxonomies (§7.2); and then elicitation using
non-linguistic stimuli such as pictures and videos (§7.3). We also consider how to
elicit special register vocabulary (§7.4) and look at ways of making a simple dic-
tionary (§7.5) before moving on to a discussion of the ways that more complex
information can be included in dictionaries (§7.6). Finally, we consider digital tools
available for making and displaying dictionaries (§7.7).
DOI: 10.4324/9780203701294-7
This chapter has been made available under a CC-BY-NC-ND license.
154
In your first fieldwork session you might start by exploring some everyday well-
known vocabulary such as greetings, kin terms and words for objects in the natural
environment that are easily observable or readily at hand. Alternatively, you may
try eliciting vocabulary items from a wordlist (§5.2). There are many short word-
list questionnaires of basic vocabulary. The most well-known are the 100 and 200
word Swadesh (1950) lists. These were designed to enable cross-linguistic compari-
sons and thus they aim for universal and non-culturally specific concepts. There are
also many geographically focused wordlist questionnaires which include culturally
important vocabulary and omit irrelevant vocabulary. Some well-known regional
lists are given below:
• Wordlist for Australian Languages (Menning & Nash, 1981; Sutton & Walsh,
1987)
• CALMSEA (Culturally Appropriate Lexicostatistical Model for South East
Asia) (Matisoff, 1978)
• Sino-Tibetan Etymological Dictionary and Thesaurus word list questionnaire1
• Comparative African Wordlist.2
Most of these questionnaires are in English. These short wordlists can be useful if
you only have limited time in the field or if you unexpectedly have a brief oppor-
tunity to record a speaker of a language for which there is little documentation. To
obtain a much larger vocabulary and elucidate the semantics of the words, elicita-
tion from longer wordlists grouped in semantic domains is required.
Figure 7.1
AIATSIS list of semantic domains for Australian languages
J: birds W: interrogatives
M: plants Z: pronouns
(see §4.2.3). Mosel (2004, p. 45) proposes a method she calls ‘active eliciting’, where
language consultants choose their own semantic domains –for example, terms for
different types of food, food preparation and so on.
Before you go into the field, select the appropriate semantic domain codes for
the region and expand each domain by adding concepts that you predict might be
lexicalised in the language. Consider other possible categorisations. For example,
the domain of ‘birds’ may not include emus and cassowaries because they don’t fly,
as is the case in many languages of Australia and Papua New Guinea. Conversely,
some Australian languages class ‘bat’ and ‘grasshopper’ as ‘birds’. Languages with
noun classes or gender may make it possible to access these language-specific
categorisations, for example, emus may not receive the same gender as other birds
(§6.7.8).
Wordlist elicitation is often noun heavy, so pay particular attention to eliciting
verbs that might collocate with these nouns (e.g. ‘fish’ and ‘swim’). One strategy
to find words is to use a dictionary of a related or neighbouring language to elicit
156
Table 7.1
An example of a semantically based elicitation list
A1.2 leg
A1.2.1 knee
A1.2.2 kneecap
A1.2.3 thigh
A1.2.4 calf
words. However, make sure to cross-check this data with a range of speakers, as it is
easy to introduce errors in this way and to over-state similarities between languages.
Your ability to detect errors will improve if you have close collaborations with bilin-
gual speakers and as you learn more about the language.
It is a good idea to use a spreadsheet and put each elicitation word into semantic
domains or sub-semantic domains so that you can sort your data in different ways.
An example is given in Table 7.1 where A1 is the code for body parts and A1.2 is the
code for parts of the leg.
One way to assess whether you have covered a semantic domain thoroughly is
to compare the number of words you have found with that of a similar language.
For example, in Oceanic languages you might compare the number of words in the
domain of ‘fish’ in one dictionary with the number you find in the language you are
working on (Pawley, 2012). This can be a quick way to work out where you need
to do more research, but note that it is possible there are real differences in the size
of the lexicon, even when comparing the languages of two similar cultures. The
domain of kin terms is often one in which there is a large degree of non-equivalence
cross-linguistically. If the kinship system is complex, it is a good idea to work out
genealogies with research participants to help you map out the precise denotation
of terms.
can’t find the relevant specialist to help you, you will have to rely on field guides,
plant/animal checklists and photos (§2.7.4). Bear in mind though that the most
reliable way of collecting words and meanings in the natural world domain is by
first-hand observation and experience. A scientific identification based on showing
a speaker a photo of a species and asking for its name is going to be less reliable than
one based on a context where the speaker comes across the species in its natural
environment and volunteers a name for it. The context, including habitat, sounds,
smells, time of day etc., is often critical to people’s ability to correctly identify spe-
cies. Furthermore, being in situ often leads to more insightful information about
meanings, lexical relations and vocabulary, as well as cultural and general knowl-
edge. The example of the Alyawarr bush potato plant at the beginning of this book,
and on the front cover, was drawn from on-the-ground experience –locating the
plant, digging the tubers up, then cooking and eating them.
Table 7.2
Literal names for the constellation Ursa Major in Alaskan languages
Other natural systems are interesting to explore for their ethno-c ategories.
In many cultures, knowledge of the natural world is deeply embedded in
religion and mythology. The Alaskan example in Table 7.2 is taken from the
domain of astronomy and it shows the names many Alaskan languages use
for the star constellation Ursa major. For the linguist, it can be very difficult
to know what particular constellation is being named if a speaker points one
out. Pictures or star charts can help when you are in the field, but if there is
the opportunity to take one or more language speakers to a planetarium this
can be a lot of fun.
Language used in more natural settings such as narratives, conversations and pro-
cedural texts (to name just a few genres) can provide crucial evidence about the
meanings of words. These extended texts can also shed light on patterns of poly-
semy and taxonomies through collocations and the cultural information they may
convey. Nonetheless they may not provide answers to some questions of semantic
scope. Furthermore, a corpus of a lesser-known language rarely includes all the
words in the lexicon and often there are not enough instances of the terms whose
semantics are in question, making it difficult to extrapolate about meaning or
grammar (Hellwig, 2006). In these situations, non-linguistic stimuli, such as visual
prompts (pictures, videos and three-dimensional objects) and even those based
on sounds, tastes or smells can be used either (semi-)experimentally or simply as
prompts to collect data. The use of non-linguistic stimuli for elicitation is a good
way to get at “the juicy semantic detail of the language” (Majid, 2012, p. 3). They can
help delineate meanings, produce excellent illustrative examples, and even provide
content for definitions that may later be used in dictionaries (§7.6.2). Such stimulus
materials can be used across languages, thus creating corpora of comparable data
for cross-linguistic semantic analysis.
Many (semi- )experimental tasks and stimulus sets have been designed
by the Language and Cognition group at the Max Planck Institute for
Psycholinguistics (MPI) in Nijmegen (the Netherlands) as a part of their research
on semantic typology.8 They include, for example, stimulus sets for investigat-
ing spatial cognition and demonstratives (§6.7.5), event categorisation, time,
perception and emotion, tactile texture, taste, smell and colour. Descriptions of
the tests and the stimulus materials can be found on the MPI website and can
be used with permission and proper attribution.9 Majid (2012) also provides an
excellent chapter on the use of these materials. The advantage of these ques-
tionnaires and tasks is that they help elucidate both cross-linguistic differences
between languages, and inter-speaker variation within a community. They have
also been used to explore domains beyond semantics, for example morpho-
syntactic distinctions such as reciprocals (§6.7.4).
The MPI stimulus sets are mostly conducted as director-matcher tasks which are
designed around sets of stimuli, usually pictures or videos, that are minimally dif-
ferent to each other. The ‘director’ has a board with the pictures facing away from
the ‘matcher’ who listens to the sentences (see §[Link] for a picture of a director-
matcher task in action). The ‘matcher’ has a second set of free cards in front of her.
The ‘director’ asks for a card, and the ‘matcher’ selects the matching card and gives
16
When creating a picture, video or object-based stimuli for elicitation make sure
the content is easily identifiable to speakers. Aim for tasks that are fun –amateur
videos with local content can be a good way of introducing variety to the field-
work tasks, and if you are in them yourself even better! Well before you begin
the tasks with speakers, you should trial the images with one or two speakers to
ensure that people know what they represent. You may need to modify or reject
162
During fieldwork you may encounter words that are only used in particular social
settings. Such special ways of talking are referred to as ‘special’ or ‘alternate’ regis-
ters. For example, in many Australian languages a respect register, also referred to as
‘mother-in-law’ or ‘avoidance’ language, is prototypically used in the presence of, or
to refer to, particular kin relations (particularly mothers-in-law and sons-in-law).
163
These languages have specialised vocabulary that is used to refer to these relations,
their actions and even everyday items with which they are associated, such as meat,
fire and water. Other registers may be used in mourning or in ceremonial or reli-
gious contexts. In the Papuan language Kalam, there is a special register known
as ‘Pandanus language’, which consists of around 1,000 words and phrases that
are used when people are harvesting and cooking pandanus nuts (Pawley, 1992).
As well as having unique vocabulary, these words may conflate semantic distinc-
tions made in the everyday speech register. Everyday words may also have different
meanings in special registers. For example, in formal registers plural pronouns may
have singular reference.
It is very exciting to come across these specialised kinds of speech in your field-
work. However, it is not always appropriate to record special registers in their natu-
ral context of use, which may be intensely personal and emotional, for example
during bereavement. In such cases it may be more appropriate to record discussions
about these registers. However, it is important to try to understand the social con-
text in which they are used. A good starting point is to compile what has been writ-
ten about these registers in similar cultures or languages. You are then in a much
better position to ask sensible questions about the sorts of contexts one would and
wouldn’t use them in.
In highly endangered languages, special registers are often associated with prac-
tices that are at the core of people’s cultural and social identity. As such, a doc-
umentation of respect or polite vocabulary, with demonstrations of how these
words are used and in which situations, can be highly valued by the community.
Documentation of these registers can be incorporated into language and culture
programs, cross-cultural inductions and resources aimed at promoting the com-
munity’s language and culture.
Table 7.3
A comparison of some words in Ordinary Awiakay and Mountain Awiakay
awin ‘mountain bird, spirit of a dead man’ tine pawiakay ‘red bird’
7.5 LEXICOGRAPHY
definition forms the central part of the entry. In contrast, bilingual dictionaries
typically translate from one language to another. Bilingual dictionaries can be
unidirectional, listing words of one language and translating these into another,
or bidirectional, allowing translation to and from both languages. For endangered
languages that are often the focus of fieldwork this other language is usually a
major language such as English, Spanish or French. Sometimes a third language,
such as the national language, is also included (see Figure 7.8). But even unidi-
rectional bilingual dictionaries usually have a section, typically at the end, that
reverses the direction of the two languages to some degree. This is known as a
finder list or reversal (§7.6.6).
A dictionary can be published in either print or digital form. As long as your
data and workflows are well structured, both options may be open to you. A print
dictionary can be held in the hand and can have a powerful symbolic impact.
You can also flick through a book to see non-adjacent words easily. A print dic-
tionary may be the only practical option in communities with limited access to
modern technologies (Mosel, 2011, p. 339), but the situation worldwide is chang-
ing rapidly. Digital dictionaries have a number of advantages. One is being able
to search for words in multiple ways, reducing the dependency on alphabetical
order as a search principle. It is now common for digital dictionaries to have
a ‘fuzzy search’ function; where search results show multiple words even when
the search text has been misspelt. A digital dictionary also has the advantage of
being able to incorporate sound and video. And while the cost and size of print
dictionaries may mean limiting the amount of information included, digital dic-
tionaries are not constrained in this way so long as the images and audio are the
right resolution. A digital dictionary can also be updated incrementally at low
cost. If the internet is available, a digital dictionary can be disseminated widely
on different types of devices, such as on mobile phones. Various digital interfaces
for dictionaries are discussed in §7.7.2.
It is a good idea to get drafts of the dictionary out as soon as possible and circu-
lated in the community to get feedback on layout, usability and of course errors. If
most community members have poor eyesight, then you should also consider care-
fully the size of the font if making a print dictionary.
Figure 7.2
A page from the Central Anmatyerr Picture Dictionary that illustrates some person terms
Source: Green, 2003, p. 3
167
A dictionary of Bikol
Not long ago I came across a nice example of how an encyclopedic or eth-
nographic dictionary can serve as a data resource for future generations,
centuries after it was compiled. Malcolm Mintz, an Austronesianist who
specialises in Bikol, spoken in southern Luzon, a few years ago published
a book about Bikol society around 1600 (Mintz 2011). His book contains
chapters giving detailed accounts of food and society, war and conflict,
crime and punishment, religion, childhood and the family, and rice and
agricultural and marketing practices. Where did he get his data? Largely
from a remarkable dictionary, compiled around AD 1600 by the Franciscan
padre, Marcos de Lisboa (1754), who lived for ten years among the Bikol
people.
(Pawley, 2015, p. 138)
Figure 7.3 shows part of a page of a bilingual dictionary of the Anmatyerr lan-
guage from Central Australia, showing some of the types of information in a typical
entry. Most prominent are the headwords, usually displayed in a distinctive font
and arranged in alphabetical order. In this entry homonyms, words that are spelt
and sound the same but have distinct meanings, are distinguished by superscript
numbers. The entry contains an indication of the part of speech (§6.4), definitions
or glosses for the headwords, and example sentences in Anmatyerr and translated
into English. Dictionary entries also alert the reader to minimal pairs, or words that
sound similar to each other (§5.4.1) and to words with related meanings to those of
the headword. There is information about scientific classification if the word is in
the biological domain.
7.6.1 Headwords
A headword is the first and most prominent word in a dictionary entry. Under
the headword, dictionaries then typically list idioms and derived words.
Alternatively, derived words can be written as separate headwords, thus leaving
it up to the reader to determine any semantic relatedness to the previous or fol-
lowing entries.
A print dictionary relies on searching by alphabetical ordering; however, this
may be problematic in languages that have variability at the start of words and for
polysynthetic languages with a large number of prefixes. One decision to make is
whether to use an inflected form as the citation form or an unnatural root form.
Listing all possible forms is not practical as there may simply be too many, making
the size of an entry span multiple pages, and making it difficult for the user to
know where the entry for a headword starts and ends. For words that have many
different inflected forms, you will need to decide which should be the main head-
word. For example, for verbs you may choose to have a form such as ‘walk’ as the
main headword rather than ‘walks’ or ‘walked’. You might want to include irregular
inflected forms as ‘signpost’ entries to the main form. This can be done automati-
cally in FLEx, a software package discussed in §4.3.2. If the forms are regular, it is
useful to give an example of one fully inflected verb with the citation form bolded
in the front matter of the dictionary or as an appendix. A similar decision needs to
169
Figure 7.3
A guide to the parts of a dictionary entry in the Central & Eastern Anmatyerr to English Dictionary
Source: adapted from Green, 2010, p. 411
be made with variant forms. Remember there may be no right or wrong decision,
as each dictionary entry style has pros and cons in terms of usability; the important
thing is to make a decision, stick to it and get on with the job!
Next come meanings, and if a headword has more than one related mean-
ing (or sense) these are generally numbered sequentially in the entry (see
Figure 7.3). Superscript numbers are usually used to distinguish homonyms
(words that sound identical but their meanings are not related). The issue of
deciding whether words are homonyms or one word with related meanings is not
always straightforward, even in a monolingual situation. For example, is “[siŋk]”,
a thing in the kitchen, related in meaning to “a low area or hole”? Further prob-
lems arise in bilingual dictionaries, where the division of meanings into distinct
senses and the ordering of these senses may reflect polysemy found in either the
source language or the target language (see Evans, 1997; Goddard & Thieberger,
1997). As a starting point it is useful to assume homophony unless there is strong
evidence for a semantic relationship.
170
Figure 7.4
Definitions for some of the words derived from daa’ ‘horn’ or ‘antler’ in Koyukon
daa’ /horn/
-daa’, daa (n., inc., cmp,: Ø) horn, antler, especially referring to the whole antler structure
-daa’ leɬ (n.; de) velvet covering moose antlers from May to August, lit. ‘horn skin’
-daadeeken (n,; Ø) base of the horns or antlers at their point of eruption from the skull;
antennae or ‘feelers’ of insect
-daaloghuze, -daahus (n.; Ø) point of the tine of a moose or caribou antler, lit. ‘thorns at the
end of the horns’
17
Figure 7.5
The dictionary definition of the word ‘SMI’ in Kalam
SMI [símí˖], n. The all-night dance-festival when taro has been harvested. The smi is the
which a man (the b smi or smi nop) and his major ceremonial institution of Kalam society.
co-resident extended family may host as the It involves extensive economic preparation
climax of the ceremonial activity in which in planting gardens, building up pig stock
youths and, later, girls have their nasal and assembling gift valuables and personal
septa pierced (compare miuk puŋi-), pigs ornaments; the building of special dance
are killed and cooked, and pork, axes and houses (smi kopt); the propitiation of dead kin
shell valuables are given to affinal groups. (cp kawnan) and nature demons (kceki); the
The dance is also attended by performers fulfillment, through gifts, of the obligations
and spectators who are not kin of the hosts of kinship and affinity (compare tusmen); and
and who do not stay to receive gifts and as a measure of the prosperity, prestige and
participate in the feasting on the following political influence of the host group and its
day. Large quantities of taro are required to leader.
entertain guests, and the festivals are held
from late July to early December,
172
Another example is from the Teop encyclopaedic dictionary of marine life and fish-
ing (Mosel, 2017). It has over a thousand entries and is an example of a dictionary that
is focused on a single semantic domain. It includes line drawings and photos as well as
many bilingual texts. An example taken from the entry for bakubaku ‘shark’ is below:
Bero a maamihu kaku bakubaku. A meha a beera sana to aniani aba nana. Amaa meha
saka beera vira haari. Amaa meha amaa vaamanae, eara o aba to ani rae. A peha bakubaku
na tei me nana bono buaku totoka o kekeetoo. O buaku matana, o meho buaku komanae, o
meho buaku paana vihinae, o peho hee toon nae.
There are many kinds of shark. There is a very big one that eats human beings. Others are not
very big. The young ones, it is us, the human beings, who eat them. A shark has seven fins.
Two are in the front, two others at its belly, another two under its tail, and one on its back.
(Source: Sii 46W 145–150)10
example in a dictionary is short and natural and shows how the word is used in a
sentence. As well as drawing on examples from texts of spontaneous speech, asking
a speaker to make up a sentence to demonstrate how a word is used can also be a
useful approach. Be aware, however, that you might end up with some sentences
that would never actually be said in everyday interactions!
It is best to have some basic vocabulary and grammar under your belt and famili-
arity with the cultural context before seeking illustrative examples. It is a good idea
to have one or two illustrative examples in the language (perhaps from your earlier
recordings) which you can use to model what you would like in the elicitation ses-
sion. You should also prepare some possible scenarios in which the word might be
used beforehand to assist speakers if they appear stuck. In some cases, speakers can
compose more interesting illustrative examples when working in pairs or in groups,
and this approach can also be much more fun. Remember that the most interesting
vocabulary often emerges in contexts where speakers themselves choose what to
say, and where free-flowing conversations about the meanings of words can lead in
unexpected but often illuminating directions:
By working in the native language, taking full advantage of the metalinguistic capacity
of the language and its speakers, we can create a snowball effect whereby speakers’ com-
mentaries reveal other features of the language, as well as translating into words their
judgements of typical and proper usage.
(Hanks, 2015, p. 34)
Figure 7.6
Part of the finder list entry for ‘uncle’ in the Central & Eastern Anmatyerr to English Dictionary
Source: Green, 2010, p. 719
uncle
Technology has come a long way since the first dictionaries were made using a pile
of cards, a headword on each, and sorted alphabetically on the living-room floor.
In this section we consider the pros and cons of the currently available dictionary-
making tools and interfaces.
Figure 7.7
Example of the MDF codes used in an entry from the Gurindji to English Dictionary
sentence in the online version of the dictionary. The reference (\rf) means the
sentence is ‘discoverable’ in the Gurindji corpus. The word kurlarnimpa is also cross-
referenced (\cf) to the entry for kurlarra ‘south’ which has all of the 24 inflected
forms of ‘south’ included as sub-entries to show the paradigm for this cardinal
term. The paradigms for each of the four cardinal directions are also included in
the front matter of the dictionary.
Toolbox is a free data management and analysis tool developed by SIL International
for field linguists. It is especially useful for maintaining lexical data, and for parsing
and interlinearising texts, but it can be used to manage virtually any kind of data.
For ease of use, the Toolbox package includes prepared database definitions for a
typical dictionary and text corpus. Toolbox offers powerful functionality, including
customised sorting, multiple views of the same database, a browse view to show
data in tabular form, and filtering to show subsets of a database. It can deal with any
number of scripts in the same database. While Unicode is preferred, Toolbox can
handle scripts in most legacy encoding systems. Toolbox also has powerful linguis-
tic functionality. It includes a morphological parser that can handle almost all types
of morphophonemic processes. It has a word formula component that allows the
linguist to describe all the possible affix patterns that occur in words. It has a user-
definable interlinear text generation system which uses the morphological parser
and lexicon to generate annotated text. Interlinear text can be exported in a form
suitable for use in linguistic papers. Toolbox has export capabilities that can be used
to produce a publishable dictionary from a dictionary database.
Lexique Pro,14 developed by SIL International, is a free lexicon viewer for dic-
tionaries. It interfaces with Toolbox (or a formatted text file using MDF fields) to
produce the dictionary and finder list. It can be used to view and edit an existing
Toolbox database (or text file), and then finally export the dictionary to print or
web-based formats (Guérin & Lacrampe, 2007). Audio and images can be included.
It can be configured to display your Toolbox database in a user-friendly format so
that you can distribute it to others. On the down side, note that it can only be used
on Macs in a virtual PC environment.
FLEx (Fieldwork Language Explorer)15 is another tool developed by SIL
International to help field linguists with elicitation, dictionary creation and publica-
tion, and interlinearising text. It also only works in the Windows environment and
on Linux. As the dictionary grows, FLEx suggests morphological parsing. FLEx also
allows multiple users in different locations to contribute to the one dictionary data-
base through its Send/Receive function. Figure 7.8 shows a headword ájiita from the
Iquito dictionary (Zaparoan, Peru) in FLEx. The headword ájiita is a root (Spanish
raiz), and it can be seen that the citation form is ajiráani. There are two related words,
ajiraákuma and ajiratíini, as well as a derivational root, ajíra. There are reversal
forms in both Spanish and English, picar and pierce, that lead back to the headword
ájiita. Definitions of the various senses of this word follow (not shown here).
Two other off- the-shelf dictionary- making software packages are Miromaa
Community Dictionary Maker16 designed for Australian Indigenous languages
and TLex (aka TshwaneLex), which has been adopted by some publishers, govern-
ment organisations and individuals.17
Figure 7.8
An example from the Iquito dictionary in FLEx, showing the headword information (citation forms,
related forms, morphological type) and the beginning of the next section on meanings where reversal
fields are located (Illustration: Maxine Addinsall)
Source: Michael & Beier (in press)
Entry
Lexeme Form Iqu
ájiita
Morph Type raiz
RelatedForms Iqu
ajiratíini (ajirátii)
Spn
Eng stab, pierce, mash, or peck repeatedly
Sense 1
Gloss Spn picar
Eng pierce
Reversal Spn picar hincar
Eng pierce stab
Entries
options for digital display as well as options for exporting print-formatted layouts.
SIL International has a growing catalogue of free online dictionaries.18 Some of
these contain sophisticated search functions that allow a user to search the diction-
ary based on words in either language or by semantic domains or by parts of
speech. Some, such as the Lakota to English Dictionary, come with language learn-
ing materials that can be downloaded and purchased online.19 Another example of
an online dictionary is the Tahitian dictionary, a bilingual dictionary searchable by
either Tahitian or French (see Greenhill & Clark, 2011).20
Line illustrations or photos can be easily displayed in online dictionaries for elu-
cidating the meanings of words or concepts that do not have ready equivalents in
another language. For example, in the online version of the Archi-Russian-English
Dictionary (a Lezgic language spoken by about 1,200 people in Daghestan), a pho-
tograph is used to illustrate the meaning of t’uq’ˤ, a ‘stone post inside an under-
ground sheepfold which supports the stone roof ’ (Figure 7.9).21 The dictionary also
includes audio links to inflected forms of the headword.
Kirrkirr is a program written in Java that displays a lexical database in innovative
ways, such as showing semantic networks of words. It also provides useful tools for
179
Figure 7.9
An entry from the dictionary of Archi
Source: Chumakina, et al., 2007
non-technical end users. Kirrkirr has been used to display dictionaries of languages
such as Nahuatl, an Indigenous language of Mexico; Biao Min, a language from
southern China; as well as a number of Australian languages and even English. The
lexical database needs to be in xml format and it requires a schema matching the
dictionary fields to the Kirrkirr concepts (Manning, Jansz, & Indurkhya, 2001).22
A well-structured dictionary text file can also be displayed in a web browser,
which can be locally hosted on a computer or on the web. In this format it is pos-
sible to update the display ‘live’, that is, as the database is changed. Some online
dictionaries also use a ‘fuzzy search’ function whereby a word can be located even
if it is misspelled by the user. The Yolŋu Matha Dictionary enables the dictionary
user to spell out words using a set of filters that predict some of the orthographic
symbols that represent sounds that may be misheard by some users, for example the
difference between a retroflex or non-retroflex consonant, and whether or not some
consonants are voiced or not.23
There are also now a large number of mobile phone dictionary apps in Indigenous
languages, such as Tusaalanga, a dictionary of the Canadian language Inuktitut, and
My Cree, which is aimed at Native youth wanting to learn how to speak Plains Cree.
Design issues for mobile phone dictionary and learners apps include dealing with large
file sizes that may not download or stream well, and finding ways to avoid cluttered
screens on small devices.24
180
• Definition –ideally you would ask your consultant for his/her definition
in the language concerned
• Gloss –simple one word/phrase English or other target language equivalents
• Grammatical information (e.g. if it is a noun, what classifier is used with it)
• Pragmatic information (e.g. what register it belongs to)
• Encyclopaedic information
• An illustrative sentence. It is best to use sentences a speaker has uttered
(and approved); constructed examples are to be avoided
• Divisions into senses and sub-entries
• Semantic domain codes for each of the senses
• Reversal codes for the headword.
7.8 SUMMARY
For descriptions of semantic fieldwork techniques see Majid (2012). You can find
many tools for semantic elicitation in the field at the Max Plank Institute’s Language
and Cognition Field Manuals and Stimulus materials site.25 For more detailed infor-
mation on the role of biological expertise in linguistic fieldwork see Conn (2012)
and McClatchey (2012) and for astronomy see Holbrook (2012). Discussions of
lexicography in Indigenous languages of Australia and the Pacific are found in
Goddard & Thieberger (1997) and Thieberger (2015). Coward & Grimes (2000a)
discuss techniques for dictionary making and further information about tools used
for lexicography can be found on the SIL International website.26
18
NOTES
REFERENCES
Corris, M., Manning, C., Poetsch, S., & Simpson, J. (2004). How useful and usable are
dictionaries for speakers of Australian Indigenous languages? International Journal of
Lexicography, 17(1), 33–68.
Coward, D., & Grimes, C. (2000). Making dictionaries: A guide to lexicography and the multi-
dictionary formatter. Waxhaw, North Carolina: SIL International. [Link]
books_400/N_[Link]
Dingemanse, M. (2015). Folk definitions in linguistic fieldwork. In J. Essegbey,
B. Henderson, & F. McLaughlin (Eds.), Culture and language use (pp. 215– 238).
Amsterdam: John Benjamins.
Evans, N. (1997). Sign metonymies and the problem of flora-fauna polysemy in Australian
linguistics. In D. Tryon & M. Walsh (Eds.), Boundary rider: Essays in honour of Geoffrey
O’Grady (pp. 133–153). Canberra: Pacific Linguistics.
Goddard, C., & Thieberger, N. (1997). Lexicographic research on Australian Aboriginal lan-
guages. In D. Tryon & M. Walsh (Eds.), Boundary rider: Essays in honour of Geoffrey
O’Grady (pp. 175–208). Canberra: Pacific Linguistics.
Green, J. (2003). Central Anmatyerr picture dictionary. Alice Springs: IAD Press.
Green, J. (2010). Central & Eastern Anmatyerr to English dictionary. Alice Springs:
IAD Press.
Greenhill, S. & Clark, R. (2011). POLLEX-Online: The Polynesian Lexicon Project Online.
Oceanic Linguistics, 50(2), 551–559.
Guérin, V., & Lacrampe, S. (2007). Lexique Pro. Technology Review, 1(2), 291–300.
Hanks, W. F. (2015). The Space of translation. In C. Severi & W. F. Hanks (Eds.),
Translating worlds: The epistemological space of translation (pp. 21–49). Chicago,
IL: Hau Books.
Hellwig, B. (2006). Field semantics and grammar-writing: Stimuli-based techniques and the
study of locative verbs. In F. Ameka, A. Dench, & N. Evans (Eds.), Catching language: The
standing challenge of grammar writing (pp. 321–358). Berlin: Mouton de Gruyter.
Hoenigman, D. (2012). From mountain talk to hidden talk: Continuity and change in Awiakay
registers. Honolulu, HI: University of Hawai‘i Press.
Holbrook, J. (2012). Cultural astronomy for linguists. In N. Thieberger (Ed.), The
Oxford handbook of linguistic fieldwork (pp. 345–367). Oxford: Oxford University
Press.
Holton, G. (2012). Language archives: They’re not just for linguists any more. In F. Seifart,
G. Haig, N. Himmelmann, D. Jung, A. Margetts, and P. Trilsbeek (Eds.), Potentials of
language documentation: Methods, analyses, and utilization (pp. 111–117). Honolulu,
HI: University of Hawai‘i Press.
Jetté, J., & Jones, E. (2000). Koyukon Athabaskan dictionary. Fairbanks, AK: Alaska Native
Language Center.
Kroskrity, P. V. (2015). Designing a dictionary for an endangered language community:
Lexicographical deliberations, language ideological clarifications. Language Documentation
& Conservation, 9, 140–157.
Laughren, M., Hale, K., & the Warlpiri Lexicography Group (1987). Warlpiri–English diction-
ary: Body-part section. Lexicon Project Working Paper 6. Cambridge, MA: MIT Centre
for Cognitive Science.
Levinson, S. (2006). Parts of the body in Yélî Dnye, the Papuan language of Rossel Island.
Language Sciences, 28, 221–240.
Majid, A. (2012). A guide to stimulus-based elicitation for semantic categories. In N.
Thieberger (Ed.), The Oxford handbook of linguistic fieldwork (pp. 54–71). Oxford: Oxford
University Press.
183
Majid, A., Enfield, N., & van Staden, M. (2006). Cross-linguistic categorisation of the
body: Introduction. Language Sciences, 28(2–3), 137–147.
Manning, C. D., Jansz, K., & Indurkhya, N. (2001). Kirrkirr: Software for browsing and visual
exploration of a structured Warlpiri dictionary. Literary and Linguistic Computing, 16(2),
135–151.
Matisoff, J. (1978). Variational semantics in Tibeto-Burman: The “organic” approach to linguis-
tic comparison. Philadelphia, PA: Institute for the Study of Human Issues.
McClatchey, W. (2012). Ethnobiology: Basic methods for documenting biological knowledge
represented in languages. In N. Thieberger (Ed.), The Oxford handbook of linguistic field-
work (pp. 281–297). Oxford: Oxford University Press.
Meakins, F., McConvell, P., Charola, E., McNair, N., McNair, H., & Campbell, L. (2013).
Gurindji to English dictionary. Batchelor, Australia: Batchelor Press.
Menning, K., & Nash, D. (Eds.). (1981). Source book for Central Australian languages. Alice
Springs: IAD Press.
Michael, L., & Beier, C. (In press). A dictionary of Iquito.
Mosel, U. (2004). Dictionary making in endangered speech communities. In P. Austin (Ed.),
Language Documentation and Description (pp. 39–54). London: SOAS.
Mosel, U. (2011). Lexicography in endangered language communities. In P. K. Austin &
J. Sallabank (Eds.), The Cambridge handbook of endangered languages (pp. 337–353).
Cambridge: Cambridge University Press.
Mosel, U. (2017). Teop encyclopedic dictionary of marine life and fishing. Dictionaria, (2),
1–1014.
Narasimhan, B., Kopecka, A., Bowerman, M., Gullberg, M., & Majid, A. (2012). Putting
and taking events: A crosslinguistic perspective. In A. Kopecka & M. Bowerman (Eds.),
Events of putting and taking: A crosslinguistic perspective (pp. 1–18). Amsterdam: John
Benjamins.
Pawley, A. (1992). Kalam Pandanus language: An old New Guinea experiment in language
engineering. In T. Dutton, M. Ross, & D. Tryon (Eds.), The language game: Papers in
memory of Donald C. Laycock (pp. 313–334). Canberra: Pacific Linguistics.
Pawley, A. (2012). What does it take to make an ethnographic dictionary? On the treat-
ment of fish and tree names in dictionaries of Oceanic languages. In G. Haig, N. Nau,
S. Schnell, & C. Wegener (Eds.), Documenting endangered languages: Achievements and
perspectives (pp. 263–287). Berlin: De Gruyter Mouton.
Pawley, A. (2015). Review of ‘Nicholas Thieberger (Ed). 2012. The Oxford handbook of lin-
guistic field-work’. Language Documentation & Conservation, 9, 134–139.
Pawley, A., & Bulmer, R. (2011). A dictionary of Kalam with ethnographic notes.
Canberra: Pacific Linguistics.
Sutton, P., & Walsh, M. (1987). Wordlist for Australian languages. Canberra: Australian
Institute of Aboriginal Studies.
Swadesh, M. (1950). Salish internal relationships. International Journal of American
Linguistics, 16, 157–167.
Thieberger, N. (2015). The lexicography of Indigenous languages in Australia and the Pacific.
In International handbook of modern lexis and lexicography (pp. 1–16). New York,
NY: Springer.
Turpin, M., Ross, A., Dobson, V., & Turner, M. K. (2013). The spotted nightjar calls when
dingo pups are born: Ecological and social indicators in central Australia. Journal of
Ethnobiology, 33(1), 7–32.
Turpin, M., & Si, A. (2017). Edible insect larvae in Kaytetye: Their nomenclature and signifi-
cance. Journal of Ethnobiology, 37(1), 120–140.
184
Voegelin, C. F., & Voegelin, F. M. (1971). The autonomy of linguistics and the dependence
of cognitive culture. In J. Sawyer (Ed.), Studies in American Indian Languages (pp. 303–
317). Berkeley, CA: University of California Press.
Wilkins, D. P. (1997). Handsigns and hyperpolysemy: Exploring the cultural foundations of
semantic association. In D. Tryon and M. Walsh (Eds.), Boundary rider: Essays in honour
of Geoffrey O’Grady (pp. 413–444). Canberra: Pacific Linguistics.
Yen, A. L. (2015). Conservation of Lepidoptera used as human food and medicine. Current
Opinion in Insect Science, 12, 102–108.
Young, R. W., & Morgan, W. (1994). Colloquial Navaho. A dictionary. New York,
NY: Hippocrene Books.
185
8.1 INTRODUCTION
Taking a broad view of language that encompasses more than just spoken language
opens up many fascinating possibilities for fieldwork. Sign languages are the main
medium of communication for some and documentation and analysis of the many
sign languages of the world is a dedicated field of expertise. Linguistic research
in the mid-twentieth century validated natural sign languages as fully-fledged lan-
guages in their own right. Gesture is ubiquitous andappears to occur whenever
hearing individuals communicate in face-to-face spoken interactions (Kendon,
2004a). As well as speaking, people may point, make quick maps and diagrams, and
interact with anything in reach, including small objects or props, and visual displays
such as computer screens or whiteboards. There seems to be no culture in the world
that does not employ forms of bodily action alongside speech. People even gesture
when they are talking on the telephone and their interlocutor is not in sight! Some
argue that using research methods that include gesture as a matter of course is good
practice, not only if you are specifically interested in gesture, but for what gesture
might reveal about speech itself (Seyfeddinipur, 2012). Other research questions are
focused on understanding more about the common ground that sign and gesture
share and what this can show about language in general.
To begin to understand what you might miss out on by not considering communi-
cative actions that either accompany speech or replace it, imagine that you are reading
a transcript of the spoken part of an interaction or of a narrative. Not only are expres-
sive elements of speech, such as intonation and prosody (§5.1.8), routinely left out of
such transcripts, but there are other gaps in the information needed to fully make sense
of what is being conveyed. The text reads, “They went over there”. But where, exactly, is
there? She said, “The river was about this wide”. But how wide? In these simple exam-
ples it is likely that actions that go with speech provide the missing link. While some
types of words are well-known gesture-attractors, a close look at human communica-
tion shows that gesture and speech work together in all sorts of other interesting ways.
Pointing tends to be closely melded to particular types of speech –for example Wilkins
(2003) has shown how in Arrernte (Pama-Nyungan, Australia) demonstratives (words
like ‘here’, ‘there’, ‘this’, ‘that’) go together with pointing gestures that are accurately
anchored to cardinal directions and form what he calls “composite demonstrative sig-
nals” (see also §6.7.5). This is one example of how an utterance may be incomplete
without considering the ways that speech and action work together.
There is some overlap in the research methodologies that can be used for both sign
language and gesture. One reason for this is because both sign and gesture share the
DOI: 10.4324/9780203701294-8
This chapter has been made available under a CC-BY-NC-ND license.
186
same kinesic/visual modality. Another comes down to the choices of language docu-
mentation tools towards which both sign language and gesture researchers gravitate.
This chapter gives some background on studies of sign languages (§8.2) and gesture
(§8.3) before discussing some of the research questions that arise in these fields of
research (§8.4). We discuss particular methodological considerations for working
with sign and gesture (§8.5), elicitation and filming techniques, and some of the tools
that are used for annotation (§8.6 and §8.7), and then briefly look at various forms
of representation of sign languages and gesture for theses and other publications
(§8.7.3). As this chapter can only hope to touch the surface, at the end we direct the
reader to some publications that deal in more detail with the many issues involved.
Sign language dictionaries are briefly discussed at the end of the chapter (§8.7.4).
There are many sign languages, including sign languages used by those who are
deaf –which are sometimes referred to as primary or natural sign languages; and
alternate or secondary sign languages, which may replace speech in some circum-
stances. There are also some artificial sign systems that encode speech in manual
form (see Pfau, 2012).1 All of these vary in terms of lexical and grammatical com-
plexity. Estimates of the exact number of sign languages vary, but most suggest that
the number is well over a hundred.2
While fully-fledged sign languages are the primary mode of communication
for the deaf, other types of sign languages are used in particular cultural contexts,
and these tend to have been developed and used by hearing people. For example,
in Central and northern Australia, Indigenous sign languages are used alongside
speech, gesture and other semiotic systems such as sand drawing. Sign may be used
by hearing interlocutors when they are in view, but out of earshot; and sign is used,
particularly by women, as a mark of respect when they are bereaved. Sign is also
used in some ceremonial contexts. Although some signs may be in everyday use
across the communities and learnt in early childhood, more complex repertoires
are learnt later in life. These sign languages have been described as ‘alternate’ as
they are not generally the main mode of communication in these communities,
but rather used instead of speech in particular cultural circumstances (Bauer, 2014;
Green & Wilkins, 2014; Kendon, 1988; Maypilama & Adone, 2013). Another exam-
ple is found in North America where a conventionalised sign language emerged
as a form of communication between First Nations communities that spoke many
different languages (Davis, 2015; Farnell, 2009).
Shared sign languages are those that are used by both hearing and deaf members
of a community. If a community is geographically isolated and has a large popula-
tion of people who cannot hear, a common sign language that everyone learns may
develop (Pfau, 2012). One of the most well-known examples of this was Martha’s
Vineyard Sign Language, once widely used on the island of Martha’s Vineyard off
the coast of Massachusetts in the United States. Other gestural communication
systems have arisen in cloistered religious communities where speech is forbidden
187
(Quay, 2015), and yet others in workplaces, such as sawmills and fish markets, where
speech is not practical because of noise or other environmental factors that get in
the way of audible communication. Another system, known as homesign, may
develop in a family context as a means of communication between deaf children
and their immediate family network. Some research has shown how sign languages
may emerge and develop once such children get together and form a cohort, typi-
cally in an educational context (Goldin-Meadow, 2005; Kegl, Senghas, & Coppola,
1999; Senghas, Kita & Özyürek, 2004).
Some recent research has focused on sign languages used in a range of non-
Western contexts, including those found in small-scale rural or ‘village’ communi-
ties (de Vos, 2016; de Vos & Pfau, 2015; Jepsen et al., 2015; Nyst, 2007; Zeshan &
Sagara, 2016; Zeshan & de Vos, 2012). The sociolinguistic profile of these village sign
languages varies when compared to sign languages that have developed in industri-
alised societies, where sign languages often evolved in the context of schools for the
deaf (Nyst, 2015, p. 108). In spite of recognition of the diversity and wealth of sign
languages many are highly endangered –for deaf sign languages this may be driven
by changes in technology, such as the availability of cochlear implants, and subse-
quent changes in the socio-cultural contexts of sign language use. In some com-
munities, more prestigious sign languages may overshadow lesser known local ones
(Nyst, 2015, p. 109). When it comes to ‘alternate’ sign languages, even though sign
knowledge may be strong in some domains, it may be that some of the cultural prac-
tices that defined their use are less prevalent than they used to be. Although more
elaborate knowledge of sign may be on the wane, new signs emerge to reflect the
communicative needs of changing social circumstances.
8.3 GESTURE
Interest in gesture dates back to classical antiquity, when gesture was recognised
as an important component of the art of rhetoric (see Kendon, 2004a, Chapter 3).
While most studies of gesture have focused on manual gestures, others consider the
role of movements of the head and torso, facial expressions, movements of the lips,
the eyes and so on. Whereas an element of a sign language can be described as “a
relatively stable, identifiable visual-gestural act with an associated meaning which is
reproduced with consistency by native signers” (Johnston & Schembri, 1999, p. 117)
gestures may be one-off and idiosyncratic, varying from one communicative event
to another or between their users. Even so, some types of gestures may be conven-
tionalised, and so have fairly consistent interpretations in a particular culture over
repeated occurrences (see the example of gestures from South Africa below). There
are also significant cross-cultural differences as to the way that various actions are
understood, and even as to what types of actions are socially acceptable. It remains
the case that much of what we know about gesture comes from laboratory-based
studies. Studies of gesture used in small-scale language communities are not so
common (but see, for example, Haviland, 2003; Le Guen, 2012; Wilkins, 2003).
There is certainly plenty of useful and exciting work to be done in the field!
18
The question of what counts as ‘sign’ and what is ‘gesture’ is not a trivial one.
Some have envisaged the range of visible human communicative actions as being
on a continuum, with gesture at one end, and primary sign languages at the other
(McNeill, 1992). However, as the term ‘gesture’ means different things in different
contexts, and because this term is difficult to nail down with precision some prefer
to not even use it at all. As Adam Kendon (2004b) has written, it is important to rec-
ognise “that ‘gesture’ is a term that covers a multitude of diverse activities” (2004b,
pp. 98, 99). Some suggest that a ‘gesture’ must be deliberately expressive, which dis-
tinguishes gestures from other actions such as adjusting one’s clothing, scratching
the head, or jiggling the feet (Kendon, 2004a). Others discuss the degree to which
various types of gesture that accompany speech involve the conscious intention of
the speaker, or the communicator (see McNeill, 2015). Some consider that ‘gesture’,
and in particular pointing, is also an important part of sign languages (Cormier,
Schembri, & Woll, 2013; Johnston, 2014, p. 34).
One type of gesture is referred to as a symbolic gesture, an emblem or a quot-
able gesture, and these can be used independently of speech and tend to have stable
meanings in a community (Kendon, 2004a, pp. 335–344).
Various attempts have been made to come up with other ways to classify different
types of gestures (see Kendon, 2004a, Chapter 6 for an overview). Some terms that
you may come across in the literature referring to co-speech gesture include iconic,
metaphoric, deictic and beat (McNeill, 1992). Iconic gestures are those that are
closely related to the meaning content of co-occurring speech (for example if some-
one is describing how to turn on a tap, and they make a turning action with their
189
hand); metaphoric gestures refer to more abstract ideas or categories (for exam-
ple, spoken space-time metaphors may be accompanied by gestures to the front,
behind or to either side of the speaker). Beats have no propositional content but
rather serve discourse functions as the hand moves “with the rhythmic pulsation of
speech” (McNeill, 1992, p. 15).
Deictic or pointing gestures are highly context-dependent and usually associ-
ated with movements of the finger or hand, although directed nodding of the head,
shifts of the torso, eye-gaze and lip-pointing may all be found (Enfield, 2009).
People often augment manual pointing actions by using material objects –sticks,
electronic ‘pointers’, computer cursors and so on. Pointing gestures can refer to
objects in reach or to those that are further away, to abstract entities, such as ref-
erents in discourse, or to objects in virtual spaces. Some scholars are critical of
attempts to draw hard and fast distinctions between types of gesture by showing
that many of these semiotic properties can be combined. Thus various combina-
tions of iconic, metaphoric, deictic and beat-like properties of an action may be
found together.
There are many compelling reasons to study sign languages. Research on sign lan-
guages contributes to explorations of many aspects of linguistic theory, and this is
evident in debates focusing on the nature of the human language capacity, its rela-
tionship to other aspects of communication and cognition, and questions such as
whether or not the diverse range of languages we see in the world today share some
universal patterns of organisation (Evans & Levinson, 2009; Pinker, 1994). One of
the key questions that research on sign languages can help answer is whether or
not language universals are modality-specific (de Vos & Pfau, 2015; Palfreyman,
Sagara, & Zeshan, 2015, p. 187).
Other research questions include understanding how sign languages emerge
and change over time, and how children acquire sign (and how these processes
compare with the spoken modality). Some investigate the nature of sign lan-
guage contact with other signed languages or with other communicative sys-
tems, and the ways signers code-switch and vary their use of sign repertoires in
particular sociolinguistic circumstances (Palfreyman, 2016). Psycholinguistic
approaches to understanding how sign is produced and processed is another field
of inquiry. Other researchers are trying to come to grips with questions such as
how many sign language families there are in the world (Pfau & Zeshan, 2016).
Descriptions of non-Western sign languages further our understandings of sign
language typology, and this has implications for the field of language typology
in general (Le Guen, 2012, p. 244). Some research compares actions used by
signers and those used by non-signers. This may illuminate the ways that ges-
ture may influence the development of sign language lexicons and the interplay
between the spontaneous actions that accompany speech and the role of gesture
in fully-fledged sign languages (Cormier et al., 2012). Sign language research
190
can also have important practical outcomes, assisting in the creation of resources,
such as dictionaries, that help people learn and maintain sign languages, as well
as raising general community awareness of sign (§8.7.4).
Looking at gesture and its relationship to speech provides an opportunity to get
under the ‘cognitive hood’ and explore many aspects of language and thought (for
an overview of some experimental studies see Goldin-Meadow & Alibali, 2013).
Some studies have investigated the correlation between the grammatical structures
and semantic categories of a language, and the types of co-speech gestures pro-
duced by speakers of those languages. For example, by comparing the gestures of
English, Turkish and Japanese speakers, Kita & Özyürek (2003) concluded that the
semantic categories of a language do have an influence on what is displayed in ges-
ture (see also Akhavan, Nozari, & Göksun, 2017).
8.5.5 Anonymity
When it comes to sign language and gesture research there are additional considera-
tions that may need to be addressed in formal ethics applications (§2.5). The first has
to do with the nature of the documentation and the use of video, which makes it virtu-
ally impossible to anonymise the research participants. In reference to video record-
ings of New Zealand Sign Language, McKee & McKee (2013, p. 518) have written that
“(v)ideo clips are embodied linguistic performance by individuals who have social
identities and styles that are immediately recognizable to their own language com-
munity”. In some communities the identity of people is also read from other cues –the
shape of their hands, characteristic scars or body markings, and clothing etc.
193
As is the case for spoken languages, a broad distinction can be made between
naturalistic methods, which attempt to find out how sign or gesture is used in
face-to-face interaction (§8.7.2), and elicitation and (semi-)experimental meth-
ods which are effective in targeting certain domains, and also in providing data
194
that is comparable across languages (§8.6.1). Not all of these methods will be
practical in the field. On the other hand, getting out of laboratory-based situ-
ations is important if knowledge of gesture and sign language diversity is to be
advanced. A discussion of some of these methods for sign languages is given in
Nyst (2015, pp. 115, 116).
most fruitful and rich recording sessions and perhaps some of the most interesting
theoretical insights happen when the unexpected occurs.
It may be preferable to use pictures and props rather than written words from a
spoken language, but again this depends of the levels of literacy in your research
team. It is likely that tasks that rely on some form of written literacy will yield very
different results from those that do not. Also be aware that some image-based stim-
uli may be difficult to use in signing communities, as the interpretation of images
may require forms of literacy that participants are not familiar with (Nyst, 2015,
p. 116). An elicitation set can be made by uploading pictures of local objects to a
laptop computer and presenting this as a slide show. If using pictures of people that
are locally known, make sure there are no gender-specific constraints about show-
ing photos in public (Padden, 2015, p. 144).
In §7.5 we discussed various non-linguistic stimuli that are used to explore
semantic and grammatical dimensions of languages. Many of these tasks, for
example director-matcher tasks and the use of video clips as stimuli, can be
used for sign language research. Signers may be asked to watch a video and then
describe it to another signer who then repeats the description (Padden, 2015,
p. 148). For example, an animated colour cartoon series called Canary Row,
which formed the basis of studies of gesture in the McNeill laboratory, has also
been used by sign language researchers (see Emmorey et al., 2008). The cartoon
stimulus is shown to a participant and they then have to recount the story to a ‘lis-
tener’ who has not seen the cartoon. The wordless picture story, Frog, Where Are
You? (Mayer 1994 [1969]), has also been used to elicit narratives in sign (see also
§[Link] and §[Link]). Elicitation can target particular domains, such as posses-
sive, reciprocal or locative constructions, or it can be used to explore sign order
in multi-sign utterances.
The Language and Cognition group at the Max Planck Institute for
Psycholinguistics in Nijmegen has designed a suite of prop-based tasks, including
the ‘Man and Tree’ game and the ‘Animals in a Row’ task (Levinson et al., 1992).
These have also been used in sign language research (see de Vos, 2012).4 If you
are interested in exploring the gestural repertoires of a speech community, choose
gesture-rich topics. Discussions about route finding, and procedural or ‘how to’
texts, such as talking about ways to cook a meal or fix a car, may all yield rich data
that has a lot of action in it (Seyfeddinipur, 2012).
Another application of this sort of methodology was used in a study of sand sto-
ries from Central Australia (Green, 2014). In sand drawings, a small set of graphic
symbols drawn on the ground have conventionalised meanings, but one research
question was to investigate the extent to which the graphic symbols alone would
have shared meanings to those who are familiar with the practice. To investigate
this, a ‘sand quiz’ was developed by selecting a set of films of the drawing action,
the bird’s-eye view in the video data (see §4.3.3), into three small movies for use as
an elicitation tool. The aim was to test the extent to which particular sand drawing
forms have meanings that are independent of context, and to thus provide insights
into how dynamic graphic information is understood by interlocutors. The use of
‘visual questionnaires’ to achieve layered interpretations of communicative events
is recognised by some as being an important technique in ethnographic data col-
lection. It also highlights the importance of participant involvement in determining
the meanings of different types of action.
presented a smooth mono-tonal surface even after it had been used repeatedly and
carried in the back of a truck for many thousands of kilometres. Figure 8.1 shows
a mobile studio (the blue curtain is inside the tent) that provided a comfortable
space for filming sign in remote conditions. Such methods may be adapted to suit
particular field circumstances, and even without the tent a portable length of fabric
may be useful. Remember to pack the gaffer tape!
Another consideration is the way that signers orient themselves to the camera.
Direct interaction of a signer with the camera as a default ‘interlocutor’ may be
advantageous for filming examples of sign for a website or online sign diction-
ary, where simulating eye contact with imagined online audiences is the aim, but
in other contexts direct continuous eye contact with the camera may seem very
unnatural. If you are interested in more natural interactions then it is much better
to film several people together, angled towards each other and at a slight angle to
the camera. If you have two cameras, then it is possible to arrange them so that each
films either the signer or the addressee. A top-down view is useful if you want to
know more precise details of sign articulation in relation to the body. The top-down
view can be filmed by attaching a camera to a tripod, a lighting stand (see §3.3.3) or
even improvised solutions such as strapping the camera to a plank tied to a garden
ladder. Keep in mind that you don’t need to have three cameras! Natural conversa-
tional interactions, either signed or spoken, that are filmed less laboriously with one
camera can also provide valuable data for linguistic analyses.
As discussed earlier, with all recordings it is important to keep note of the con-
text. Sketch maps or photographic records of community layouts, or if inside, of the
Figure 8.1
Documenting alternate sign languages in Central Australia with Anmatyerr women Clarrie Kemarr
and Eileen Perrwerl, and linguist Gail Woods. The blue curtain is hung inside the tent ‘studio’
(Photo: Jennifer Green 2012)
19
orientation and physical features of a room can be useful down the track and can
be added to your field notebooks (§3.3.4). If, for example, you are trying to find out
how space is used to encode time, then knowing whether or not a signer or gesturer
is pointing to external spaces (outside a room), or to a clock on the wall inside is a
crucial piece of information that may tip your analysis in one direction or the other.
The software ELAN is currently the tool of choice for many who work on sign lan-
guages and gesture (Crasborn & Sloetjes, 2008). One of the signature features of
ELAN is the way it allows users to design their own template for data annotation.
Each level in a template is called a tier, and in Chapter 4 we discussed a simple tem-
plate which had only two tiers (§4.3.2). Tiers have a range of attributes that can be
chosen when the tier is first designed. These include the hierarchical relationships
between tiers (what is the ‘parent’ tier and which tiers are dependent i.e. the ‘child’
tiers). It is also possible to add controlled vocabularies to the attributes of a tier and
thus limit the entry values of the annotations that can be added to a particular tier
in the time-line. So for example you may want to set up a controlled vocabulary that
defines values for grammatical categories, or the range of hand shapes that are used.
As discussed in (§4.3.3), choices have to be made about segmentation of the data
and this will influence the design of your template. Although it is possible to use a
simple template that only has one or two levels to annotate sign or gesture, many
projects allow for very detailed annotation of sign or gesture forms. It all comes
down to what the research questions are.
Another popular tool used for annotation of sign and gesture is ANVIL, origi-
nally developed for gesture research in 2000. It offers multi-layered annotation
based on a user-defined coding scheme. During coding, the user can see time-
aligned and colour-coded elements on multiple tracks. ANVIL can import data
from phonetic tools like Praat, and it can display wave forms and pitch contours.
A feature of ANVIL that is not offered by ELAN is that it supports 3D viewing of
motion capture data (Kipp, 2014; see Perniss, 2015, p. 69).
Figure 8.2
Phases of an action that accompanies speech. Ngaatjatjarra linguist Lizzie Marrkilyi Ellis works with her
mother Tjawina Porter, Alice Springs, Northern Territory, Australia, 2012 (Video stills: Jennifer Green)
Figure 8.3
An ELAN template that allows for annotation of the phases of an action, and for morphological
glossing of the co-occurring speech
201
The template has tiers for speech, for segmentation at the word level, for morpho-
logical glosses, and for the action that accompanies the speech. It shows clearly that
the action and speech co-occur, and overlap in interesting ways.
If your research question is about the relationship between speech and the
actions that accompany it you may want to include various gesture types as a
controlled vocabulary in your ELAN template, even as you keep in mind the
fact that it is at times difficult to be categorical about identification of the types
(§8.3). As seen in Figure 8.4 these will then appear as a drop-down menu
item when you go to add a value to the selected annotation. Using controlled
vocabularies in ELAN is one way to eliminate variation that is caused by spell-
ing things badly, and of course this makes searches for annotation values more
efficient.
An example of a more complex sign language ELAN template is the Auslan
(Australian Sign Language) one, which has over 40 tiers. This template includes
Figure 8.4
A screenshot from ELAN, showing the controlled vocabulary option which enables you to set the
values for an annotation
20
separate tiers for the right and left hands; tiers for aspects of sign action such as
handshape, orientation, location and movement; and tiers for non-manual aspects
of signing such as mouthings and movements of the face and brows (Johnston,
2014). Controlled vocabularies in ELAN may be used to show whether an action
is a lexical sign, a depicting sign, an instance of pointing and so on. For sign lan-
guages, sign ID glosses, free translations, grammatical tags and other features of the
sign action and function may be added in. Sign ID glosses are usually keywords
in a spoken language that partially reflect the meaning of the sign (Johnston, 2010,
p. 123). It is important to note that they are not full translations of the sign’s mean-
ing, but rather unique and succinct identifiers of the sign (§7.6.2). The convention
in sign language studies is to represent sign ID glosses in capital letters.
Figure 8.5 is a screenshot taken from an ELAN annotation file for a video
of NGT (Sign Language of the Netherlands) (Crasborn et al., 2004).7 You can
see that at least three cameras have been used, and the images that are visible
in Figure 8.5 (from CAM 2 and CAM 3) give a distant and closer view of the
signer. In the right hand panel of the frame you can see a series of sign ID glosses
(in Dutch) that have been added to the annotation tier. These include hond
‘dog’, slager ‘butcher’, vlees ‘meat’ and bot ‘bone’. Sign ID glosses are also vis-
ible in the timeline viewer. Translations of the signed utterance are shown in
both Dutch and English. ‘Een hond liep op zijn gemak …’ ‘A dog was walking
around …’. Although only part of the template is visible, it has 19 tiers, including
those for translations of the Dutch ID glosses into English, a tier for mouth
Figure 8.5
Representation of a screenshot from an ELAN annotation file, showing sign ID glosses in Dutch
(Illustration: Maxine Addinsall)
203
actions, movements of the eyes and brows and many more. Compare this to the
template illustrated in §4.3.1.
If there are a lot of tiers in your ELAN template and they get in the way they can
be hidden from view to give you more computer screen space to work with (simply
right click on the tiers and choose which ones you want to either hide or reveal).
The tiers can be brought out again when needed. It is better in the long run to do
this rather than being tempted to make a simpler template, especially if you are
working in a group that shares a template, and you have a long-term aim of making
a searchable corpus. Depending on the level of detail you want to go in to, annota-
tion of sign and gesture can be very time-consuming. One way to lighten the load is
to work in teams (see also §8.5.1).
Figure 8.6
The Anmatyerr sign mother (illustration: Jenny Taylor)
205
Figure 8.7
Pompey Raymond demonstrates the sign for lukarrarra (Portulaca filifolia), Elliott, Northern Territory,
2017 (Photo: Jennifer Green, Video: Felicity Meakins)
For example |……| shows that it occurred on the ground (drawing); and
|^^^^| in the air (gesture and sign). The convention |====| indicates
erasure of the drawing space. Figure 8.9 shows part of the action of the
Anmatyerr sign husband/wife (see also Green, 2014, pp. 96–98).
Figure 8.8
A schematic representation of graphic units drawn on the ground: (a) the first sequence; (b) the
second sequence
1|…………| 2|……..|
(01) Arelh lyw-el an-ek
woman shade-LOC sit-PST
3|………| 4|……………..|
(02) apanth-akert anem-akert thwen
dish-PROP [Link]-PROP [Link]
|=================================|
(04) Kwer-penh ra [Link]-
ek
3sgDAT-SEQ 3sgNOM [Link]-PST
1|…………….| 2|……………….|
(05) Rwanenty arrpenh-warn anem alh-ek
[Link] another-ALL then go-PST
3|………|
(06) Rwa amp-em
fire burn-PRS
208
A woman was sitting in the shade of a tree with her digging stick and dish. A
man, [her] husband, went past in a westerly direction. Then the woman got
up and went to another dinner camp. A fire is burning.
Figure 8.9
The Anmatyerr sign husband/wife (Illustration: Jenny Taylor)
Some technical issues with online dictionaries of sign –for example the need to
enable smooth display of sign videos in different online environments – are also
considerations for dictionaries of spoken languages that aim to include audio or
video, and for other multimedia language resources. As is the case with other dic-
tionary projects, a team working on a sign language dictionary needs to think care-
fully about the relationship between the dictionary and the underlying corpus on
which it is based. As discussed in the case study below, this includes issues such as
whether or not to create discrete clips for the dictionary, or whether to extract them
by time code from the primary archival files.
210
Figure 8.10
The Iltyem-iltyem sign language project workflow
A (iii) Compile
A (iv) Export
& edit
(ii) Transfer
(i) Record
video
B (iii) Create
B (iv) Transcribe B (vi) Build
media B (v) Export
& annotate website posts
respository
21
8.8 SUMMARY
NOTES
1 One convention is to capitalise the term Deaf when referring to communities of indi-
viduals who identify with aspects of Deaf culture, and to use lower case ‘d’ when speaking
solely about hearing loss. However, there is regional and personal variation as to how this
convention is applied.
2 Ethnologue lists 142 sign languages [Link]/subgroups/sign-language
Accessed 21 August 2017. ISO codes are either registered or under review for 107 of
these. See [Link]/standards/iso639/[Link] Accessed 21 August 2017.
3 [Link] Accessed 27 August 2017.
4 [Link] Accessed 31 August
2017.
5 See [Link] Accessed 27 August 2017.
6 Source: SIGN20121031-JP&EG-02-[Link] (02:34–02:40 min).
7 [Link]/sign-lang/projects/completed-projects/echo/ Accessed 20 September 2017.
8 [Link]/Accessed 5 December 2017.
9 [Link] Accessed 17 August 2017.
21
10 To view the film clip download a QR Code Reader to your mobile device.
11 [Link]/about/news/new-technology-supports-language-development-signed-
languages Accessed 27 August 2017.
12 [Link]-[Link] Accessed 22 August 2017.
13 [Link] Accessed 27 August 2017.
14 [Link] Accessed 22 August, 2017.
15 [Link]/sign-languages/sign-language-resources Accessed 3 August 2017.
REFERENCES
Akhavan, N., Nozari, N., & Göksun, T. (2017). Expression of motion events in Farsi.
Language, Cognition and Neuroscience, 32(6), 792– 804. [Link]
23273798.2016.1276607
Bauer, A. (2014). The Use of Signing Space in a Shared Sign Language of Australia (Vol. 5).
Berlin: De Gruyter Mouton and Ishara Press.
Boroditsky, L., Gaby, A., & Levinson, S. C. (2008). Time in space. In A. Majid (Ed.), Field
manual (Vol. 11, pp. 52–76). Nijmegen: Max Planck Institute for Psycholinguistics.
Brookes, H. (2004). A repertoire of South African quotable gestures. Journal of Linguistic
Anthropology, 14(2), 186–224.
Carew, M., & Green, J. (2015). Making an online dictionary for Central Australian sign
languages. Learning Communities: International Journal of Learning in Social Contexts.
Special issue: Indigenous sign languages, 16, 40–55.
Caselli, N. K., Sehyr, Z. S., Cohen-Goldberg, A. M., & Emmorey, K. (2016). ASL-LEX: A
lexical database of American Sign Language. Behavior Research Methods. [Link]
10.3758/s13428-016-0742-0
Cormier, K., Quinto-Pozos, D., Sevcikova, Z., & Schembri, A. (2012). Lexicalisation and
de-lexicalisation processes in sign languages: Comparing depicting constructions and
viewpoint gestures. Language & Communication, 32(4), 329–348.
Cormier, K., Schembri, A., & Woll, B. (2013). Pronouns and pointing in sign languages.
Lingua, 137, 230–247.
Crasborn, O. (2015). Transcription and notation methods. In E. Orfanidou, B. Woll, &
G. Morgan (Eds.), Research methods in sign language studies: A practical guide (Vol. 6,
pp. 74–88). London: Wiley-Blackwell.
Crasborn, O., & Hanke, T. (2003). Metadata for Sign Language Corpora. Background
Document for an ECHO Workshop. Radboud University, Nijmegen, 8–9 May 2003.
Crasborn, O., & Sloetjes, H. (2008). Enhanced ELAN functionality for sign language corpora.
In O. Crasborn, T. Hanke, E. Efthimiou, I. Zwitserlood, & E. D. Thoutenhoofd (Eds.)
Construction and exploitation of sign language corpora. Proceedings of the Third Workshop
on the Representation and Processing of Sign Languages (pp. 39–43). Paris: ELDA.
Crasborn, O., van der Kooij, E., Nonhebel, A., & Emmerik, W. (2004). ECHO data set for Sign
Language of the Netherlands (NGT). Department of Linguistics, Radboud University
Nijmegen.
Davis, J. (2015). North American Indian Sign Language. In J. B. Jepsen, G. De Clerck,
S. Lutalo-Kiingi, & W. B. McGregor (Eds.), Sign languages of the world: A comparative
handbook (pp. 911–931). Berlin: De Gruyter Mouton and Ishara Press.
de Vos, C. (2012). Sign-spatiality in Kata Kolok: How a village sign language in Bali inscribes
its signing space. Nijmegen: Radboud University.
de Vos, C. (2016). Sampling shared sign languages. Sign Language Studies, 16(2), 204–226.
213
de Vos, C., & Pfau, R. (2015). Sign language typology: The contribution of rural sign
languages. Annual Review of Linguistics, 1(1), 265– 288. [Link]
annurev-linguist-030514-124958
Defina, R. (2016). Do serial verb constructions describe single events?: A study of co-speech
gestures in Avatime. Language, 92(4), 890–910.
Emmorey, K., Borinstein, H. B., Thompson, R., & Gollan, T. H. (2008). Bimodal bilingualism.
Bilingualism: Language and Cognition, 11(1), 43–61.
Enfield, N. J. (2009). The anatomy of meaning: Speech, gesture, and composite utterances.
Cambridge; New York, NY: Cambridge University Press.
Evans, N., & Levinson, S. C. (2009). The myth of language universals: Language diversity
and its importance for cognitive science. Behavioral and Brain Sciences, 32(5), 429–448.
Farnell, B. (2009). Do you see what I mean?: Plains Indian sign talk and the embodiment of
action. Lincoln, NE: University of Nebraska Press.
Goldin-Meadow, S. (2005). The resilience of language: What gesture creation in deaf children
can tell us about how all children learn language. New York, NY: Psychology Press.
Goldin-Meadow, S., & Alibali, M. W. (2013). Gesture’s role in speaking, learning, and creat-
ing language. Annual Review of Psychology, 64(1), 257–283.
Green, J. (2014). Drawn from the ground: Sound, sign and inscription in Central Australian
sand stories. Cambridge: Cambridge University Press.
Green, J., & Wilkins, D. P. (2014). With or without speech: Arandic sign language from
Central Australia. Australian Journal of Linguistics, 34(2), 234–261. [Link]
10.1080/07268602.2014.887407
Haviland, J. B. (2000). Pointing, gesture spaces and mental maps. In D. McNeill (Ed.),
Language and gesture (pp. 13–45). Cambridge: Cambridge University Press.
Haviland, J. B. (2003). How to point in Zinacantán. In S. Kita (Ed.), Pointing: Where language,
culture, and cognition meet (pp. 139–169). New York, NY: Psychology Press.
Hoiting, N., & Slobin, D. I. (2007). From gestures to signs in the acquisition of sign language.
In S. D. Duncan, J. Cassell, & E. T. Levy (Eds.) Gesture and the dynamic dimension of
language: Essays in honor of David McNeill (pp. 51–65). Amsterdam: John Benjamins.
Jepsen, J. B., De Clerck, G., Lutalo-Kiingi, S., & McGregor, W. B. (2015). Sign languages of the
world: A comparative handbook. Berlin: Mouton de Gruyter and Ishara Press.
Johnston, T. (2010). From archive to corpus: Transcription and annotation in the creation
of signed language corpora. International Journal of Corpus Linguistics, 15(1), 104–129.
Johnston, T. (2014). Auslan Corpus Annotation Guidelines [June version]. Sydney: Macquarie
University. [Link]/about/annotations/
Johnston, T., & Schembri, A. (1999). On defining lexeme in a signed language. Sign Language
& Linguistics, 2(2), 115–185.
Kegl, J., Senghas, A., & Coppola, M. (1999). Creation through contact: Sign language emer-
gence and sign language change in Nicaragua. In M. DeGraff (Ed.), Comparative gram-
matical change: The intersection of language acquisition, creole genesis, and diachronic
syntax (pp. 179–237). Cambridge, MA: MIT Press.
Kendon, A. (1988). Sign languages of Aboriginal Australia: Cultural, semiotic and communica-
tive perspectives. Cambridge: Cambridge University Press.
Kendon, A. (2004a). Gesture: Visible action as utterance. Cambridge: Cambridge University
Press.
Kendon, A. (2004b). Review of “Hearing gesture: How our hands help us think” by Susan
Goldin Meadow. Gesture, 4(1), 91–101.
Kipp, M. (2014). Anvil: The video annotation research tool. In J. Durant, U. Gut, &
G. Kristofferson (Eds.) The Oxford Handbook of Corpus Phonology (pp. 420–436).
Oxford: Oxford University Press.
214
Kita, S., & Özyürek, A. (2003). What does cross-linguistic variation in semantic coordination
of speech and gesture reveal?: Evidence for an interface representation of spatial thinking
and speaking. Journal of Memory and Language, 48(1), 16–32.
Kusters, A. (2009). Deaf on the lifeline of Mumbai. Sign Language Studies, 10(1), 36–68.
Kusters, A. (2012). Being a deaf white anthropologist in Adamorobe: Some ethical and meth-
odological issues. In U. Zeshan & C. de Vos (Eds.) Sign languages in village communi-
ties: Anthropological and linguistic insights (pp. 27–52). Berlin: De Gruyter Mouton and
Ishara Press.
Kusters, A., De Meulder, M., & O’Brien, D. (Eds.) (2017). Innovations in Deaf Studies: The role
of deaf scholars. Oxford: Oxford University Press.
Le Guen, O. (2012). An exploration in the domain of time: From Yucatec Maya time ges-
tures to Yucatec Maya Sign Language time signs. In U. Zeshan & C. de Vos (Eds.) Sign
languages in village communities: Anthropological and linguistic insights (pp. 209–250).
Berlin: De Gruyter Mouton and Ishara Press.
Levinson, S. C., Brown, P., Danziger, E., De León, L., Haviland, J.B., Pederson, E., & Senft,
G. (1992). Man and tree and space games. In S. C. Levinson (Ed.), Space stimuli kit 1.2:
November 1992 (pp. 7–14). Nijmegen: Max Planck Institute for Psycholinguistics.
Mayer, M. (1994) [1969]. Frog, where are you? China: Puffin.
Maypilama, E., & Adone, D. (2013). Yolŋu sign language: An undocumented language of
Arnhem Land. Learning Communities, (13), 37–44.
McKee, R. L., & McKee, D. (2013). Making an online dictionary of New Zealand Sign
Language. Lexikos, 23(1), 500–531.
McKee, R. L., & Vale, M. (2016). Sign language lexicography. In P. Hanks & G.-M de
Schryver (Eds.), International handbook of modern lexis and lexicography. (pp. 1–22).
Heidelberg: Springer Verlag.
McNeill, D. (1992). Hand and mind: What gestures reveal about thought. Chicago, IL:
University of Chicago Press.
McNeill, D. (2015). Why we gesture: The surprising role of hand movements in communication.
Cambridge: Cambridge University Press.
Müller, C., Cienki, A., Fricke, E., Ladewig, S. H., McNeill, D., & Teßendorf, S. (Eds.) (2013).
Body-language-communication:An international handbook on multimodality in human
interaction. Berlin: Walter de Gruyter.
Nyst, V. (2007). A descriptive analysis of Adamorobe sign language (Ghana). Utrecht:
Netherlands Graduate School of Linguistics.
Nyst, V. (2015). Sign language fieldwork. In E. Orfanidou, B. Woll, & G. Morgan
(Eds.), Research methods in sign language studies: A practical guide (pp. 107–122).
London: Wiley-Blackwell.
Orfanidou, E., Woll, B., & Morgan, G. (2015). Research methods in sign language studies:
A practical guide. London: Wiley-Blackwell.
Padden, C. A. (2015). Methods of research on sign language grammars. In E. Orfanidou, B.
Woll, & G. Morgan (Eds.), Research methods in sign language studies: A practical guide
(pp. 141–155). London: Wiley-Blackwell.
Palfreyman, N. (2016). Variation in Indonesian sign language. Berlin: De Gruyter Mouton
and Ishara Press.
Palfreyman, N., Sagara, K., & Zeshan, U. (2015). Methods in carrying out language typo-
logical research. In E. Orfanidou, B. Woll, & G. Morgan (Eds.), Research methods in sign
language studies: A practical guide (pp. 173–192). London: Wiley-Blackwell.
Perniss, P. (2015). Collecting and analyzing sign language data: Video requirements and use
of annotation software. In E. Orfanidou, B. Woll, & G. Morgan (Eds.), Research methods
in sign language studies: A practical guide (pp. 55–73). London: Wiley-Blackwell.
215
9.1 INTRODUCTION
The great comedian W. C. Fields is credited with the quip, “Never work with ani-
mals or children!” Fieldwork on language acquisition, the study of how children
learn language(s), brings with it the usual excitement and challenges, but there are
also some specific considerations to think about if you do want to work with chil-
dren. This chapter provides some helpful advice for people considering an acquisi-
tion study, including funding requirements (§9.3.1), choosing a field site (§9.3.2),
whether documentation of the adult language already exists (§9.3.3), ethics and
consent (§9.3.4), potential gender problems (§9.3.5), health considerations (§9.3.6)
and recruitment (§9.3.7). We then give a snapshot of some of the standard meth-
odologies used in acquisition research and how they can be adjusted for the field,
with a view to understanding where compromises can be made to take into account
some of the difficulties of field research (§9.4).
The field of language acquisition in general also has many good methodologi-
cal overviews for how to conduct studies (Blom & Unsworth, 2010; Ingram, 1989;
Rowland, 2013; Stoll, 2015). Nonetheless, field studies of child language acquisition
present many unique challenges, such as the level of funding required to undertake
longitudinal studies requiring repeat fieldwork, the extent to which the target lan-
guage has been documented and the time it takes to transcribe children’s speech
(even you are an L1 speaker of the language!). These challenges come on top of the
usual considerations for language documentation (see Chapters 2–4) and are dis-
cussed in more detail in Kelly, Kidd et al. (2015) and Eisenbeiss (2006). How do you
aim for best practice in child acquisition research, given the additional challenges
posed by fieldwork? In this chapter, we discuss various issues and methodologi-
cal challenges in the context of three case studies of field acquisition projects –
Chintang, Murrinhpatha and Light Warlpiri.
DOI: 10.4324/9780203701294-9
This chapter has been made available under a CC-BY-NC-ND license.
217
such as head direction and pro-drop (Chomsky, 1981). More recently, it has been
replaced with a single principle of ‘recursion’ (Fitch, Hauser, & Chomsky, 2005) and,
since then, with ‘merge’ (Yang, Crain, Berwick, Chomsky, & Bolhuis, 2017).1 Other
theories, such as the constructivist approach of Tomasello (2003), claim that general
cognitive mechanisms such as pattern finding, joint attention, imitation and statis-
tical learning are sufficient for children to piece together language from surround-
ing speech. All of these approaches assume that children learn language through an
interplay of innate mechanisms, whether specific to language or domain-general,
and the input of adult language.
To investigate how children learn language, we need acquisition studies from a
diverse range of languages to determine what is common in how children acquire
these languages and how acquisition situations differ. We currently only have acqui-
sition studies from around 2 per cent of the world’s 6,000–7,000 languages, and
only corpora from 0.1 per cent. Most of these studies and corpora also only come
from a sample of Indo-European languages from Western Europe, such as English,
French, Italian and German (Lieven & Stoll, 2013; Stoll, 2015). As a result, theories
of acquisition have been biased towards the quirks of these languages, such as rela-
tive pronoun constructions, e.g. the cat that ate the mouse, which are actually rare
in the world’s languages.
Another problem with the current state of play is that many Western European
languages do most of the work of grammar in their syntax rather than by their
morphology. For example, in the sentence The boys hugged Mary, we know that
it was the boys who hugged Mary because subjects come before verbs in English.
English and other Western European languages have relatively simple morphologi-
cal structures, so many theories of acquisition have proposed rule-based learning
processes to account for how children acquire inflectional morphology such as plu-
ral suffixes, e.g. boys (Berko, 1958), or past tense inflections, e.g. hugged (Pinker &
Ullman, 2002). But the acquisition task is different if you are a child acquiring a
polysynthetic language such as Chintang. Chintang expresses the subject, object
and tense on the verb, so children acquiring Chintang need to learn many more
verb forms than English children before they fully master the language –up to 4,000
distinct forms compared with just four in English i.e. hug, hugs, hugged, hugging
(Stoll, Mazara, & Bickel, 2017)! The question then becomes, do children learn the
rules first and then build up the verb (hug + ed = hugged); or do they learn the whole
verb first as an unanalysed chunk and develop the rules as they acquire more and
more verb forms (hugged, chopped, sipped = [hug, chop, sip] + ed)?
a first language by most children in the community, but children also grow
up surrounded by Nepali, which is the dominant language of Nepal. Nepali
is the medium of instruction in schools, so children’s knowledge of Nepali
strengthens when they enter school.
Chintang is a polysynthetic language which means something that requires
a whole sentence to say in English can be said in a single verb in Chintang! The
verb is very rich in morphology, encoding the usual sorts of verbal categories
(e.g. tense, aspect, mood), as well as information about the subject and object
(e.g. person, number). Nouns are also morphologically complex, inflecting for
case and number, but they are not obligatory, and hence sentences are verb heavy.
The agentivity bias was originally proposed through studies of languages, such as
English, which group transitive and intransitive subjects (cf. Pinker, 1984). Work
on lesser-known languages is already demonstrating some of the similarities and
differences in how children learn languages across the world (ultimately provid-
ing a window into how language works in the human mind).
Of course, these reasons for studying language acquisition come from the point of
view of linguistic research. If you are an ‘insider’ linguist studying your own language,
you might have different motives for conducting an acquisition study. For example, if
your community has a school where children are taught in their own language, you
may wish to gain a better understanding of the order in which children acquire differ-
ent aspects of your language. It also may be useful for teachers to know what language
skills children bring to school and what areas of their language require greater atten-
tion within the framework of a formal curriculum. Being able to establish the norms
of acquisition in your language can also help identify children who are struggling and
might require extra support. If your language is endangered, a language acquisition
study may help identify aspects of language which need prioritising.
Luckily you aren’t the first person to conduct a field study of language acquisition!
There are some great studies that you can read to help you get started. Some of
the more substantial acquisition studies have been Elinor Ochs’ (1988) work on
Samoan (Polynesian, Samoa), Penny Brown’s (1998, 2001) work on Tzeltal (Mayan,
Mexico), Katherine Demuth’s (1992) corpus of Sesotho acquisition and Shanley
Allen’s (1998, 2013) research on Inuktitut (Eskimo-Aleut, Canada). As we dis-
cuss the specifics of field methodologies, we give you a snapshot of the Chintang,
Murrinhpatha and Light Warlpiri acquisition studies. These innovative case studies
provide some interesting ‘work arounds’ to accommodate some of the difficulties of
acquisition studies in the field.
9.3.1 Funding
Field studies of acquisition are very expensive, and should not be undertaken with-
out considerable financial support. Longitudinal studies of children learning their
first language (§9.4.1) require the children to be recorded at regular intervals which
means repeat trips to a field site or employing a local researcher for considerable
20
lengths of time to conduct the recording sessions. In addition, more funding is also
often required for participant payments than in other types of projects because
transcription usually needs to be done with caregivers in addition to field assis-
tants because caregivers are the best interpreters of their own children’s utterances
(§[Link]). Experimental studies can also be very expensive because they require a
corpus in the first place to help design the research questions, and also require large
numbers of participants (§9.4.2). If you are interested in an acquisition project, it
is a good idea to look out for advertisements for PhD students at large institutes,
centres or within well-funded projects. Or if you have a field site in mind, pitch an
idea to a potential supervisor. The design of acquisition projects has been developed
over decades and is necessary to ensure interpretable (and ultimately publishable)
results. Such results cannot be achieved with small studies which may leave too
many unanswered questions.
spoken in the community had not been described sufficiently to begin an acquisi-
tion study. In this case, O’Shannessy documented the use of the ergative marker in
both the adult language and by children simultaneously (Meakins & O’Shannessy,
2010; O’Shannessy 2011). For suggestions on how to document languages in lan-
guage shift scenarios, see §10.4.
Background police checks are required by law in many countries before you
begin working (paid or unpaid) with children to ensure that you have not previ-
ously committed a crime against a child. For example, in Australia, these are called a
‘Working with Children Check’ or ‘Working with Vulnerable People’ and successful
applicants are issued with a ‘blue card’ or ‘ochre card’ depending on the jurisdic-
tion. In Canada, anyone working with children has to get a ‘Criminal Record and
Vulnerable Sector Check’ from the police. It is also worth getting familiar with any
local welfare policies such as the mandatory reporting of suspected child neglect
or abuse.
Payment should be appropriate to the country and community you are work-
ing in (§2.5). Although it is illegal to pay children directly in many countries, the
caregiver should be compensated for the child’s time (and no doubt their own). If
payment is inappropriate you may contribute to the children in other ways such as
by assisting with educational costs, or providing home goods (bedding etc.), food
23
or toys. Your university may have policies about what they will and won’t cover, so
make sure you talk to the right people before such arrangements are made.
Archiving and access also requires extra consideration in language acquisition
projects. Videos may contain images of naked children, a normal everyday occur-
rence in some communities and climates, but nevertheless something which should
not be available via the internet, even in password protected archives. In many
communities, caregivers may only wish for the audio or the transcripts to be avail-
able (and not the video), even where there is nothing compromising on the videos.
Other requests from caregivers for anonymity, gossip to be deleted etc. should be
abided by, as is the case for all recordings.
bringing additional disease to the community. Make sure that you are healthy when
you set off on fieldwork, and ensure that you have had all relevant vaccines such as
flu shots, diphtheria, cholera etc. (§2.7). Also be aware of your health in the field.
Something as minor as a strep throat in an adult can develop complications, such as
rheumatic heart fever, in a child.
Other children are members of minority groups within colonised countries.
Aboriginal communities and Indian reservations in Australia, Canada and the
United States have First World conditions in many respects i.e. they have hous-
ing, electricity, access to health clinics, and have high rates of immunisation for
many childhood illnesses such as measles, mumps and rubella (MMR), Hepatitis
B, flu and whooping cough. However, in many other respects, Indigenous peoples
in colonised countries live in Third World conditions. For example, houses can
be overcrowded and sewage systems are often blocked with overuse, all of which
means that disease spreads fast. Diseases which are not seen elsewhere in these
countries may be common in these places. For example, tuberculosis has mostly
been eradicated in Australia, except in some Aboriginal communities. In the case of
children, chronic ear infections are common and can result in hearing impairment
in the crucial acquisition years (Galloway, 2008). In these cases, it is important to
test children’s hearing at various times during the project to ensure that acquisition
sequences are not being confounded by temporary deafness.
too old for an acquisition project, but they may have younger siblings. These organi-
sations may also be interested in the results of your project. Sometimes caution
needs to be taken with state-based organisations such as schools. It is worth finding
out how well integrated the organisation is in the community. In many countries, the
medium of instruction is in the official language of the country, for example in the
Chintang and Murrinhpatha communities, the schools teach in Nepali and English,
respectively. Sometimes this means there can be a tense relationship between state-
based organisations and the communities which requires some political navigation
on the part of the researcher. This approach will also typically require another layer
of institutional ethics and consent.
Now that you have established the language team and a cohort of children and
their families, the next step is to start your study. Acquisition studies fall into two
categories –(i) longitudinal studies of small cohorts of children and (ii) cross-
sectional experimental studies of large numbers of children. Both types of studies
have advantages and disadvantages and work well in tandem. Usually longitudinal
studies expose interesting areas of acquisition which can be pursued in more depth
with experimental work.
In the next section we provide an overview of the state-of-the-art in child lan-
guage acquisition studies and suggest some modifications that take into account
some of the difficulties posed by undertaking such a study in the field. Be warned
that some of the methods standard to language acquisition research are also pro-
hibitively difficult for many remote field sites. When preparing your timeline for the
study, take into account extra time for transcription, as children’s speech is much
more difficult to transcribe than adult speech. Most acquisition study methods were
developed for major languages which have large numbers of children available, are
often close to major universities with labs, and have plenty of native-speaker tran-
scribers available. Depending on your aims, you may be satisfied with a pilot-level
study, but high standards in data collection and analysis are expected by major inter-
national journals such as Journal of Child Language. This issue is discussed in detail
for studies of smaller languages in an issue of First Language (Kelly et al., 2015).
studies have the advantage of catching interesting early utterances, usually gram-
matical errors, that reveal stage-specific strategies underlying a child’s language
production. But as children become more and more talkative, it is increasingly
hard to record every utterance, which makes it impossible to conduct quanti-
tative studies. Also caregiver utterances are generally not noted down which
means that a child’s utterance is usually recorded out of context and it is not
clear what utterances are spontaneous and what are imitations. Good imitations
of adult sentences may give the impression that a child has reached a morpho-
logical milestone when they have not!
Modern longitudinal studies of language acquisition now rely on audio-visual
corpora of individual children sampled at regular intervals. Using video means that
the context of utterances is captured, including caregivers’ prior utterances and the
visual field of the child. Many corpora have been developed using CLAN (see §4.4
and Carmel O’Shannessy’s Light Warlpiri project in §9.4.2) and are now held in the
CHILDES database, although the Chintang and Murrinhpatha projects discussed
in §9.1 and §9.3.3 use ELAN (see §4.3.2).
Developing a corpus is a huge undertaking and there are many factors to take
into consideration to produce a representative corpus. Stoll (2015) cautions the
researcher to pay particular attention to (i) choosing representative children
(number, age span), (ii) the context of recording (place, number and type of
people present in the recordings), and (iii) the intervals between recordings and
their length. We will look at each of these in turn and also discuss issues of
transcription.
Bear in mind that there will be some natural attrition over the course of the pro-
ject so this means you need to over-recruit at the beginning of the project. Another
issue lies in representativeness. Even when you choose a cohort of six children,
how do you know that these children are representative of the population? Stoll &
Gries (2009) developed a method which involves conducting a series of incremental
comparisons between a child and his/her caregivers. This method can be applied as
the corpus is being collected, and it gives you a sense of whether you have the right
children in your sample.
study of child rearing was conducted first which helped determine who the chil-
dren interacted with the most and in what contexts. Similar to the Murrinhpatha
study, large differences between these and more familiar Western contexts were
observed, for example in the amount of child-directed speech, the demographics
of the interlocutors, the fact that adults did not play with children, and the paucity
of toys (Lieven & Stoll, 2013).
Once you have established the best way to record the children, you can now
get underway with setting up your equipment. In general, the specifications for
video (§3.2.5) and audio (§3.2.1) settings should be followed. In the case of the
Murrinhpatha project, children were recorded using two video cameras (one on
the general scene and another one on the focus children). One camera had a
shotgun microphone and the other a wireless microphone receiver (§3.2.2). The
children wore wireless microphones in backpacks. It was possible to mic up two
focus children at a time, with one lapel microphone recording through the left
channel of the stereo audio track and the other through the right channel. The
receivers were then ‘mounted’ on the camera using zip ties (because these cam-
eras typically only have one shoe). The boom mic was connected to the camera
via an XLR cable through the R channel. This camera also had a shotgun micro-
phone mounted on it running through the L channel. The backpacks were neces-
sary because Murrinhpatha children don’t always wear shirts due to the constant
heat. The backpacks meant good audio was recorded and children could roam
freely because the wireless microphones had a 50 metre radius. Of course record-
ing in this way is data hungry, requiring a lot of available storage and a good
workflow to keep track of all of the data ‘objects’ associated with each recording
session (see §4.2.1).
An alternative is the Chintang set-up which has one camera with a ‘fish-eye’ lens
and a shotgun microphone which produces just one media ‘object’ and requires less
initial set-up time. There is also no need to set up radio microphones, which require
level checks and balancing. All of this can be time-consuming and difficult for a
solo researcher. The ‘fish-eye’ lens captures much of the action, and is an alternative
to an additional roaming camera for active children.
If there is no electricity available, recording devices which use disposable batter-
ies can be used. No video cameras use disposable batteries, but some audio record-
ers do. For roaming children, the device can be put in a backpack or bumbag (fanny
29
pack) and worn by the child. In the worst case scenario, a diary style study can
be undertaken (see §9.4.1). This sort of study might be effective in field situations
where it is common for the researcher to live with a family, such as in Melanesia or
Papua New Guinea.
The Chintang and Murrinhpatha projects did not adopt experimental meth-
ods, but one good example of their use in a field study of acquisition comes from
O’Shannessy’s (2011) study of the acquisition of argument disambiguation in Light
Warlpiri.
Figure 9.1
A ‘picture-choice’ task used with Warlpiri children
23
Figure 9.2
A scene from ‘The Guitar Story’
Source: O’Shannessy, 2004
9.5 SUMMARY
There is currently much scope for acquisition studies of lesser known languages to
make major contributions to acquisition theory, and linguistic theory more broadly.
Understanding acquisition provides a crucial piece of the puzzle of the human lan-
guage capacity. Acquisition projects can also contribute a lot to a speech commu-
nity where the target language is endangered. For example, projects of this kind can
help understand which parts of the language to target in revitalisation efforts.
For a general discussion of field methods for acquisition, see Eisenbeiss (2006) and
Kelly, Forshaw et al. (2015). For discussions on standards for corpus building, see
Stoll (2015). For good descriptions of field studies of acquisition, see Ochs (1988)
on Samoan, Brown (1998, 2001) on Tzeltal, Demuth (1992) on Sesotho, and Allen
(1998, 2013) on Inuktitut. For a good overview of experimental methods used in
acquisition research, see Ambridge & Rowland (2013), O’Shannessy (2013) and
Blom & Unsworth (2010).
NOTES
REFERENCES
Allen, S. (1998). Categories within the verb category: Learning the causative in Inuktitut.
Linguistics, 36, 633–677.
Allen, S. (2013). The acquisition of ergativity in Inuktitut. In E. Bavin & S. Stoll (Eds.), The
acquisition of ergativity (pp. 71–105). Amsterdam: John Benjamins.
Allen, S., & Crago, M. (1996). Early passive acquisition in Inuktitut. Journal of Child Language,
23(1), 129–155.
Ambridge, B., & Lieven, E. (2011). Child language acquisition: Contrasting theoretical
approaches. Cambridge: Cambridge University Press.
Ambridge, B., & Rowland, C. (2013). Experimental methods in studying child language
acquisition. Wiley Interdisciplinary Review of Cognitive Science, 4(2), 149–168.
Bates, E., Dale, P., & Thal, D. (1996). Individual differences and their implications for theo-
ries of language development. In P. Fletcher & B. Macwhinney (Eds.), Handbook of child
language (pp. 96–151). Oxford: Blackwell.
Berko, J. (1958). The child’s learning of English morphology. Word, 150–177.
Berman, R., & Slobin, D. (1994). Relating events in narrative: A crosslinguistic developmental
study. Hillsdale, NJ: Lawrence Erlbaum.
Blom, E., & Unsworth, S. (2010). Experimental methods in language acquisition research.
Amsterdam: John Benjamins.
Brown, P. (1998). Children’s first verbs in Tzeltal: Evidence for an early verb category.
Linguistics, 36(4), 713–753.
Brown, P. (2001). Learning to talk about UP and DOWN in Tzeltal: Is there a language-
specific bias for verb learning. In M. Bowerman & S. Levinson (Eds.), Language acquisi-
tion and conceptual development (pp. 512–543). Cambridge: Cambridge University Press.
Chafe, W. (Ed.). (1980). The pear stories: Cognitive, cultural, and linguistic aspects of narrative
production. Norwood, NJ: Ablex.
Chomsky, N. (1981). Principles and parameters in syntactic theory. In N. Hornstein, &
D. Lightfoot (Eds.), Explanation in linguistics: The logical problem of language acquisition
(pp. 32–75) London: Longman.
Demuth, K. (1992). Acquisition of Sesotho. In D. Slobin (Ed.), The cross-linguistic study of
language acquisition (Vol. 3, pp. 557–638). Hillsdale, NJ: Lawrence Erlbaum.
Eimas, P., Siqueland, E., Jusczyk, P., & Vigorito, J. (1971). Speech perception in infants.
Science, 171, 303–306.
Eisenbeiss, S. (2006). Documenting child language. Language Documentation and
Description, 3, 106–140.
Fenson, L., Dale, P., Reznick, S., Bates, E., Thal, D., Pethick, S. (1994). Variability in early com-
municative development. Monographs of the Society for Research in Child Development,
59, 1–185.
Fitch, T., Hauser, M., & Chomsky, N. (2005). The evolution of the language faculty:
Clarifications and implications. Cognition, 97, 179–210.
Forshaw, W. (2016). Little kids, big verbs: The acquisition of Murrinhpatha bipartite stem verbs.
PhD Thesis. University of Melbourne.
236
Galloway, A. (2008). Indigenous children and conductive hearing loss. In J. Simpson &
G. Wigglesworth (Eds.), Children’s language and multilingualism: Indigenous language
use at home and school (pp. 216–234). New York, NY: Continuum.
Gentner, D. (1982). Why nouns are learned before verbs: Linguistics relativity versus natural
partitioning. In K. Hirsh-Pasek & R. Golinkoff (Eds.), Action meets word: How children
learn verbs (pp. 544–564). Oxford: Oxford University Press.
Ingram, D. (1989). First language acquisition: Method, description and explanation.
Cambridge: Cambridge University Press.
Kelly, B., Forshaw, W., Nordlinger, R., & Wigglesworth, G. (2015). Linguistic diversity in first
language acquisition research: Moving beyond the challenges. First Language, 35(4–5),
286–304.
Kelly, B., Kidd, E., Wigglesworth, G., Forshaw, W., Nordlinger, R., O’Shannessy, C. (2015).
Special Issue: Indigenous children’s language: Acquisition, preservation and evolution of
language in minority contexts. First Language, 35, 279–285.
Lieven, E., & Stoll, S. (2013). Early communicative development in two cultures: A com-
parison of the communicative environments of children from two cultures. Human
Development, 56, 178–206.
Meakins, F., & O’Shannessy, C. (2010). Ordering arguments about: Word order and discourse
motivations in the development and use of the ergative marker in two Australian mixed
languages. Lingua, 120(7), 1693–1713.
Nordlinger, R. (2015). Mapping morphological complexity in Murrinhpatha. Presented at
the Linguistics Association of Great Britain conference, UCL, London.
Ochs, E. (1988). Culture and language development: Language acquisition and language
socialization in a Samoan village. Cambridge: Cambridge University Press.
O’Shannessy, C. (2004). The monster stories: Picture stimulii to elicit lexical subject NPs.
Nijmegen: Max Planck Institute for Psycholinguistics.
O’Shannessy, C. (2011). Competition between word order and case-marking in interpret-
ing grammatical relations: A case study in multilingual acquisition. Journal of Child
Language, 38(4), 763–792.
O’Shannessy, C. (2013). Methods in researching children’s acquisition of sociolinguistic com-
petence. In J. Holmes & K. Hazen (Eds.), Research methods in sociolinguistics: A practical
guide (pp. 304–324). Chichester, England: John Wiley & Sons.
Pinker, S. (1984). Language learnability and language development. Cambridge, MA: Harvard
University Press.
Pinker, S., & Ullman, M. (2002). The past and future of the past tense. Trends in Cognitive
Sciences, 6(11), 456–463.
Rice, M. (2010). Mean length of utterance levels in 6-month intervals for children 3 to
9 years with and without language impairments. Journal of Speech, Language and Hearing
Research, 53(2), 333–349.
Rowland, C. (2013). Understanding child language acquisition. Abingdon, England: Routledge.
Roy, D. (2009). New horizons in the study of child language acquisition. Proceedings of
Interspeech 2009, Brighton (England).
Slobin, D. (1982). Universal and particular in acquisition. In E. Wanner & L. Gleitman
(Eds.), Language acquisition: The state of the art (pp. 128–170). Cambridge: Cambridge
University Press.
Slobin, D. (2004). The many ways to search for a frog: Linguistic typology and the expression
of motion events. In S. Strömqvist & L. Verhoeven (Eds.), Relating events in narrative
(Vol. 2, pp. 219–257). Hillsdale, NJ: Lawrence Erlbaum.
237
10
Contact languages
10.1 INTRODUCTION
Contact languages such as creole varieties can be studied like any other language.
They have their own phonology, lexicon and grammatical rules so the field linguist
can create a corpus of the language, compile a dictionary or write a grammar. But
there are also specific questions that relate to contact languages which make them
an interesting object of study in and of themselves. For example, is simplification
or morphological reduction a feature of language contact; what is the respective
role of universal constraints, typological (mis)matches and community norms on
language mixing; and how does the degree of bilingualism in a community or indi-
vidual affect contact outcomes? And of course language contact is not restricted
to spoken language, but also occurs in other modalities such as sign language (see
Chapter 8). This chapter discusses special considerations that field linguists should
think about before embarking on documenting a contact language (§10.3) and
methods particular to these linguistic varieties (§10.4).
DOI: 10.4324/9780203701294-10
This chapter has been made available under a CC-BY-NC-ND license.
239
‘young people’s X’ or ‘our X’, and sometimes these reflect their low status or lack
of recognition as a distinct language, for example, ‘rubbish talk’, ‘light X’, ‘lingo’ or
‘patois’ (see §10.3.7 for a discussion of naming practices).
Some linguists have also perpetuated the view of community leaders or the
regional majority by not considering contact languages worthy objects of linguis-
tic research. This is despite the fact that they are often the dominant language of
a speech community! Garrett’s (2009) paper ‘Contact languages as “endangered
languages”: What is there to lose?’ argues that contact varieties have been largely
invisible to documentary linguistics. Indeed discussions of them in field methods
courses often involve techniques for avoiding confusing structures in semi-speaker
or contact varieties for the ‘real’ grammar. Archives are also full of recordings with
metadata about heritage languages under investigation, but with no record of con-
tact languages, despite the fact that they are often the lingua franca of recording
sessions.
processes may have shaped languages historically (where often these processes
require reconstruction using historical linguistic methods, e.g. the comparative
method). Understanding contemporary sociophonetic or grammatical variability
may also help linguists map deeper historical changes which led to splits in family
trees, i.e. linguistic diversification. Studying languages in contact can also teach us
more about synchronic linguistic structures. For example what is permissible in
borrowing or code-switching often relies on the respective typological match of the
languages in contact. Unexpected switches or borrowings may lead to the reanalysis
of a linguistic structure.
Pidgins are not used as the native language of a speech community, but function as
a lingua franca between a number of groups, which otherwise do not have a shared
241
Romani-Turkish code-switching
The Romani community of Thrace in Greece is a trilingual community which
speaks Romani (Indo-Aryan), Turkish (Turkic) and Greek (Hellenic). The
Roma had arrived in the Byzantine Empire by 1200, now Turkey and Greece
(Matras, 2002). At present, approximately 200,000 Roma live in Greece
(Bakker, 2001). Thrace Romani is under threat by Turkish and Greek to dif-
ferent degrees in different places. Although it is still learnt by children in some
areas, language shift can be seen in the parent generation. All children have
243
The corpus was collected through storytelling interviews with the field lin-
guist Evangelia Adamou, and in-group conversations between the Roma
participants. This productive data was further investigated experimentally
using reaction-time tasks to determine whether mixed Romani-Turkish sen-
tences have higher processing costs than unilingual L2-Turkish sentences
(Adamou & Shen, 2018).
The documentation of Gurindji Kriol has occurred in tandem with the docu-
mentation of Gurindji. Over a number of projects an 165-hour (405,027 words)
annotated sound-linked corpus of Gurindji Kriol now exists which represents
language from 157 speakers (Meakins, 2013). The corpus is transcribed in
CLAN (§4.3.2) and much of it is housed in ELAR and PARADISEC (§4.4).2
other cases, a shift to the regionally dominant language goes to completion and the
heritage language is no longer used. This obsolescence process often occurs across
different domains of discourse at different rates. For example, often ritual contexts
are the last domain where an endangered language is still spoken. The use of Latin
in churches in Germany before Martin Luther wrote the Deutsche Messe is one
example.
Language shift can be mapped across a generational cline. While older speakers
often speak a conservative variety of the language, innovations are present in the
speech of younger members of the community. Differences between the speech of
generations are often described as mistakes or errors, and linguists generally reflect
this sentiment labelling younger speakers as partial or semi-speakers. Some lan-
guages which have been examined in detail are East Sutherland Gaelic spoken in
Scotland (Dorian, 1978), Pennsylvania and Texas German spoken in the US (Boas,
2009), Oberwart Hungarian spoken in Austria (Gal, 1979) and Dyirbal spoken in
north Queensland (Australia) (Schmidt, 1985).
Innovations in language shift varieties are often characterised by allomorphic
reduction, the simplification of grammatical systems, syncretism in inflectional
paradigms (case, gender, verb conjugations etc.) and synthetic systems becoming
analytic (e.g. case being replaced by prepositions). These innovations can be the
result of internal changes which may be accelerated in obsolescence situations or
language contact with a socially dominant language. More recently, this approach
to characterising language shift has been challenged with a variety of complexi-
fication processes identified as being responsible for the development of these
varieties.
The usual fieldwork protocols around the roles and responsibilities of fieldworkers
within a speech community and workflow considerations such as equipment and
software choice, and archiving also apply to contact languages. You can read about
these issues in Chapters 2–4. But there are also specific considerations for docu-
menting contact languages in the field which we discuss here.
246
have multiple ways of expressing the same concept. Even well-established lan-
guages show variation, as sociolinguists such as William Labov (1994b, 1994a)
have pointed out extensively! For example, the comparative in English can be
expressed using the suffix -er or the adverb more, but also by double marking e.g.
more happier. The choice depends on a combination of phonology (the number
of syllables in the stem) and social status (double-marking is associated with
lower socio-economic classes and younger people). To truly capture the lan-
guage, variation and what drives the variation (social, discourse-level or gram-
matical factors) need to be documented.
In the end, if language shift is not treated differently from other types of language
change (not driven by loss), then similar patterns and processes of change should
be expected. For further discussions of documenting variation in the field, see
Meyerhoff et al. (2012).
248
We listen to the kids, you know, and they don’t talk properly. For example, they are always
saying ‘nyawangkirri’ for ‘that way’. They always say ‘nyawangkirri’ not ‘murlangkurra’
which is wrong. You should say ‘murlangkurra’. And they also say ‘pinka-kirri’ for ‘to
the river’ which is wrong. They should say ‘pinka-kurra’. ‘Nyawangkirri’ is not proper
Gurindji. We listen to the kids and they don’t talk as well as I am talking to you.
(Meakins, 2010, p. 230)
Younger people, who are the main speakers of Gurindji Kriol, were initially reticent
to discuss their speech style, aware that they did not speak Gurindji in a tradi-
tional manner. Even so Gurindji Kriol has a lot of covert prestige among its speakers
and they did want it documented. The solution was to split the documentation of
Gurindji and Gurindji Kriol between the two Gurindji communities. The docu-
mentation of Gurindji was focused at Kalkaringi because the majority of speakers
lived here, and Gurindji Kriol work occurred 8 kilometres away at Daguragu, which
has a younger age profile. It meant that the young people at Daguragu were able to
work on their language without facing criticism from their community leaders.
whose language skills are not considered noteworthy by the community because
the new variety has little status in the community can be both socially inappropri-
ate and linguistically inexplicable from a community perspective. Related problems
occur in language acquisition for male researchers (see §9.3.5).
This was an issue for both the Greek Thrace project and Gurindji Kriol project.
In the Greek Thrace project, Adamou mainly worked with female speakers and was
under constant supervision when working with male speakers of her own age. In
the case of Gurindji Kriol, no adult male speakers are represented in the data, only
boys under the age of 14, a result of the fact that it was culturally inappropriate for
Meakins, a (then) young woman, to work alone with young men. Working with
young men would have been disapproved of because it is unusual for a woman to
spend much time alone in a group of men.
Rhydwen (1995, p. 117) goes on to suggest that the status of Kriol is similar to the
status of other ethnolects of English, such as Black American English, which are in-
group languages that express minority identities.
The elicitation methods standard to grammar writing projects (§6.5) are prob-
lematic for characterising contact languages for a number of reasons discussed
in §10.4.1. Instead the well-worn path of corpus development (§10.4.2) followed
by targeted (semi-)experimental elicitation and comprehension tasks (§10.4.5) is
beginning to become standard in field-based language contact projects. This path
was taken by both the Greek Thrace and Gurindji Kriol projects. It was pioneered
by language acquisition (see §9.4.2) and has become common in other areas such
as semantic typology.
Peer elicitation tasks also have the advantage of allowing the language team to
design tasks which target particular linguistic structures and can be performed with
a large number of people to account for variation. To go back to the example of
optional nominative case-marking in Gurindji Kriol. Nominative marking attaches
to nouns which are optional. Conversation and narrative data alone did not pro-
vide sufficient numbers of nouns to map the parameters of variation of nomina-
tive marking within Gurindji Kriol, and to conduct statistical tests of significance.
Therefore peer elicitation tasks were used to supplement the data set.
254
Figure 10.1
Cassandra Algy Nimarra and Felicity Meakins record director-matcher tasks with Jamieisha Barry
Nangala, Regina Crowson Nangari and Quitayah Frith Namija (Photo: Jennifer Green 2017)
25
microphones linked to the two devices. A wireless lapel mic links to the camera to
ensure good recording levels from shy participants. Felicity notes down the number
of each picture in turn. This annotation is included in the transcript which is done
using CLAN.
Finally, like director-matcher tasks and card games, picture-prompt books produce
comparative data necessary in situations where variation is common.
The most successful books contain only pictures (no words!) to avoid stilted
translations, literacy concerns or indeed spontaneous language contact influence
from a regionally dominant language. Participants who have been to school are
often more comfortable with these books than small children or older speakers who
tended to skip backwards and forwards between pages and generally did not associ-
ate pictures in a clear linear fashion. Like other peer elicitation activities, picture-
prompt books work well with an audience of peers.
A number of picture-prompt books were used to elicit Gurindji Kriol narratives.
These books included the Monster book series created by Carmel O’Shannessy
(2004) for her work on Light Warlpiri (§9.4.2). Other books are also useful for
studying specific structures. Frog, where are you? (Mayer, 1994) has been used in
numerous studies of information structure and spatial relations such as motion
constructions. Using Frog stories provides not only a comparative dataset within
the Gurindji Kriol speaking community, but also can be applied more broadly.
Despite not being a Gurindji-focused story, adults and children alike enjoyed using
the Frog stories immensely. A variation on picture-prompt books is the use of short
word-less videos such as Wallace Chafe’s Pear Story video.5
language situation, it might be sufficient to run a small study with a few people. If
you are planning to write a school curriculum in your language, broader grammati-
cal description is required and, with it, larger numbers of speakers.
If there is a lot of variation present, you will be faced with needing large numbers
of speakers. The number of speakers may depend on the complexity of the linguis-
tic feature you are interested in. How many different ways can it be expressed and
how many different factors drive the choice between variables? For example, in the
case of optional nominative marking in Gurindji Kriol, the choice to use or not
use the nominative marker depends on word order, whether the speaker has used
the marker before, whether an event has occurred or not, whether a co-referential
pronoun is also present, among other factors. This study used 6,550 tokens from
103 speakers to model the variation (Meakins & Wilmoth, 2018). Meyerhoff et al.
(2012) have a good description of how to calculate how many speakers you need for
a study on variation.
For experimental work, see §10.4.5, usually 12 people per category is consid-
ered reasonable (i.e. 12 male, 12 female; or 12 adults, 12 children etc.), and at least
30 tokens of each variant per factor. The Romani study of reaction time and lan-
guage switching which aimed to determine whether mixed Romani-Turkish sen-
tences have higher processing costs than unilingual L2-Turkish sentences used 49
participants in one experiment and 37 in the other. Ten had attended at most pri-
mary school, 27 had attended secondary school and above; 23 participants were
female and 14 male (Adamou & Shen, 2018). See also §[Link] for a discussion
of numbers of participants and deliberate over-recruiting in studies of language
acquisition.
involved 105 participants) and Stewart et al’s (2018) study of voicing contrasts in
stops (which had 103 participants), attained education level was used as a proxy for
English proficiency in place of testing.
10.5 SUMMARY
Garrett (2009) has a good discussion about why it is important to document contact
languages. Winford (2003), Poplack (2015) and Meakins (2016) have good over-
views of pidgin and creole languages, code-switching and mixed languages respec-
tively. Meyerhoff et al. (2012) provide good guidelines for dealing with variation in
fieldwork.
NOTES
1 The sega is both music and dance and is at the core of the folklore of Mauritius. It origi-
nates from Africa and was introduced in Mauritius in the eighteenth century.
2 [Link] Accessed 31 August 2017.
3 [Link]/holdings/canadian-[Link] Accessed 31 August 2017.
4 [Link] Accessed 31 August 2017.
5 [Link] Accessed 31 August 2017.
REFERENCES
Adamou, E., & Granqvist, K. (2015). Unevenly mixed Romani languages. International
Journal of Bilingualism, 19(5), 525–547.
Adamou, E., & Shen, R. (2018). There are no language switching costs when codeswitching is
frequent. International Journal of Bilingualism.
259
Bakker, P. (2001). Romani in Europe. In G. Extra & D. Gorder (Eds.), The other languages
of Europe: Demographic, sociolinguistic and educational perspectives (pp. 293– 313).
Cleveland, OH: Multilingual Matters.
Boas, H. (2009). The life and death of Texas German. Durham, NC: Duke University Press.
Croft, W. (2000). Explaining language change: An evolutionary approach. Harlow, England:
Longman.
Dorian, N. (1978). The fate of morphological complexity in language death: Evidence from
East Sutherland Gaelic. Language, 54(3), 590–609.
Dorian, N. (1989). Investigating obsolescence: Studies in language contraction and death.
Cambridge: Cambridge University Press.
Gal, S. (1979). Language shift: Social determinants of linguistic change in bilingual Austria.
San Francisco, CA: Academic Press.
Gardani, F. (2008). Borrowing of inflectional morphemes in language contact. Frankfurt:
Peter Lang.
Garrett, P. (2009). Contact languages as “endangered languages”: What is there to lose?
Journal of Pidgin and Creole Languages, 21(1), 175–190.
Guillemin, D. (2011). The syntax and semantics of a determiner system: A case study of
Mauritian creole. Amsterdam: John Benjamins.
Labov, W. (1994a). Principles of linguistic change: Internal factors. Oxford: Blackwell.
Labov, W. (1994b). Principles of linguistic change: Social factors. Oxford: Blackwell.
Lipski, J. (2016). Language switching constraints: More than syntax? Data from Media
Lengua. Bilingualism: Language and Cognition, 20(4), 722–746.
Matras, Y. (2002). Romani: A linguistic introduction. Cambridge: Cambridge University Press.
Matras, Y., & Sakel, J. (Eds.). (2007). Grammatical borrowing in cross-linguistic perspective.
Berlin: Mouton de Gruyter.
Mayer, M. (1994). Frog, where are you? China: Puffin.
Meakins, F. (2010). The importance of understanding language ecologies for revitalisation. In
J. Hobson, K. Lowe, S. Poetsch, & M. Walsh (Eds.), Re-awakening languages: Theory and
practice in the revitalisation of Australia’s Indigenous languages (pp. 225–239). Sydney:
Sydney University Press.
Meakins, F. (2012). Which mix? Code-switching or a mixed language –Gurindji Kriol.
Journal of Pidgin and Creole Languages, 27(1), 105–140.
Meakins, F. (2013). Gurindji Kriol. In S. Michaelis, P. Maurer, M. Haspelmath, & M. Huber
(Eds.), The survey of pidgin and creole languages (Vol. 3, pp. 131–139). Oxford: Oxford
University Press.
Meakins, F. (2018). Mixed languages. In M. Aronoff (Ed.), Oxford research encyclopedia of
linguistics. Oxford: Oxford University Press.
Meakins, F., Jones, C., & Algy, C. (2016). Bilingualism, language shift and the corresponding
expansion of spatial cognitive systems. Language Sciences, 54, 1–13.
Meakins, F., & Wilmoth, S. (2018). Complex cell-mates: Morphological overabundance
resulting from language contact. In P. Arkadiev & F. Gardani (Eds.), Morphological
Complexity. Oxford: Oxford University Press.
Meyerhoff, M., Adachi, C., Nanvakhsh, G., & Strycharz, A. (2012). Sociolinguistic field-
work. In N. Thieberger (Ed.), The Oxford handbook of linguistic fieldwork (pp. 121–146).
Oxford: Oxford University Press.
Moravcsik, E. (1978). Universals of language contact. In J. Greenberg (Ed.), Universals
of human language, Vol. 1: Method and theory (pp. 95–122). Stanford, CA: Stanford
University Press.
260
11
Verbal art
11.1 INTRODUCTION
A great thing about fieldwork is the variety of language use you are exposed to
in the speech community. One of the most beautiful of these uses is song. All
cultures have some form of poetry or song that may be performed musically or
recited in a variety of ways. These tend to be stored in people’s memory, even in
literate cultures, and often in very large quantities. Poetry and song are exam-
ples of what are called verbal art, oral literature or the temporal arts (Barwick,
2012). Other types of verbal art include litany, children’s rhyming games, sung
tales, poems, ritual speech events, advertising jingles, incantations, charms and
spells. In many cultures, the distinction between poetry and song does not exist.
As such, musicologists tend to regard poetry and song as points on a continuum
rather than as discrete categories.
Verbal art can be thought of as intensively structured use of language. By inten-
sively structured we mean it is subject to structures that are not typical of everyday
speech. The occurrence of particular sounds, grammatical structures or meanings
can all be regulated in verbal art. Typically, these elements are arranged with respect
to a line, e.g. line-final rhyme. In contrast, units of speech can be of any length and
need not even be made up of lines. By drawing on the concept of a line, verbal
art can make use of other poetic features such as alliteration (the occurrence of
the same sound in the same position of a word or line) and parallelism (repetition
with minimal variation). Parallelism is exemplified in the following two lines from
the verbal art form called tom yaya of the Papua New Guinea Highlands (Rumsey,
2011, p. 252):
These two lines show parallelism: repetition with variation occurring in the same
position of the line (variation is in italics). The variation here occurs within the
same semantic domains. The example also shows another feature of much verbal
art: the use of ideophones such as ‘buzz’ and ‘drone’. In these ways, verbal art
draws attention to its own form, what Jakobson (1960) calls the “poetic function”
of language. This contrasts with everyday speech which gives priority to the com-
municative function of language. Artistic structures such as rhyme and parallel-
ism can also turn up in speech play or daily conversation where it may invoke the
DOI: 10.4324/9780203701294-11
This chapter has been made available under a CC-BY-NC-ND license.
26
context of formal oratory; however, this chapter limits the discussion to verbal
arts, which are often named categories or genres in a speech community.
In this chapter, we discuss how to do fieldwork on verbal arts by taking song as
an exemplar. Songs are often the property of whole communities, although some
people may be recognised as song experts. In some cultures there are types of
verbal art composed by gifted individuals that may be appreciated by some, but
not all, members of the community. Other types are composed by ordinary peo-
ple, and transmitted orally among the population, as is the case with folk songs.
In the early 1900s folklorist Cecil Sharp (1973) visited Appalachia to record folk
songs and found that there, unlike in England, everyone sang folk songs, young
and old. Across cultures we find that songs play a role in imparting the values
of society. Some are shared, traded or performed for profit; others are a part
of religious, healing or shamanic practices and some are simply performed for
entertainment. In many cultures, verbal arts incorporate dance, ritual action and
the visual arts. In this context, it is more accurate to talk about ‘performance
arts’ rather than just ‘verbal arts’. A multi-disciplinary team of researchers is very
useful when working on such performance arts. In this chapter we present the
musicologist’s wish list to ensure that the linguist’s recordings are also useful for
musical analysis (Barwick, 2006, 2012).
There are many reasons why verbal art should be included in your linguistic
fieldwork:
1. Song is an instance of both language and music that depends on shared
human capacities. To understand the human capacity for language, we must
also understand the human capacity for music.
2. As with music, the performance of verbal art is often a very pleasurable
experience.
3. Verbal art forms are often seen as the pinnacle of a culture’s linguistic and
musical achievements and so the community may wish to showcase these to
outsiders (Evans, 2009).
4. The benefits and uses of recordings of verbal arts may be much more readily
apparent to the speech community than recordings of speech. The opportu-
nity to record songs can thus be a great way to build relationships and give
back to the community something of great worth.
5. Songs may be the sole means of transmitting important social, cultural and
historical information and reproducing the knowledge and norms of soci-
ety. They can be a mirror of community concerns and values, and can codify
emotional responses to objects and events. Just think of the Portuguese con-
cept of saudade, a deep longing for an absent something or someone, which
permeates the Portuguese song genre fado.
263
Before going into the field, find out if there is any prior documentation of the verbal
arts in the community or language region. Remember that anthropologists, folklor-
ists, musicologists, as well as linguists, may have recorded or described verbal art.
Watch or listen to any past recordings from the region. Keep in mind, however, that
the songs of today may be different to those of the past, as innovation and creativity
typically drive musical change. It is not uncommon for the songs of younger and
older generations to differ, with each group preferring their own songs or versions
264
In some cultures, fees or gifts are paid to the performers or copyright holders
for recording songs. If this is the case, talk to someone in the community about
what the appropriate fee or gift might be and who should be remunerated. It may
be that only the composer or certain people in the society can perform, talk about,
or give permission for songs to be recorded or written about. Listen to the advice
of community members as to who you should or shouldn’t record. It is much bet-
ter to have other community members present when recording verbal art. The
exchanges between performer and community members often lead to a much
richer event. Community members can also be helpful intermediaries between you
and the performer, paving the way for a smooth and enjoyable event. Additionally,
community members will be better placed to work on transcriptions and transla-
tions, having participated in the recording event themselves. If possible, work with
a community member to assist in negotiating the appropriate payments or gifts
(if required), ensuring permissions are in place, and running and recording the
performance event.
If possible, play back your recording of the song soon after the performance has
finished. Not only do singers often ask to hear the recording straight away, but this
is an excellent opportunity to get an explanation of what the song means as well as
commentary, corrections, additions or reflections on the performance. Of course,
if the performance involves lots of people and takes days to perform, then it is not
going to be possible to play the whole recording.
Take a pair of high quality external speakers for this purpose. It is much bet-
ter to play the sound through these than through the inbuilt speakers of your
268
recording device. If possible, transfer the recording to a hard drive first (this is
not always possible). That way there is no risk of accidentally deleting the only
copy of the recording as you play it back on your recording device. Furthermore,
if you can play transferred files on another device it means that you can record
the explanation and discussion of the songs as well. Alternatively you may
have a second recording device that you can use to record the explanation and
discussion.
As the performance is underway, you should keep notes on the number of songs
and try to identify any words or structures, such as lines. This will help you to play
back the songs to the singer and find particular songs they may wish to comment
on. When playing back the recording or reading your notes, go through one line at
a time, or use some other natural division of the song, to clarify form and meaning.
If the practice is to repeat a song or line multiple times (but possibly in different
musical styles), it may annoy the singer to be asked “What does that mean” when
it is actually the same line that they have just explained! Listen carefully and take
notes while recording, as this will help you distinguish repetition from new lyrics.
The task of transcribing and translating songs can be very rewarding due to
their stimulating intellectual content, and their linguistic and musical artistry.
Keep in mind, however, that in some cultures the right to translate and work
on songs may be dependent on a person’s role in society or their relationship to
specific songs.
In addition to any social restrictions on who can work on songs, in some cases
the style in which verbal art is delivered makes transcription difficult. For example,
in soft chanting or non-unison group singing it can be very difficult –if not impos-
sible –to determine the linguistic content. In addition, the gap between lexical con-
tent of individual words and the meaning conveyed by these words put together in
a song can be vast. Remember that in verbal arts the referential message may be
downplayed in favour of formal or aesthetic dimensions, which may be in the style
or emotional force of the delivery. Obtaining a spoken explanation or a ‘spoken lan-
guage’ version of the songs at the time of performance will greatly assist in the task
of transcription and translation.
At the early stages of linguistic transcription, it is a good idea to distinguish the
units of the performance, such as a song or poem, from the method of its delivery,
such as singing, chanting or speaking. It is possible that people may at times speak
the song and sing speech, and thus your transcript should be able to distinguish
song lyrics from an everyday word or phrase, and distinguish singing or intoning
from speaking. A transcription of a Kaytetye text illustrating the difference between
sung and spoken song is shown in Figure 11.1 (Green & Turpin, 2013, pp. 384, 385).
This excerpt ends in a portion of singing, labelled ‘(singing)’ consisting of a song
text with two lines (line 1 and line 2). Before this, each of the two lines is spoken,
where they are used to refer to the devil character in the story. Without paying
269
Figure 11.1
A text illustrating the difference between sung and spoken song texts
Source: Green & Turpin, 2013, pp. 384, 385
Selections from a transcript of a text by Kaytetye man Tommy Thompson showing the difference
between song and speech, and between sung and spoken delivery.
\t 119.767
\k Nyarte re two time ‘Angkwerey-angkwereye’=ye, nyarte.
\g this 3sg twice [Link]-RED(LINE 1)=EMPH this
\e It went around twice (singing) ‘Big sister, big sister’ (LINE 1).
…
\t 184.243
\k Him stand up here aylengke are-yayne.
\g stand up here 1duSMSG see-PST:CNT
\e ‘It stopped here and looked at us two!’
\t 184.243
\k Kwere=lke ampile-yayne, ‘Artwarlatyawerlewe’.
\g 3sgDAT=now follow-PST:CNT ‘devil=REL-1SG-hear-PST(LINE 2)
\e They followed the tracks (of) ‘I heard a devil (LINE 2)’.
\t 199.396
\k (singing): ‘Angkwereye angkwereye, Artwarlatyawerlewe’.
\g LINE 1 LINE 2
\e ‘Big sister, big sister, I heard a devil’
attention to the difference between song texts and speech words it would be difficult
to understand the meaning of the spoken text.
At this early stage in transcription, it is a good idea to label songs sequentially
rather than by name, unless you already know what distinguishes one song from
another. For example, you may think you have two different songs when in fact it
is the same song simply commencing at a different place (think how some people
may begin a folk song with the chorus while others with the verse). Keep an open
mind about what distinguishes one song from another within the culture; it may
not be in features of the text, but rather the music, meter or performance context.
Be careful when attributing songs to a single genre (e.g. wedding song, funeral
song). In some cases a song, as a musical-linguistic form, may not belong uniquely
to a single genre. In one context it may be a wedding song, but in another context
a healing song.
Verbal arts are often delivered in a way that differs from speech (e.g. sung,
whispered, chanted) and so it is a good idea to keep the transcription of the
performed song separate from the transcription of how people explain or ‘speak’
it. It is not uncommon for these two versions to differ, as vowels may need to be
270
modified for the sung voice to carry at certain pitches and syllables may need
to be substituted, added or deleted to adhere to rhyme or other types of poetic
form. In some cultures, the text is rarely uttered outside the ritual context, and
so it may be edited, contracted or modified in various ways in exegeses. Patterns
unique to song, such as line structure, repetition and sound patterning may only
be apparent in the sung form. To summarise, when working on song you will be
working on two forms: the sung form and the explanation or spoken version,
both of which may have multiple versions and both of which should be tran-
scribed and translated.
must … regard the first draft of a Beethoven melody as the original and all sub-
sequent developments as corruptions, including of course, the melody in its final
and published state”.
Figure 11.2
A transcription of a line of song in modern western music notation of rhythm alone, and the same
material represented by a metrical grid
Source: Hayes & Kaun, 1996
Analysis of songs requires looking for salient patterns, which may be pervasive
and obligatory, or occasional and incidental (Epps, Webster, & Woodbury, 2018).
Patterns –whether of sound, grammar or meaning –are often organised around the
line. Hymes (2003, p. 11) states that “presentation in terms of lines and verses makes
visible the shaping artistry of narrators”, noting that such presentation makes it far
more possible to perceive repetition, parallelism, alliteration and meter (the rhyth-
mic properties of song). For example, lines in English folk songs often have a set
number of beats, with beats most often filled by a stressed syllable. This is illustrated
in the four 4-beat lines of ‘Mary had a little lamb’ below, where the beat is filled by a
stressed syllable (for example in line 1: MA-ry HAD a LI-ttle LAMB). Note that the
dash represents an empty beat:
B B B B
Mary had a little lamb
B B B B
Its fleece was white as snow –
B B B B
And Every where that Mary went
B B B B
The lamb was sure to go –
There may also be rules about where a word boundary must occur. For example, in
the French Alexandrine meter this occurs after the sixth syllable in the 12-syllable
line. Other poetic traditions constrain not so much the number but the ways in
which syllables or stress etc. are positioned in the line. In Tohono O’odham songs (a
Native American language and people of the Sonoran Desert) adjacent or line-final
stress is not permitted and so the songs systematically manipulate the morphol-
ogy to avoid this (Fitzgerald, 1998). Songs can also be analysed for their meaning;
although the communicative function of language is often downplayed in song. It is
common for songs to have multiple meanings, and to be semantically ambiguous.
We discuss this further in the next section.
Navajo culture, where the researcher found that asking a Navajo speaker what
they thought the poet intended was often met with silence. The researcher “soon
realized that, for Navajos, direct speculation about what was going on in other
people’s minds was highly culturally inappropriate, and was in fact associated
with ‘witchcraft’ ”.
Another common feature in songs is the use of words or phrases with multi-
ple or deliberately broad meanings. Many forms of verbal art also have broader
meanings and significances that are not derived from their linguistic content. In
such cases, a glossed and translated text may remain opaque unless additional
exegeses or explanations of the cultural knowledge or logic behind such texts are
included.
A further issue is how to bring out the aesthetic qualities of the texts and pre-
sent them to a wider audience. For example, how might you translate vocables or
euphonics, such as “lalala” (often glossed voc), archaic or special song words; and
changes in linguistic variety or register, which may represent particular characters
or places? You might also consider how poetic structures such as metaphor, rhyme,
alliteration, parallelism and puns might be represented. For songs that are viewed
as enacting particular events, such as making rain, healing the sick or other types of
performatives (think of the pronouncement of marriage), you might also consider
how these functions are to be communicated. Representing such poetic features in
the language of another culture can require creative thinking, so an explanation of
how the translator arrived at their translations can be highly insightful. If it is pos-
sible to collect multiple translations of a single song this can be highly revealing
(Mitchell & Webster, 2011). The three translations of the Navajo poem by Rex Lee
Jim are from Webster (2017, p. 178); and they demonstrate the sonic and semantic
resonances of words and poetry and the insights gained from multiple translations
(Epps, Webster, & Woodbury, 2018):
In this section we discuss ways to manage and compare a corpus of verbal art, draw-
ing on the methodology described by Turpin & Henderson (2015) and Barwick
(2006). In terms of the recordings, it is useful to mark up each song in a perfor-
mance. This can be done with a separate tier in ELAN (§4.3.2) or in a sound editing
program and exported as individual song files and assembled in an iTunes database.
The latter has the advantage of enabling songs to be easily retrieved, played and
275
Figure 11.3
File hierarchy of the archival recording JS01_[Link] transcribed and segmented into sequential
song items (JS01_153-01.mp3 etc.). The original audio file is JS01_[Link] and the transcription of this
audio is in various formats –.eaf, .trs, .pfsx and .txt (Illustration: Maxine Addinsall)
Putting extracted songs into your local iTunes database is a useful way of manag-
ing recordings on your computer. This widely used software keeps songs and their
metadata together through the iTunes [Link] file, which iTunes uses to organise
the songs in your library and the metadata that you’ve created. As the song is being
played, information provided by tradition bearers can be entered directly into the
iTunes database. The iTunes metadata can also be exported as a catalogue by select-
ing all the songs and copying and pasting into a blank Excel spreadsheet. It is also
important to back up the metadata in a spreadsheet regularly in the advent of an
iTunes database malfunction.
You can create an iTunes playlist for playback in the field for analysis of the songs,
and as a prompt for discussion. Playlists can be loaded onto a portable music player
connected to high quality portable speakers via a 1/8-inch cable. Make sure to
record the playback of the songs as well as the speaker’s explanation of the songs to
ensure the song and the explanation stay together.
276
Performers and other community members no doubt will want copies of a per-
formance or collections of songs, such as all the songs sung by person X, or all
the songs of repertoire X. There are multiple ways of returning recordings to
the community. You may provide physical copies on CD, DVD or USB sticks,
or if the community has internet access and approves, media can be uploaded
to Facebook or YouTube for broader access. In iTunes, playlists can be created
which can be burnt to CD, transferred to a USB stick or uploaded onto the
internet.
Stephen Morey works on endangered languages on the India-Myanmar border
and has recorded many songs performed by native speakers in numerous languages.
When one of the major speakers passes away, he puts one of their songs or stories
on Facebook as a tribute. Family members express their appreciation for the public
recognition of these people and their skills.
Making copies of videos for the community takes more time than audio, but of
course the reward is so much more for the users. It may not be possible to do this
whilst in the field. In general, leave copies of new audio recordings with the relevant
community members before leaving the field, and on subsequent fieldtrips bring
back edited video. A summary of the software useful for working on verbal arts is
listed below.
• For transcribing: ELAN and/or Transcriber (see §4.3.2) software
• For segmenting: Audacity free software (or Sound Studio), LAME MP3
codec7
• For analysis: Transcribe!8
• For managing files and export: iTunes.
27
The community may want to publish recordings of their songs or other verbal art.
This can help the community promote their songs, develop a public profile for per-
formers and provide an opportunity to acknowledge and protect the rights of both
the creators and performers. Having the audio and/or video of songs available also
adds value to any text publications of songs you compile. Few published ethno-
graphic field recordings make money, so it would be unwise to present this as an
economic opportunity. In this section we discuss the issues and steps involved in
publishing songs.
If the community and you wish to make a quality multimedia package, you will
need to work with a sound engineer and/or producer and designer. You will also
need to ensure appropriate legal and financial arrangements are in place to protect
copyright and other intellectual property rights (§2.4.4). For wide distribution, both
physical and digital publishing (e.g. iTunes, Spotify) options are important. For all
these reasons, it is a good idea to engage a record label, especially one that special-
ises in ethnographic or Indigenous music (for example, Smithsonian Folkways in
the US and Skinnyfish Music or CAAMA Music in Australia). Ethnomusicologist
Steven Feld’s album of Kaluli songs is one such example, where he teamed up with
musician Mickey Hart and the Rykodisc record label. Talk to the record label as
early as possible in the project; they may even collaborate with you to seek funds to
produce a quality recording.
When I went to Papua New Guinea in 1975 with that stereo Nagra,
anthropologists –certainly anthropology doctoral students –were writ-
ing ethnographic monographs as a principal standard for representation.
Recording for radio broadcast or LP production was understood to be
totally secondary or superfluous in terms of serious ethnographic repre-
sentation. I wanted to buck that trend. My first New Guinea publication
was also the inaugural LP in the Institute of New Guinea Studies series.
I wanted to accomplish two things: to herald in sound the publication of
my ethnographic monograph, Sound and Sentiment, which came out the
next year, and to honor a commitment to helping Papua New Guinea’s
newly independent government and research institute start an LP record
series to encourage researchers to make field-recordings into circulatable
representations of cultural history.
(Steven Feld interviewed by Angus Carlyle –Lane & Carlyle, 2013)
For academic publishing of songs there are a number of options for including
audio or video with the analysis of songs:
• CD/DVDs with detailed sleeve notes or web delivery (e.g. iTunes)
• uploading audio/video files to a website to accompany a publication
• embedding audio/video into a digital pdf publication (e.g. Adobe Acrobat)
• embedding audio into a hardcopy publication using QR codes or
Soundprinting.
New possibilities will continue to emerge as technology changes. No matter which
format is chosen, it is crucial to obtain permissions from the copyright holders of
songs to reproduce their songs; and evidence. Evidence of this is usually required
by the publisher. Unlike speech, songs are often considered creative works and are
often owned by individuals or groups of individuals. Under international copyright
law, songs, poetry, music and dances are defined as ‘works’ and so researchers have
a responsibility “to acknowledge the moral and legal rights of musicians and per-
formers under traditional and international law” (Barwick, 2012, p. 172). In addi-
tion to the intellectual property rights discussed in §2.4.4, many countries also have
specific creative arts laws that aim to protect singers and other artists.9 Local views
of song ownership, which may or may not be formalised, may be quite different. It
is a good idea to include this in publications of songs, such as the following from
the book Jardiwanpa yawulyu: Warlpiri women’s songs from Yuendumu (Gallagher
et al., 2014, p. 2):
The songs, stories and images in this book contain traditional knowledge of the Warlpiri
people, and have been presented and published with the consent of the knowledge
custodians. Dealing with any part of the knowledge for any purpose that has not been
authorised may breach the customary laws of the Warlpiri people, and may also breach
copyright and moral rights under the Copyright Act 1968 (Australian Commonwealth).
Please contact the publisher for further details.
Commercial release of albums can be a good idea for wide access; however, there is
always the potential for third parties to use commercial releases in ways that violate
the performer’s legal and moral rights, or represent the performers and their com-
munity in ways that the community and performers do not like, and no doubt many
other ways yet to be imagined. The community and researcher can remain power-
less against large commercial companies who wish to use recordings in such ways
(Feld, 2000; Seeger, 2004).
Some tradition bearers may feel it inappropriate to sell recordings of verbal
art and decide against royalties (even though royalties are usually small). In the
279
records are even greater. In some cases, archiving is at odds with traditional trans-
mission processes, as you can read in the Tai Ahom case study.
As songs are often regarded as having cultural and symbolic capital and metaphysi-
cal power, custodians may choose to restrict access to song recordings whereas they
might not want to do so for speech. There may be concerns over potential unintended
usage of song recordings, including the possibility of causing physical damage in the
world where songs have metaphysical power. Morey (pers. comm.) recalls that after
the Wihu festival in 2008 in northern India, the couple who performed a traditional
welcome to their house came to see him the next morning quite worried that he might
have suffered ill effects of the songs which have real power. A lack of understanding
about the archive and how access conditions are adhered to might also lead custodi-
ans to err on the side of caution, with more restrictive access conditions.
Recordings can be used as a resource for learning; while this is seen as a good
thing if in the hands of the right people, the community may not feel they can
control this. It is very easy, for example, for a USB to be lost, copied or stolen, and
if the recordings are online, one cannot control who else may be sitting beside a
person who has legitimate access. In some cultures learning songs is not a right, but
rather something one must prove oneself worthy of, and it is difficult to reflect these
281
conditions in the access instructions for records in the archives. This is currently
not a practicality that the archive can support. As a researcher, however, by facilitat-
ing a more direct link between the archives and the community, issues of commu-
nity access and use may be improved over time.
would like to record them (if they don’t want to be recorded just use a pen and
paper). Ask if they can recall any songs or poetry from their childhood. Did they
sing any nursery rhymes, lullabies, church songs or love songs? You should have
a few exemplars up your sleeve to try to trigger their memory. Try to write them
down, noting any explanations they give. Ask about any meanings or associa-
tions they have. Do they remind them of a place, people and times? How did
they learn the songs, did they pass them on? If recording, ask if they would like
you to play the recording back to them. Does this elicit any further comments?
Note any unclear words or meanings. How different are the songs to speech?
Transcribe the song/poem(s) text. You might like to try the rhythm in either
a metrical grid or musical notation too. What is the relationship between the
stressed syllables and the strong positions of the meter? What is the relationship
between word boundaries (prosodic, lexical, morphological) and boundaries of
the poetic form, e.g. line, verse, half-line (also called ‘hemistich’)?
11.15 SUMMARY
In this chapter we aimed to encourage field linguists to pay attention to verbal arts,
such as poetry and song. We outlined the benefits of this and suggested a methodol-
ogy for working on verbal arts, managing recordings and disseminating them in the
community. We discussed differences between the form and function of verbal arts
and speech, and considered how verbal art can be considered as having multiple
layers of translations. We also discussed issues to consider when publishing and
archiving songs and suggested intercultural exchanges as an exciting context for
disseminating and promoting fieldwork that benefits the community.
For a discussion of the importance of verbal art to linguistic fieldwork see Epps,
Webster & Woodbury (2018), and for tools and techniques for recording verbal arts
see Barwick (2012). For a particularly African focus on fieldwork on song from a
musicological perspective see Wasamba (2015). For an introduction to the formal
properties of verbal arts see Fabb (1997).
NOTES
5 [Link]
6 [Link]/.
7 [Link] Accessed 27 August 2017.
8 [Link] Accessed 27 August 2017.
9 Despite this, there are instances of laws being broken and communities not having the
power or money to afford legal action to assert their right (Feld, 2000).
REFERENCES
Hayes, B. (2009). Textsetting as constraint conflict. In J.- L. Aroui & A. Arleo (Eds.),
Towards a typology of poetic forms: From language to metrics and beyond (pp. 43–61).
Amsterdam: John Benjamins.
Hayes, B., & Kaun, A. (1996). The role of phonological phrasing in sung and chanted verse.
Linguistic Review, 13, 243–303.
Hymes, D. (2003). Now I know only that far: Essays in ethnopoetics. Lincoln, NE: University
of Nebraska Press.
Ingram, C. (2013). Understanding musical participation: “Listening” participants and big
song singers in Kam villages, southwestern China. In C. Ingram & I. Russell (Eds.),
Taking part in music: Case studies in ethnomusicology (pp. 53–68). Aberdeen: Aberdeen
University Press.
Jakobson, R. (1960). Closing statement: Linguistics and poetics. In T. A. Sebeok (Ed.), Style
in language (pp. 350–377). Cambridge, MA: MIT Press.
Lane, C., & Carlyle, A. (2013). In the field: The art of field recording. London: Uniform Books.
Liberman, M., & Prince, A. (1977). On stress and linguistic rhythm. Linguistics Inquiry, 8,
249–336.
Marsh, K. (2008). The musical playground: Global tradition and change in children’s songs and
games. New York, NY: Oxford University Press.
Mitchell, B., & Webster, A. K. (2011). “We don’t know what we become”: Navajo ethnopoetics
and an expressive feature in a poem by Rex Lee Jim. Anthropological Linguistics, 53(3),
259–286.
Morey, S. (2012). Documentation of traditional songs and ritual texts: Issues for archiving.
In N. Thieberger, L. Barwick, R. Billington, & J. Vaughan (Eds.), Sustainable data from
digital research (pp. 119–136). Melbourne: University of Melbourne.
Morey, S., & Schöpf, J. (2011). Tone in speech and singing: A field experiment to research
their relation in endangered languages of north east India. Language Documentation and
Description, 10, 37–60.
Perkins, R. (2016). Songs to live by. The Monthly, 124, 30–35.
Rumsey, A. (2011). Style, plot, and character in Tom Yaya Tales from Ku Waru. In A. Rumsey
& D. Niles (Eds.), Sung tales from the Papua New Guinea highlands studies in form, mean-
ing, and sociocultural context. Canberra: Australian National University ePress.
Russell, I., & Ingram, C. (2013). Taking part in music: Case studies in ethnomusicology.
Aberdeen: Aberdeen University Press.
Seeger, A. (2004). New technology requires new collaborations: Changing ourselves to better
shape the future. Musicology Australia, 27(1), 94–110.
Sharp, C. (1965). English folk song: Some conclusions. Belmont, CA: Wadsworth.
Sharp, C. (1973). English folk songs from the Southern Appalachians. London: Oxford
University Press.
Turpin, M., & Henderson, L. (2015). Tools for analyzing verbal art in the field. Language
Documentation and Description, 9, 89–109.
Turpin, M., & Stebbins, T. (2010). The language of song: Some recent approaches in descrip-
tion and analysis. Australian Journal of Linguistics, 30(1), 1–17.
Wasamba, P. (2015). Contemporary oral literature fieldwork: A researcher’s guide. Nairobi:
University of Nairobi Press.
Webster, A. K. (2017). “So it’s got three meanings dil dil”: Seductive ideophony and the
sounds of Navajo poetry. Canadian Journal of Linguistics, 62(2), 173–195.
285
12
A final word
Nicholas Evans, a linguist who has worked extensively with speakers of endangered
languages in northern Australia and Papua New Guinea, describes the urgency and
importance of the task of field linguists.
(In) 2003, I attended the funeral of Charlie Wardaga, my teacher, friend, and classifica-
tory elder brother … The book and volume of his brain had been the last to hold several
languages of the region: Ilgar, which is the language of his own Mangalara clan, but also
Garig, Manangkardi and Marrku … Although we had managed to transfer a small frac-
tion of this knowledge into a more durable form before he died, as recordings and field-
notes, our work had begun too late … For his children and other clan members, the loss
of such a knowledgeable senior relative took away their last chance of learning their own
language and the full tribal knowledge that it communicated…
(Evans, 2011, p. xvii)
Many of us who work in the field are familiar with these heart-wrenching stories,
whether they concern our own families, or those of whom we have come to respect
and admire in the processes of documenting other people’s languages from an ‘out-
sider’, yet potentially honorary ‘insider’ perspective. We hope that this book equips
all researchers with the necessary skills, and also a solid grounding in the ethical
dimensions of following the fieldwork path, which at times can be a challenging
track to pursue. We also hope that we have reflected some of the enthusiasm and
excitement that can come from learning new languages, meeting new people and
in the end making some contribution to counter the catastrophic loss of language
diversity the world is facing. And maybe discovering new words, previously unat-
tested phonological phenomena, grammatical structures and new signs for things
along the way!
Across the globe there are many people wanting to learn and engage with their
heritage languages in meaningful ways. What this loss of language diversity means
to language communities is ultimately not for linguists to determine, but solid
research practices and transferring research materials to “durable form” can mean
that the doors are not shut before people who speak endangered languages have had
a chance to form their own opinions and determine their own strategies for protect-
ing their cultural heritage.
In spite of loss, the diversity of the world’s languages is truly breath-taking. The
case studies discussed in this textbook are just a sample of the many hundreds of
projects currently underway across the globe. There is no shortage of projects yet to
be undertaken, languages to document, and research questions to be answered about
the uniquely human capacity for language. Capturing the breadth and complexity
DOI: 10.4324/9780203701294-12
This chapter has been made available under a CC-BY-NC-ND license.
286
of the world’s languages often means undertaking field research in remote regions
and adopting the highest standards in data collection and curation. Although field
research also takes place in urban settings, the aim of this book has been to ensure
that students undertaking research on endangered languages in remote areas or
in countries which have undergone European expansion are well prepared for the
challenges of fieldwork in these complex settings.
This book has been designed to train prospective field linguists in methods to
record, annotate and archive quality linguistic data in an ethical and responsible
manner. This data is crucial for the development of audio-visual language cor-
pora which form the basis of many outputs standard to documentation projects,
such as grammars, dictionaries and text collections, as well as being a crucial
contribution to community efforts to maintain or revive their languages. They
consist of examples of language-in-use across a range of communicative con-
texts and genres, including narratives, elicitation sequences, procedural texts
and conversation.
As well as corpus development, different subfields of linguistics, e.g. phonology,
morpho-syntax and semantics, all require different types of data for answering
research questions specific to these subfields. We discuss specific elicitation tech-
niques including (semi-)experimental tasks that are required to delve deeper into
these subfields, and the equipment set-ups and annotation standards required to
elicit this data. Other subfields require even more specialised methods and ethical
considerations, for example gesture and sign language, language acquisition, con-
tact languages and the verbal arts. Suggestions for how to get started in designing
field projects are detailed in these chapters, and in doing so, this textbook takes the
students beyond the fundamentals of documentation and into more sophisticated
tools and methods for these advanced topics.
We have presented a broad view of language which extends beyond speech
into the multimodal, performative and aesthetic dimensions of communication.
We have encouraged students to adopt a community-orientated approach which
aims to build collaborative language teams. In addition to the more standard
linguistic methods, we hope to have encouraged students to reach out to other
disciplines –anthropology, musicology, history and scientific subfields such as
biology –to provide depth to their documentation projects. Of course not all of
this can be achieved in the short time-span of a PhD or small documentation
grant, but we hope to have inspired students to imagine their practice as field
linguists as involving long-term collaborative relationships with speech commu-
nities. For the fieldworker who does not aspire to a long-term relationship with
a given speech community, we press upon you the importance of making your
research data accessible through appropriate archiving and by working towards
community publications as part of your responsibilities to the communities you
work with and for.
Like the resilient shoots of the bush potato, a plant that thrives on disturbed
ground and engenders new plants that flourish on the surface, working on
287
REFERENCE
Evans, N. (2011). Dying words: Endangered languages and what they have to tell us. Malden, MA:
John Wiley & Sons.
28
289
Appendices
290
Map 1
Map of locations of major languages referred to in this book (Map: Brenda Thornley)
26
44 23
20
22
16 24 21
33
40 25
37
35
18 19 17 30 36 39
38
32 29 28
34 31
15 43 27
1,2,4 5 11 41
13 3 6,7,8
9,10 14
12
42
Chapter 1 Introduction
Exercise 1 Discuss why you think the following are or aren’t examples of fieldwork?
Just because you contract an exotic tropical disease doesn’t mean you are conducting
fieldwork!
• You are a phonetician and accompany another linguist to the field and
spend a week making enough recordings for your thesis.
This is considered fieldwork because it is recording language in its speech community.
Bear in mind though that you may not gain much knowledge of the language and its
community in such a short trip and follow the more seasoned field linguist’s lead in the
field in terms of behaviour and protocols.
• You hear that speakers of an endangered language from western China are
visiting the anthropology department and you conduct a number of psycho-
linguistic tasks in the lab while they are there.
This is not typical fieldwork because it is recording language outside the place where it
is spoken on a day-to-day basis; and because you are working and living in your usual
location. It would also be unlikely that approval for such work could be organised in a
short time, which is not to say that such work might not be valuable.
transcription and translation, and also interpreting. The grandparents will form the
core of the team as speakers of the language. It is quite possible that they do not speak
the regionally dominant language and will be unlikely to be literate in either language.
This combination of language skills across the generations will form a solid team for
documenting the language.
in a footnote. This is something to negotiate of course. In the end, you might not agree
with the interpretation of the findings and want your contribution acknowledged, but
not to be named as an author.
which captures the context. Another common context where clipping of video images
occurs is in child language research where children walk in and out of the frame. This
is inevitable, so just make sure the child is wearing a radio microphone so that at least
their utterances are captured with consistent levels of audio.
texts into nicely formatted story books (easily done in Word or PowerPoint). Make
sure you laminate the pages and bind them with wire (not plastic) so they survive
weather and enthusiastic small children. Make class sets, not just individual copies
of books. Do not just print out ELAN or CLAN files as the formats are not useful or
aesthetically compelling to non-technical users. Put the recordings onto portable MP3
players and bring plenty of spare batteries (or rechargeable batteries and a solar set up
which you can leave at the school).
2.
p b t d ʈ ɽ k g
#__a 3 ɛ__i 1 #__u 6 n__i 5 #__æ: 4 a__a 3 #__aː 7 ŋ__aː 2, 9
#__o 12 u__u 6 #__iː 13 n__ɛ : 29 #__ɛ ː 11 æː__i 4 #__ĩ ː 10 u__u 27
#__ↄ 14 m__a 8 #__ɨ 19 #__ǝ ː̃ 18 ↄ__u 14 #__é_ 16 ŋ__ǝ 28
#__ǝ 23 m__u 15 #__ɑ
ː 22, 32 #__ɑ ː 25 é__i 16 i__aː 16 ŋ__ɛː 30
#__ǝ ː 24 eː__i 20 #__æː 17 ɨ__ǝ 19 #__ɑ ː 21 ɑː__ɯ 32
#__i 26 i__i 26 æː__ɯː 17 əː__ɯ 24 #__u 27
æː__æː 31 ǝ__ǝ 23 ɛː__ɛ 30
3. intervening glide or liquid between stop and vowel, e.g. pop vs plop, prop;
coop, cub vs cube; test, taste vs twist
4. complex coda e.g. cold /scold; tilt /stilt.
word EM NM NS Difference
* We use (N) as shorthand for New Missionary and New Secular orthographies; and (M) for Early
Missionary and New Missionary orthographies
29
2. Compare the orthographies and make a numbered list of (a) the differences
in consonants (b) the differences in vowels.
Chapter 6 Morpho-syntax
Exercise 1 Recording stories as a ‘newbie’ to the community
Imagine you only have a few more days in a speech community before you have to
head home. You have only been able to perform formal elicitation with the speak-
ers. You plan to return but the speakers are elderly and you are worried that the
elicitation you have done is not very dynamic. There is also a chance that your
301
recordings may be the final record of the language. How do you create situations
where speakers will tell stories?
You need to find out what is really important to the community and make a
concerted effort to record them e.g. sacred texts, historical narratives or proce-
dural descriptions of endangered cultural practices. Speakers might require the
right combination of people to be present for recordings, so take direction from
them about who to gather together. Often this process requires more time and
effort than the standard dyadic set-up (linguist-speaker) but the depth of cultural
knowledge and complexity of language will increase markedly. If cultural prac-
tices are the focus, again help gather the right people, tools, ingredients etc. (and
include this as a part of the documentation process –videos like these are useful to
playback and elicit voice overs for). This process might involve enlisting younger,
more able-bodied people who can assist the elderly speakers in the activity. Have
them instruct the younger ones in the procedure, and make sure you record the
entire process.
someone’s head, or (ii) animates acting on inanimates e.g. a man eating a coconut, or
equipollent examples i.e. animate > animate, inanimate > inanimate. Remember that
it might be humanness or sentience that is the relevant category so vary the actors you
use in elicitation. Try to avoid examples relating to violence, remembering that this is
a lasting record of the language.
• Variation in the use of a case-marker?
Generally conversation data. Variation in subject marking (e.g. optional ergativ-
ity) usually has a discourse component and probabilistic rather than categorical
statements about use in the grammar will be most appropriate. Variation in object
marking (e.g. differential object marking) is often related to definiteness or perfec-
tivity and is also often more categorical so you might have more luck with formal
elicitation.
Allative Goal to
As well as the literal meaning of the constellation names, you might include the con-
text that gives rise to the polysemy. Be aware though that some sorts of mythological
knowledge are restricted in some cultures, and so you should always check before mak-
ing this knowledge publicly available.
Explore the semantics of arguments that the verb can take (e.g. people, animates,
inanimates). With an animate subject, can a clause have two different meanings? For
example, “The boy is shy” or “The boy is cold”? If so, this suggests two related meanings
or monosemy. What happens when you combine different subjects, e.g. “The honey
and people shivered/went hard”. Is there a zeugma effect? If so, this suggests there are
two different meanings. (Zeugma is where a word applies to multiple words in the
sentence but with a different meaning, e.g. “You are free to execute your laws, and your
citizens, as you see fit”.)
actions (such as lying down) can utilise this frame. The definition also suggests that
there may be particular cultural practices associated with domestic spaces, for exam-
ple sleeping with one’s head to the east. Compare your own folk definitions. In your
language would you say something like, “Move the computer cursor a little to the
north?” or would you use terms like ‘left’, and ‘right’ instead?
\lx lexeme
\ps part of speech
\de definition
\xv example sentence (in vernacular language)
\xe example sentence translation (in English)
\sn sense number
\pl plural form
to familiarise students with the processes involved in structuring linguistic data for
outputs such as dictionaries.
point in one direction, and when they talk about ‘afternoon’ they point to another.
You are not sure whether this action is anchored to the body and thus shows a left/
right distinction or whether the speakers/signers are using of an absolute frame of
reference (i.e. fixed bearings such as north, south, east, west). How could you test
these possibilities further? What else should you look out for if you are working
inside a building?
One way to test whether the sign or gesture is anchored to fixed bearings, such as com-
pass directions, or to the person’s body (their left or right side, or even their front or back)
is to use the rotation paradigm, a method that has been used extensively to explore
different frames of reference in languages. You could try to elicit conversations about
times of the day with the participants facing in different directions on repeated occa-
sions. Although it may be possible to record naturalistic data, you would be very lucky
if there were enough fortuitous spontaneous examples that helped answer this question
(for example conversations about time while sitting in relaxed outdoors locations: on
one occasion facing north and on others facing south, or east etc.). The alternative is
a semi-experimental method where data is recorded in a fixed location, and yet the
spatial orientation of recording set-up is varied, or rotated from one session to the next.
If the actions are consistently deployed in the same direction, regardless of the direction
in which the participant is facing (for example when someone signs morning the sign
is directed to the east no matter which way they are facing), then this suggests that an
absolute frame is being employed. If the action was always directed towards the person’s
right or left-hand side, no matter which way they are facing, then this suggests that the
action is based on the body axis (or relative frame). It is very important that you keep
careful account of the direction that people are facing in recordings, and this becomes
part of your metadata. If you are working inside a building it would be relevant to note
whether a participant was directing an action in absolute space (perhaps to the east) or
whether they were actually pointing at a clock on the wall inside the room!
action alone, without supporting speech. Lastly it would be interesting to see if the
Ngaatjatjarra have any metalanguage to talk about different types of communicative
action – are there any Ngaatjatjarra words for ‘signing’, ‘gesturing’ or ‘pointing’?
researcher, or the archive, and (ii) metadata in archival deposits may not be detailed
enough and ‘future-proofed’ and so it might be very difficult to interpret the instruc-
tions, or find the people who have rights to the materials, (iii) the researcher who made
the recordings may have passed away.
Participants may change their minds over time for various reasons. If a participant
no longer wants their recordings to be available for research, talk to them about their
concerns. It might be something which can be resolved (e.g. they may be happy with
anonymised transcripts to continue to be used in research but not the recordings them-
selves). If the issue can’t be resolved, then you should update the consent information,
and if possible document the new access instructions.
You should inform the archive of these concerns, and not use this material in your
research.
they will work in the field, or what additional equipment you might need to make
them work?
‘Act out’ and ‘Picture-choice tasks’ are definitely possible in the field. ‘Act out’ in par-
ticular does not require technology, just props and a quiet place to work to help children
attend to the task. Note though that if you can’t pre-record the stimulus, it will require
a research assistant to say sentences in exactly the same way each time. ‘Picture-choice
tasks’ are best done using computers and again pre-recorded audio stimuli, but again
they can be undertaken using laminated cards and spoken renditions of stimuli. Tasks
such as ‘intermodal preferential looking paradigm’ are possible if the community has
electricity and a quiet indoor place to work in, and which can be stripped of other
distractions.
is always used when the subject is definite and the other construction is always used
when the subject is indefinite. If you are dealing with a single construction which
shows variation, the variation will usually be ‘probabilistic’ i.e. the variant will tend
to be used in one context and not in others, and is often driven by sociolinguistic
categories e.g. age or gender, or discourse factors, but also grammatical cues! If you
have a variable construction, find as many examples of it in the corpus as you can,
provide the basic percentage in the grammar and an overtly tentative explanation e.g.
“X construction is used 70 per cent (n=15) of the time and seems to be used by younger
speakers in past tense clauses”.
B B B B
There once was a línguist that’s Sweet Ø
B B B B
Who could place London speech within feet Ø
B B B B
This phi- lólogist tóld of Ger-mánic of old
B B B B
But left us befóre he could tweet Ø
The poem is a limerick and it has triple rhythm. Lines one, two and four rhyme and
have the same line final rhyme. The two hemistiches of line three also have a different
final rhyme. While the standard orthographic tradition puts the two hemistiches of
line three on separate lines, its meter is four lines of four beats.
again presumably sequential. Thus, unlike the file names in §4.2.1, there is no lan-
guage, date or linguistic type (e.g. sign) in this file name.
Appendix 3: Glossary
elicitation (formal) –the act of obtaining language data from another person
which involves the use of questionnaires, stimuli or translation equivalents.
encyclopaedic dictionary –a dictionary that includes extensive cultural and ency-
clopaedic information, discussing each headword in depth.
finder list –a section at the end of a bilingual dictionary that reverses the direction of
the languages. A finder list or ‘reversal’ assists a user in getting to the right part of the
main dictionary where they can find out more about a word and its meanings.
folk definition –a native speaker’s or popular explanation of the meaning of a word
or phrase.
folk taxonomy –a popular classification system.
gatekeeper organisation –an organisation that has authority over the area or peo-
ple where you wish to do fieldwork. These organisations protect the people they
represent (e.g. against fraudulent activities) and can offer advice, official approv-
als or endorsement of research.
grammaticality judgement –a native speaker’s judgement on the well-formedness
of an utterance. Grammaticality is a matter of linguistic intuition, and reflects the
innate linguistic competence of speakers.
homophone –words that sound the same but have different meanings, e.g. aloud
and allowed.
homonym – homophones that are spelt the same way, e.g. bank (river) and bank
(financial institution).
ideophone –a word that evokes an idea in sound (e.g. whoosh in English).
Ideophones are found in many of the world’s languages, although they are
claimed to be relatively uncommon in Western languages.
‘insider’ linguist –someone who is a native speaker of the language they study or
is from the community that speaks the language.
intellectual property rights –refers to a collection of rights including copyright,
patents and trademarks.
interoperability –the ability of different types of software to import or export into
formats used by other software.
language acquisition –the study of how children acquire language(s).
language description –a grammar (including phonology), text collection and dic-
tionary (or at least a wordlist).
language documentation –a record of the linguistic practices and traditions of a
speech community. Language documentation prioritises texts and a variety of
different types of speech.
language vitality –the extent that a language is used as a means of communication in
various social contexts. The most significant indicator of a language’s vitality is its daily
use in the home. A language with high vitality would be one that is used extensively
both inside and outside the home, by all generations, and for most, if not all, topics.
Leipzig Glossing Rules –standardised labels for interlinear morphological glosses.
lexicography –the craft of compiling, writing and editing dictionaries which
involves analysing and describing the semantic, syntagmatic and paradigmatic
relationships within the lexicon (of a language).
317
lingua franca –a language that is adopted as a common language between speakers
whose native languages are different.
metadata –is data about data. Metadata essentially answers the who, what, why,
where, when and how questions about a fieldwork ‘session’ and the audio-visual
or other items resulting from the fieldwork.
meter (poetic) –the rhythmic structure of a verse or lines in verse. Many verse
forms use a specific verse meter, or a set of meters.
metrical grid –a way of representing rhythm in speech, poetry and song. Syllables
are arranged along the bottom row and ‘x’ represents prominence; the higher the
column of ‘x’s the higher the prominence.
minimal pair –words that are identical except for one feature or sound.
mixed language –a language which is the result of extensive community bilingual-
ism and code-switching, often developed as an expression of an altered identity,
be it new, or differing significantly from an older identity.
Multi Dictionary Formatter –software designed by SIL International which con-
verts a well-structured dictionary database into a formatted dictionary.
nasal harmony –a phonological pattern where nasalisation extends over a sequence
of segments.
non-linguistic stimuli –images, sounds, smells or touch stimuli used to elicit
semantic, phonetic or morpho-syntactic contrasts.
oral literature –a broad term that covers the verbal arts as well as myths, leg-
ends, proverbs, riddles, tongue-twisters, word games, recitations and historical
narratives.
orthography –a spelling system used to write a language.
‘outsider’ linguist –someone who is not a native speaker of the language they study
and comes from a different culture.
participatory models of linguistic research –research that values everyone’s
unique contribution equally; including a community member’s unique knowl-
edge in knowing about their community, ways of working etc.
performance arts –verbal arts that may include non-linguistic aspects such as
dance, visual designs and ritual actions.
phonemic principle –a principle used in orthographic design that states there
should be a one-to-one correspondence between the orthographic and phonemic
representation of a word.
phonetic transcription –a transcription of linguistic sounds using the International
Phonetic Alphabet which includes phones as well as tone, stress, nasalisation as
well as other prosodic and suprasegmental features.
pidgin language –a non-native simplified colonial language (e.g. English, French,
Arabic) used as a lingua franca between a number of groups, which otherwise do
not have a shared language.
poetic function –the use of language for artistic affect that draws attention to its
own form. This contrasts with everyday speech which prioritises the communi-
cative function of language.
polysemy – the association of a word with two or more distinct meanings.
318
project information statement –an explanation about what the research project is.
proprietary –software which is developed by a commercial company. Typically the
source code for proprietary software is not made available for further develop-
ment by amateur programmers.
psycholinguistics – a branch of linguistics that is particularly concerned with
cognitive approaches to the study of languages and how they are acquired and
used.
recruitment –finding language speakers to work with. It is important to recruit
people in an ethical manner, e.g. without coercion or misinformation; and by
stating clearly your aims, what the work will involve and what people will and
won’t get out of it.
register –a variety of language used for a particular purpose or in a particular
social setting e.g. baby talk or mother-in-law language.
scansion –analysing a line of verse to determine its metrical properties.
segmental inventory –the chart of contrastive discrete sounds in a language, spe-
cifically the consonants and vowels.
semantic domain –an area of meaning and the words used to discuss it.
source language – in lexicography the source language is the language from which
the translation is made.
Swadesh list –a wordlist of 100 and 200 concepts that exist in most languages and
used for basic elicitation and comparison.
target word –the word that you would like the speaker to say.
target language – 1. the language being researched 2. in lexicography the target
language is the language of the translation, used to define lexical units in the
source language.
taxonomy –a network of superordinate and (generic)-subordinate relations. These
can be diagrammed as a tree with the generic represented with a higher node and
hypernyms connected to them by lines.
temporal arts –verbal arts that may include non-linguistic aspects such as dance,
visual designs and ritual actions.
text-setting –the way in which an oral or written text is aligned to a musical tune.
Traditional Knowledge (TK) licences –can be used separately or in combination
with CC licences to add conditions of use and information about how material
should be respectfully and ethically treated according to community expecta-
tions and obligations, i.e. moral rights.
transcription –the systematic representation of language in a written form. The
original language might be utterances (speech or sign language) or a pre-existing
text in another writing system.
verbal art –artistic forms of language including poetry, song, rhyming games, sung
tales, poems, ritual speech events, spells, advertising jingles, incantations, charms
and spells, children’s chants.
vernacular language –the native language spoken by a specific population,
which can be distinguished from a literary, national or standard variety of the
language.
319
version control –Making sure that multiple conflicting versions of a file are not
created, e.g. by several people working on it at the same time.
vocable –a sequence of speech sounds from a given language which form one or
more syllables but does not represent a word of the language.
wug word –a made up word that adheres to the sound system of a language and
is used to gauge linguistic awareness.
320
Appendix 4: Acronyms
Index
ablative 141; see also case American English 6, 97; Black American
absolute 196, 303, 305–306; see also frame of English 250
reference American Sign Language (ASL) 156
access: and archiving 279; of research analogue recordings 49–50, 56, 84, 89
materials 31–32; platforms 88–89; animacy 127, 132–133, 138, 143, 161
open 91–92 ANLA (Alaska Native Language Archive)
accommodation effects 124, 251–252 86, 315
accusative 126, 131–133, 139; see also case Anmatyerr 1, 166–169, 173–174, 204,
acknowledgement see authorship 207–208, 210, 289
acoustic properties 98–99, 107 annotation 18, 69, 79–83, 92; of sign languages
active see knowledge, voice and gesture 191, 193, 199, 211
actor 233; see also agent anonymity 35, 192–193, 205, 223; see also
Adamorobe Sign Language (AdaSL) 5, consent, ethics
191, 289 anthropology 8, 71, 264, 286
adaptors 56, 58, 60 antipassive see voice
adjectives 124–125, 143 ANVIL 199, 315
adpositions 140; see also prepositions; Appalachia 262
postpositions appropriation 32
adverbs 125, 134–135, 144, 247 Arabic 66, 79, 196, 240, 314
advisor 25, 36, 44, 60; see also supervisor Arandic languages 117, 123, 129, 173;
Afrikaans 82 see also Arrarnta, Arrernte, Anmatyerr,
agent 136, 255, 301; see also grammatical Alyawarr, Kaytetye
relations Archi 178–179, 289
agentivity 218–219; bias 218–219 archiving 83–92
agreement marking 125, 138, 143, 146, argument disambiguation 231, 233
149, 174 Arnhem Land 197
agreements see copyright; research agreements Arrarnta 117
Ahtna 86, 159 Arrernte 117, 185, 281
AIATSIS (Australian Institute of Aboriginal articles 125, 143
and Torres Strait Islander Studies) 87, articulation 98, 112–113; of sign and gesture
155, 315 63, 197–198
AILLA (Archive of the Indigenous Languages aspect 125, 133–135; see also Aktionsart
of Latin America) 86, 315 aspiration 96, 98, 106–107, 115, 296
Akan 191 astronomy 159, 180, 182
Aktionsart 133–134; see also aspect atelic see Aktionsart
Alaskan languages 86, 159, 170, 289, 302, 315 Athabaskan languages 133, 170
albums 277–279 Audacity 63, 275–276
Algonquian languages 133 audio connectors 54; equipment 50–55;
alienability see possession recording 61–65; stimuli 161
allative 139–140, 248, 301; see also case Auslan 197, 201, 203, 213, 289
alliteration 97, 261, 272, 274 Australian Aboriginal 87, 130, 224, 249, 263,
allophones 100, 102–105, 109, 296–298 265, 273; see also Indigenous peoples
alphabet 115; see also IPA, orthography; script authorship 38–40, 270, 279, 292
Alyawarr 1, 21, 87, 158, 289 auxiliary verbs 146, 244
32
322 Index
Index 323
324 Index
Index 325
lexicon 157, 160, 175, 238, 244, 263, 313; minimal: pairs 101–106, 126, 168, 314;
see also vocabulary triplets 101
Lexique Pro 175, 177 minority groups 4–5, 222, 224, 238, 250;
licences 32–33, 312, 315; Creative Commons see also Indigenous peoples
(CC) 33, 312; Traditional Knowledge (TK) Mirndi languages 141–143
33, 315; software 33 Miromaa Community Dictionary Maker 177
Light Warlpiri 221, 226, 229, 231–233, 289 missionaries 13–16, 26, 117, 147
limericks 272–273 Mississippi 136, 289
lingua franca 11, 37, 240–241, 313–314 mixed languages 231, 242–244, 248, 250–251,
linguist: ‘insider’ 2, 4–6, 7, 25, 123, 219, 230, 256–258, 314; see also contact languages
313; ‘outsider’ 2, 4–6, 16, 25, 123, 191, 220, modal verbs 146, 211
224, 249–250, 253, 314; see also gender Mohawk 3–4, 124, 289
linguistic description 8–9, 251, 313; mondegreens 273; see also lyrics
see also descriptive linguistics monitoring headphones 56
linguistic diversity 2, 10, 21, 89, 130, 187, monolingual fieldwork 11, 43, 123, 164
220, 285 monosemy 170, 303
Linnaean classification 153 morae 97, 263
list intonation 111 morphemes 97, 128, 145–146, 227
literacy 28, 37, 115–116, 173, 191–193, 195, morphological glosses 18, 79, 82, 201, 313
253, 256 morphology 108, 142, 174, 220, 272;
loan words 101, 242 derivational 145–146, 244; inflectional
locative 139–140, 195; see also case 145–146, 174, 217–218, 242, 252
longitudinal: corpus 218, 221, 225–230; ‘mother-in-law’ language 162; see also special
studies 216, 219, 225–226 registers
ludlings 97 Motu 14–15
luggage 42, 61 Mudburra 205, 215
lullabies 264, 282 Mueshaungx 97, 290
Luzon 167, 289 multilingualism 28, 112, 192, 236, 244
lyrics 268, 273; misheard 273 multimodal: communication 2, 31, 194, 286;
data 190, 205, 208
machine-readable 193, 203, 312 Mumbai deaf community 194
Mandarin 218; fieldwork manuals 44 Murrinhpatha 129, 216, 219–221, 223–231, 289
Māori 209 museums 25, 84
maps 43, 89, 174, 198, 301; sketch 69, 198, 301 musicology 21, 64, 261–263, 271, 286
Martha’s Vineyard Sign Language 186 mythology 155, 159, 314; see also religion
masculine see gender
Mauritian Creole 241–242; see also creole Nafaanra 176, 303
languages Nahuatl 179
Max Planck Institute (MPI) 80, 85, 113, 160, naming: files 73–75, 275; contact languages
195, 254 250; see also classification; taxonomies
mean length of utterance (MLU) 227, 230 narratives, 9, 17, 122, 195, 255–256, 253, 314
meaning see semantics narrators 39, 67–68, 200, 206, 255, 272
media: files 60, 73, 76–77, 82, 210; ‘objects’ 228 narrow transcription 97
Media Lengua 256; see also mixed languages nasalisation 98, 100, 104, 113, 314
melody 263–264, 271 nasals 98, 104, 109, 296; nasal harmony 113
memory cards 51, 57–58 native speaker intuitions 12, 97, 105–108, 263
metadata 75–78, 193, 264, 273, 275 natural world 20, 158–159
metaphors 87, 196, 263, 274 naturalistic methods 62, 91, 193, 225, 227
meter in poetry 263, 269, 272, 313 Navajo 168, 184, 274, 284
metrical grids 271–272, 314 Nen 13–15, 289
Michif 258; see also mixed languages Nepali 217–218, 225, 230, 289
microphones 50–56, 61–64, 112, 228, neuter see gender
266–267; cables 54; types of 52 New Zealand Sign Language (NZSL) 192,
migrants see urban fieldwork 209, 289
326
326 Index
Index 327
328 Index
telic 134; see also Aktionsart vehicles 41–42, 50, 59; see also transport
temporal arts 261, 315 verbal arts see poetry; song
tense 125–126, 133, 135, 145, 283; and verb classes see conjugation classes
language acquisition 217–218 verbs 125, 131, 133–137, 161, 168, 217–218;
Teop 172 transitivity 126, 129, 131, 136, 139, 219;
text-setting 263 see also complex predicates
Thrace Romani 242–243, 246, 252–253, verse 263, 272, 282, 313, 315
256–257, 289 video recording 65–68; equipment 55–57;
TLA (The Language Archive) 85, 220, 315–316 see also cameras
Tlingit 5, 159, 289 Vietnamese 142
Tohono O’odham 171, 272 visa 41–43
Tok Pisin 14–15; see also creole language visual prompts see non-linguistic stimuli
tokens 99 vocabulary: eliciting 154–160
tonal: categories 102, 107–108; contrasts 97, voice: active 36, 136–137; antipassive
106–107, 109, 111 136–137; middle 136; passive 11, 136, 139,
tonal languages 107, 118 147, 233, 235
tone 107–108, 115, 143, 263, 314 voicing 98, 100–102, 104, 106, 109, 258
Toolbox 117, 175, 177 vowels: phonological analysis 100–105; stress
topic see information structure 108–109; in song 269, 296–297
Torres Strait 87, 129, 237, 315 vulnerable populations 8, 35, 222;
trademark see intellectual property rights see also Indigenous peoples
Traditional Knowledge (TK) licences 33, 315
training: cross-cultural 43; linguistic 3, 5, Walmajarri 34
29–31, 191; literacy 13 Warlpiri 122–124, 139, 172, 219, 229, 231–233,
Transcriber 82, 276 278, 289; see also Light Warlpiri
transcribers 77, 225, 230 waveform 63, 80, 199, 271; analysis 98–99
transcription 97 weather: and fieldwork 42, 59–61; and
transfer 252 recording 229, 294
transitivity see verbs welfare policies 222
translation see back translation windshields 53–54, 63
translators 11, 77, 274; see also interpreters word classes 101, 124–125, 145–146;
transport 27, 42, 50–51; see also vehicles see also parts of speech
tripods 56–57, 66–67, 198 word order 112, 124, 131–132, 231–233,
Turkish 190, 242–243, 246, 252, 256–257 246, 257
typology 2, 160, 171, 189, 221 word boundaries 99, 102, 272, 282
Tzeltal 141, 219, 234 wordlists 154–155, 167
wug words 110, 230, 315
ultrasound 113
unicode fonts 100, 177 Xanthi 243
universities 29, 34, 84, 281 XLR 52, 54–56, 58, 68, 228; see also cables
unique identifier 73
urban fieldwork 6–7, 31, 188, 194, 245, Yélî Dnye 104–105, 141, 153, 156, 161, 289
286, 290 Yolŋu Matha 179
USB sticks 58, 90, 276, 280–281 youth see language acquisition
Yucatec Maya 196, 214, 289
vaccinations 41–42, 224; see also health Yucatec Maya Sign Language (YMSL)
variation: and contact languages 246–248, 196, 289
251, 253, 257; inter-speaker 2, 12, 157, 160; Yuman languages 132
and song 9, 261–282; cultural knowledge
31, 62; recording 63, 66; archiving 84, 86; zeugma 303
phonology 97 zooming 66