French Learner Language Oral Corpora flloc
Home About us Team members Ground rules

The FLLOC project


Other resources

Search Site

Valid XHTML 1.0 Strict

Valid CSS!

Salford Corpus

Transcription conventions

This corpus had to be reformatted in order to adhere the the CHILDES transcription conventions. The following modifications were made to the original transcripts:

Text as one complete block

we had to split the original transcripts into utterances to adhere to CHILDES conventions. We created separate utterances for all main clauses, and split them at the following coordinating conjunctions:

We did not split them at parce que or other subordinating conjunctions and we kept embedded clauses.

Extra syllables

Words that have been pronounced with an extra syllable when compared to ‘usual’ pronunciation are marked like this in the trancripts:


Repetitions and Retracings

these are marked in the transcripts in the following way.

les [/] les petits hommes (repetition)

et puis le [//] les hommes (retracing - one word)

et puis <les hommes> [//] les garçons (retracing - more than one word)


Pauses have been marked in the transcripts where the recording was carried out alone in a booth. The following information is recorded: length of run, number of syllables and length of pause.

Length of run|no. of syllables #length of pause
[^2_04|7] #1_6

(# is the CHAT code for pauses)


Liaisons are marked with les petits [=l] hommes.

Filled Pauses

We used the standard CHAT filled pauses (see the CHILDES transcription conventions for details).

Reported Speech

Reported speech is coded with a a [=r] the reported speech is put in angled brackets < >
il dit <qu’est ce qu’il fait ici>a [=r]


(rire) as a comment is marked as [=! rire] in the original place in the transcript. If a comment is at the start of a line it is preceded by 0 e.g. 0[= ! rire].