Mass conversion of CHAT-files (to TextGrid)?
Up one level
Mass conversion of CHAT-files (to TextGrid)?
Hi,
I whant to convert allt the .cha-files I downloaded from the ESF Corpus made public through the MPI archive system. I have tried CLANS chat2praat program, but it fails. This is the log:
> dir *.cha
cha liean11b.1.cha liean12e.1.cha liean13g.1.cha
liean14a.1.cha liean14a.2.cha liean16a.1.cha liean16k.1.cha
liean17l.1.cha liean18a.1.cha liean18c.1.cha liean22a.1.cha
liean22e.1.cha liean22g.1.cha liean23c.1.cha liean24a.1.cha
liean24i.1.cha liean25a.1.cha liean25k.1.cha liean27l.1.cha
liean31a.1.cha liean31d.1.cha liean32a.1.cha liean32e.1.cha
liean32g.1.cha liean32h.1.cha liean34q.1.cha liean35j.1.cha
test.cha unixfile.cha
30 files, 0 directories
> chat2praat +e.wav liean11a.1.cha
chat2praat +e.wav liean11a.1.cha
Thu Aug 21 18:01:19 2008
chat2praat (28-Jul-2008) is conducting analyses on:
ALL speaker tiers
and those speakers' ALL dependent tiers
and ALL header tiers
****************************************
From file <liean11a.1.cha> to file <liean11a.1.textGrid>
*** File "liean11a.1.cha": line 16.
Illegal speaker character found: 15.
CURRENT OUTPUT FILE "liean11a.1.textGrid" IS INCOMPLETE.
Line 16 of the file is:
^U%snd: "liean11a.wav" 9383 13613^U
where ^U seems to be a control character that I don't know.
ELAN seems to be able to open the files fine and convert them to TextGrids. However, since there are a lot of cha-files, one would like to be able to batch-convert them. Does anyone know whether there are any stand-alone conversion tools available? Or maybe, what the ^U control sequence is so that I may remove it with tr or something like that.
I on a Mac OS X (Leopard) computer.
I would greatly appreciate any help I could get on this.
/Fredrik
/Fredrik
Re: Mass conversion of CHAT-files (to TextGrid)?
The "^U" character should be the unicode character "0015", or 0x0015, in Java code '\u0015'. If it is any other character then that could be the cause of the error.
Have you tried to remove that particular file to see if the other files can be converted?
I don't know of any tool that can do the conversion, batch-wise.
Does the chat2eaf command produce the same error? Otherwise this could be a first step. Indeed, ELAN doesn't offer this kind of batch conversion.
Han