Getting around no text formatting (underlining, italics, etc.)
Up one level
Getting around no text formatting (underlining, italics, etc.)
I have a considerable amount of data transcribed in CHAT format I would like to convert to ELAN format. I have used the CLAN's chat2elan program and more the most part this works. However, my CLAN transcripts contain underlining (to mark stress), and these are not carried over. I read on the forum that ELAN does not allow underlining (or other formatting) within its transcriptions. Does anyone know of a work around for this? I would like to have this information available somehow. I was wondering if it was possible somehow to automatically convert this underlining to something which is allowed in ELAN (e.g. capitals). Any suggestions would be greatly appreciated.
Re: Getting around no text formatting (underlining, italics, etc.)
The best way to retain this information from the original files is if it is dealt with while converting to ELAN. So, you might send a request to the creators of chat2elan? Converting to uppercase might be an option, but adding an extra tier with annotations containing some code or symbol to mark stress would be more ELAN-like.
If this information cannot be retained and you have to mark stress manually in ELAN again, that would be the preferred way: add a depending tier and create annotations for stress (this might include that you have to create tiers for the word or even lower level first). This may be extra overhead but improves searchability etc.
-Han