During 2009-11 I wrote the Bangor Autoglosser to gloss the Bangor ESRC corpora of multilingual (Welsh, Spanish, English) conversational text. I’ve done a new version, Autoglosser2, that focusses on Welsh written text, and outputs CorCenCC tags as well as Bangor-type glosses. Speed has been greatly increased too, from 1,000 to 22,000 glossses/minute. You can test it online, but for detailed work it’s better to download and install locally. There’s also a detailed manual available. Lots of work to do on it still, but it’s pretty robust, and gives reasonably good results.
-
About me
- I'm Kevin Donnelly, and I live in Llanfairpwllgwyngylch gogerychwyrndrobwllllantysiliogogogoch. Most of my projects relate to linguistics in some form or other (largely Welsh in the past), or to stuff like audio, electronics, typesetting, etc that I find interesting, and that I can work with on GNU/Linux. You can contact me directly on my first name, plus dotmon, and then add a com at the end ...
-
You are currently browsing the archives for February, 2018.
-
Archives
- February 2018
- December 2017
- February 2016
- January 2016
- November 2015
- October 2015
- January 2014
- December 2013
- November 2013
- August 2013
- July 2013
- June 2013
- May 2013
- April 2013
- January 2013
- November 2012
- August 2012
- July 2012
- December 2011
- October 2011
- September 2011
- June 2011
- January 2011
- December 2010
- November 2010
- October 2010
- August 2010
- July 2010
- June 2010
- May 2010
- April 2010
- March 2010
- February 2010
- September 2007
- August 2007
- July 2007
- June 2007
- March 2007
- February 2007
- January 2007