2011-12-21 11:04  sloot

	* [r13756] NEWS, configure.ac: bumped version
	  somehow 0.4.8 was already released

2011-12-14 11:10  sloot

	* [r13698] src/ucto.cxx: be more informative

2011-12-06 16:44  sloot

	* [r13652] src/tokenize.cxx: libfolia API is a moving target

2011-12-06 15:06  sloot

	* [r13648] configure.ac, include/ucto/tokenize.h, src/tokenize.cxx:
	  adapted to major change in libfolia: AbstractElement ==>
	  FoliaElement

2011-12-06 15:05  sloot

	* [r13647] tests/testfolia.ok, tests/testfoliain.ok: adapted to new
	  folia

2011-11-28 15:31  sloot

	* [r13613] tests/testfoliain.ok: test results changed

2011-11-28 15:26  sloot

	* [r13612] tests/folia1.xml: stray "

2011-11-28 15:17  sloot

	* [r13611] tests/folia1.xml, tests/folia2.xml, tests/folia3.xml:
	  replaced \n wilt real newline character &#xA;

2011-11-28 15:06  sloot

	* [r13610] src/tokenize.cxx, src/ucto.cxx: attempt to fix -F. still
	  uncertain

2011-11-28 15:05  sloot

	* [r13609] tests/folia.xml, tests/folia1.xml, tests/folia2.xml,
	  tests/folia3.xml, tests/testfoliain, tests/testfoliain.ok: added
	  and changed -F tests
	  Still not sure if this is wahat we want

2011-11-28 14:46  mvgompel

	* [r13608] config/Makefile.am, config/tokconfig-nl-twitter: Added
	  specicial tokeniser configuration for Twitter (dutch)

2011-11-28 14:39  mvgompel

	* [r13607] config/tokconfig-nl, config/url.rule,
	  tests/smileys.nl.tok.V, tests/smileys.nl.txt,
	  tests/urls.nl.tok.V, tests/urls.nl.txt: Added URL-DOMAIN and
	  SMILEY detection (enabled only in tokconfig-nl for now)

2011-11-28 13:27  sloot

	* [r13605] tests/folia.xml, tests/testall, tests/testfoliain,
	  tests/testfoliain.ok: added simple test for the _F FoLiA XML
	  input option

2011-11-28 09:53  sloot

	* [r13601] docs/ucto.1, src/ucto.cxx: small change in usag()
	  Enhanced man page

2011-11-28 09:52  sloot

	* [r13600] tests/testoption1.ok, tests/testusage.ok: again output
	  was changed

2011-11-28 09:48  mvgompel

	* [r13599] src/tokenize.cxx: fix

2011-11-28 09:47  mvgompel

	* [r13598] src/tokenize.cxx: fixes

2011-11-28 09:37  sloot

	* [r13597] tests/testoption1.ok, tests/testusage.ok: output has
	  changed, so adapt test

2011-11-28 09:22  sloot

	* [r13596] src/tokenize.cxx: removed 2 compiler warnings
	  2 others still there: Proycon, what were you thinking?

2011-11-26 19:39  mvgompel

	* [r13587] include/ucto/tokenize.h, src/tokenize.cxx: completed
	  FoLiA input support (needs more testing though)

2011-11-26 10:42  mvgompel

	* [r13586] src/tokenize.cxx: tokenisation of sentences in existing
	  FoLiA now works

2011-11-25 22:17  mvgompel

	* [r13585] src/tokenize.cxx: fixes for FoLiA input support

2011-11-25 17:24  mvgompel

	* [r13583] include/ucto/tokenize.h, src/tokenize.cxx, src/ucto.cxx:
	  Attempting to add FoLiA read support to ucto.. not entirely
	  finished and tested yet!

2011-11-09 14:06  sloot

	* [r13536] src/tokenize.cxx: added some hacks to avoid a nasty ICU
	  bug.
	  see http://bugs.icu-project.org/trac/ticket/8824

2011-11-02 16:31  sloot

	* [r13511] configure.ac: fix

2011-11-02 16:22  sloot

	* [r13510] src/Makefile.am, src/tst.sh: more lost stuff

2011-11-02 16:16  sloot

	* [r13509] Makefile.am, tests/Makefile.am, tests/bug0083.nl.tok.V,
	  tests/bug0083.nl.txt, tests/test.de.txt, tests/test.en.txt,
	  tests/test.es.txt, tests/test.fr.txt, tests/test.pt.txt,
	  tests/tst.cfg, tests/tst.ok, tests/tst.txt: hmm lost a lot of
	  stuff in the merge :{

2011-11-02 15:44  sloot

	* [r13508] ., AUTHORS, Makefile.am, NEWS, config/Makefile.am,
	  config/tokconfig-en, config/tokconfig-es, config/tokconfig-fr,
	  config/tokconfig-generic, configure.ac, docs/ucto.1,
	  include/ucto/tokenize.h, include/ucto/unicode.h, src/Makefile.am,
	  src/tokenize.cxx, src/tst.sh, src/ucto.cxx, src/unicode.cxx,
	  tests, tests/Makefile.am, tests/abbreviations.nl.tok.V,
	  tests/abbreviations.nl.txt, tests/bug0048.nl.tok.V,
	  tests/bug0048.nl.txt, tests/bug0051.nl.tok.V,
	  tests/bug0051.nl.txt, tests/bug0052.nl.tok.V,
	  tests/bug0052.nl.txt, tests/bug0054.nl.tok.V,
	  tests/bug0054.nl.txt, tests/bug0063.nl.tok.V,
	  tests/bug0063.nl.txt, tests/bug0065.nl.tok.V,
	  tests/bug0065.nl.txt, tests/bug0065b.nl.tok.V,
	  tests/bug0065b.nl.txt, tests/bug0065c.nl.tok.V,
	  tests/bug0065c.nl.txt, tests/bug0067.nl.tok.V,
	  tests/bug0067.nl.txt, tests/bug0075.nl.tok.V,
	  tests/bug0075.nl.txt, tests/bug0083.nl.tok.V,
	  tests/bug0083.nl.txt, tests/datetime.nl.tok.V,
	  tests/datetime.nl.txt, tests/encoding.nl.tok.V,
	  tests/encoding.nl.txt, tests/encoding2.nl, tests/folia.txt,
	  tests/genitive.nl.tok.V, tests/genitive.nl.txt,
	  tests/invoer.rule, tests/ligaturen.nl.tok.V,
	  tests/ligaturen.nl.txt, tests/multisentenceoneline.nl.tok.V,
	  tests/multisentenceoneline.nl.txt, tests/munten.nl.tok.V,
	  tests/munten.nl.txt, tests/normalisation.nl.tok.V,
	  tests/normalisation.nl.txt, tests/nu.nl.tok.V, tests/nu.nl.txt,
	  tests/onesentencemultiline.nl.tok.V,
	  tests/onesentencemultiline.nl.txt, tests/parentheses.nl.tok.V,
	  tests/parentheses.nl.txt, tests/partest_crlf.nl.tok.V,
	  tests/partest_crlf.nl.txt, tests/passthru.txt,
	  tests/passthru2.txt, tests/qtest.nl, tests/qtest.nl.tok,
	  tests/qtest.nl.tok.V, tests/qtest.spec, tests/qtest.spec.tok,
	  tests/qtest.spec.tok.V, tests/quoteproblem.txt,
	  tests/quotetest_multiline.nl.tok.V,
	  tests/quotetest_multiline.nl.txt,
	  tests/quotetest_multisentence.nl.tok.V,
	  tests/quotetest_multisentence.nl.txt,
	  tests/quotetest_nested.nl.tok.V, tests/quotetest_nested.nl.txt,
	  tests/quotetest_nested2.nl.tok.V, tests/quotetest_nested2.nl.txt,
	  tests/quotetest_nosentence.nl.tok.V,
	  tests/quotetest_nosentence.nl.txt,
	  tests/quotetest_onesentence.nl.tok.V,
	  tests/quotetest_onesentence.nl.txt, tests/small.cfg,
	  tests/small.txt, tests/stuk1.cfg, tests/stuk2.cfg,
	  tests/test.de.txt, tests/test.en.txt, tests/test.es.txt,
	  tests/test.fr.txt, tests/test.pt.txt, tests/test.py,
	  tests/testencoding2, tests/testencoding2.ok, tests/testfolia,
	  tests/testfolia.ok, tests/testinclude, tests/testinclude.ok,
	  tests/testnormalisation, tests/testnormalisation.ok,
	  tests/testone, tests/testoption1.ok, tests/testoption2,
	  tests/testoption2.ok, tests/testpassthru, tests/testpassthru.ok,
	  tests/testusage.ok, tests/tst.cfg, tests/tst.ok, tests/tst.txt,
	  tests/units.nl.tok.V, tests/units.nl.txt, tests/urls.nl.tok.V,
	  tests/urls.nl.txt, ucto.pc.in: Merged the folia branch into ucto.
	  ucto now always requires libfolia

2011-10-25 14:21  sloot

	* [r13462] src/tokenize.cxx: some layout changed and slack removed

2011-10-25 12:47  sloot

	* [r13459] NEWS, src/tokenize.cxx: small changes

2011-10-24 13:41  sloot

	* [r13449] src/tokenize.cxx: made compiler happy.
	  Removed dead code

2011-10-04 08:31  mvgompel

	* [r13366] config/tokconfig-es: support for spanish ¿ and ¡

2011-10-04 08:07  mvgompel

	* [r13365] config/tokconfig-es: update

2011-09-30 10:29  mvgompel

	* [r13349] config/Makefile.am: update

2011-09-30 10:27  mvgompel

	* [r13348] config/tokconfig-generic: generic tokeniser
	  configuration, not for any particular language (not recommended)

2011-09-29 19:38  mvgompel

	* [r13347] config/tokconfig-en: English tokenisation: wo n't , do
	  n't

2011-09-29 19:25  mvgompel

	* [r13346] config/tokconfig-es: Added spanish configuration

2011-09-29 17:31  mvgompel

	* [r13345] config/tokconfig-fr: Added french abbreviations

2011-09-29 17:19  mvgompel

	* [r13344] config/tokconfig-fr: updates to french configuration

2011-09-29 16:12  mvgompel

	* [r13342] config/tokconfig-en, tests/test.en.txt: update English
	  configuration

2011-09-29 15:45  mvgompel

	* [r13341] tests/test.de.txt: correction

2011-09-29 15:45  mvgompel

	* [r13340] tests/test.de.txt: correction

2011-09-29 15:43  mvgompel

	* [r13339] tests/test.de.txt: generic german test

2011-09-29 15:12  mvgompel

	* [r13336] tests/test.pt.txt: correction

2011-09-29 15:11  mvgompel

	* [r13335] tests/test.pt.txt: corrections

2011-09-29 15:07  mvgompel

	* [r13334] tests/test.pt.txt: Generic portuguese test

2011-09-29 14:53  mvgompel

	* [r13333] tests/test.es.txt: Generic Spanish test

2011-09-29 14:36  mvgompel

	* [r13332] tests/test.fr.txt: more french

2011-09-29 14:29  mvgompel

	* [r13331] tests/test.fr.txt: Initial generic french test.

2011-09-29 14:06  mvgompel

	* [r13330] tests/test.en.txt: generic English test

2011-09-15 13:51  sloot

	* [r13189] tests/Makefile.am: last small oesp

2011-09-15 13:46  sloot

	* [r13188] Makefile.am, configure.ac, src/Makefile.am, src/tst.sh,
	  tests, tests/Makefile.am, tests/tst.cfg, tests/tst.ok,
	  tests/tst.txt: attempt to provide 'make check' and 'make
	  distcheck'

2011-08-31 08:23  sloot

	* [r13043] configure.ac: Bumped version after Release

2011-08-23 14:17  sloot

	* [r12956] tests/bug0083.nl.tok.V: attempt for more precise desired
	  version\

2011-08-23 14:09  sloot

	* [r12955] tests/bug0083.nl.tok.V, tests/bug0083.nl.txt: added an
	  other quoteing problem and the disered output

2011-08-23 12:33  sloot

	* [r12950] configure.ac: bumped version after release

2011-08-23 12:32  sloot

	* [r12949] NEWS: release!

2011-08-23 10:07  sloot

	* [r12942] src/tokenize.cxx: the API should have the same defaults
	  ofcourse

2011-08-23 09:37  sloot

	* [r12940] tests/test.py, tests/testfolia, tests/testoption1.ok,
	  tests/testusage.ok: adapted test to changed QuoteDetection
	  regime.
	  hardcoded -Q in test.py, arghh.

2011-08-23 08:52  sloot

	* [r12938] NEWS, docs/ucto.1, src/tokenize.cxx, src/ucto.cxx:
	  Reversed working of -Q
	  Updated man
	  more NEWS

2011-08-22 14:01  sloot

	* [r12931] include/ucto/tokenize.h, src/tokenize.cxx, src/ucto.cxx:
	  applied same changes to readsettings an init as in the libfolia
	  branch

2011-08-22 14:00  sloot

	* [r12930] tests/testconf1.ok, tests/testconf2.ok,
	  tests/testfiles1.ok, tests/testinclude.ok, tests/testlanguage.ok:
	  outpu is changed a little bit

2011-08-18 16:01  mvgompel

	* [r12909] NEWS, src/tokenize.cxx, tests/quoteproblem.txt,
	  tests/quotetest_nested2.nl.tok.V, tests/quotetest_nested2.nl.txt:
	  Fixed nested quote issue (unbalanced quote/sentence issue) and
	  build in extra protections. Incl. test

2011-07-28 10:35  mvgompel

	* [r12749] tests/abbreviations.nl.tok.V, tests/bug0048.nl.tok.V,
	  tests/bug0051.nl.tok.V, tests/bug0052.nl.tok.V,
	  tests/bug0054.nl.tok.V, tests/bug0063.nl.tok.V,
	  tests/bug0065.nl.tok.V, tests/bug0065b.nl.tok.V,
	  tests/bug0065c.nl.tok.V, tests/bug0067.nl.tok.V,
	  tests/bug0075.nl.tok.V, tests/datetime.nl.tok.V,
	  tests/encoding.nl.tok.V, tests/genitive.nl.tok.V,
	  tests/ligaturen.nl.tok.V, tests/multisentenceoneline.nl.tok.V,
	  tests/normalisation.nl.tok.V, tests/nu.nl.tok.V,
	  tests/onesentencemultiline.nl.tok.V, tests/parentheses.nl.tok.V,
	  tests/quotetest_multiline.nl.tok.V,
	  tests/quotetest_multisentence.nl.tok.V,
	  tests/quotetest_nested.nl.tok.V,
	  tests/quotetest_nosentence.nl.tok.V,
	  tests/quotetest_onesentence.nl.tok.V, tests/test.nl.tok.V,
	  tests/urls.nl.tok.V: fixed ucto tests \o/

2011-06-07 14:59  mvgompel

	* [r10447] tests/quoteproblem.txt: added

2011-05-24 13:42  sloot

	* [r10238] NEWS: updated NEWS

2011-05-24 13:33  sloot

	* [r10237] tests/testall, tests/testfolia.ok, tests/testone,
	  tests/testoption1.ok, tests/testpassthru, tests/testpassthru.ok,
	  tests/testusage.ok: added and improved tests for passthru
	  updated a few .ok files.
	  
	  I hope proycon fixes all .V.tok files

2011-05-24 13:17  sloot

	* [r10236] src/tokenize.cxx, src/ucto.cxx: fixed encoding problem
	  with passthru mode.
	  Added --passthru option to usage()

2011-05-24 09:24  sloot

	* [r10223] src/tokenize.cxx: attempt to handle Windows™ files with
	  \r\n.
	  Just strip the last \r

2011-05-24 08:38  sloot

	* [r10220] tests/partest_crlf.nl.txt: converted the file to CRLF
	  and set the svn:eol-style property to CRLF

2011-05-24 07:47  mvgompel

	* [r10214] tests/partest_crlf.nl.tok.V, tests/partest_crlf.nl.txt:
	  added test

2011-05-23 21:40  mvgompel

	* [r10209] src/tokenize.cxx: I think I fixed bug77 (the NOSPACE
	  bug), but now all tests fail (understandably), they will have to
	  be checked thoroughly checked and updated.

2011-05-23 07:49  sloot

	* [r10156] tests/folia.txt, tests/testall, tests/testfolia,
	  tests/testfolia.ok: added a simple FoLiA test

2011-05-20 07:42  mvgompel

	* [r10121] NEWS, src/tokenize.cxx: Fix: proper XML entities in
	  FoLiA output

2011-05-17 07:59  sloot

	* [r10015] configure.ac: bumped version after release

2011-05-17 07:53  sloot

	* [r10013] NEWS: set date for release

2011-05-16 10:16  sloot

	* [r9980] NEWS: updated NEWS too

2011-05-16 10:15  sloot

	* [r9979] src/tokenize.cxx: avoid crash when no new tokens are
	  found because of emptyp words
	  (this might happen on Mac and DOS files with \r line separators )

2011-05-11 15:08  sloot

	* [r9871] src/tokenize.cxx: ok, this is the ugly code :{

2011-05-11 14:59  sloot

	* [r9867] tests/testpassthru.ok: sort of fixed <utt> handling in
	  passthru
	  code gets ugglier every day ;{

2011-05-11 14:55  sloot

	* [r9866] tests/passthru2.txt, tests/testpassthru,
	  tests/testpassthru.ok: added a better passthru test

2011-05-11 13:16  sloot

	* [r9863] NEWS: more News

2011-05-11 13:11  sloot

	* [r9862] src/tokenize.cxx, src/ucto.cxx: small change in
	  configfile handling.
	  Now the messages to stdout are correct.

2011-05-11 13:08  sloot

	* [r9861] tests/testconf1.ok, tests/testconf2.ok,
	  tests/testinclude.ok: results ar slighlty different after small
	  code reshuffle

2011-05-11 08:37  sloot

	* [r9855] tests/invoer.rule, tests/small.cfg, tests/small.txt,
	  tests/testall, tests/testinclude, tests/testinclude.ok: added a
	  test for inclusion

2011-05-11 08:36  sloot

	* [r9854] tests/testconf1.ok: small change after bug fix

2011-05-10 15:46  sloot

	* [r9853] src/tokenize.cxx: fixed include file problem. I think. :)
	  When a configfile with a (absolute/relative) path is provided (-c
	  option)
	  the include files will be searched in the same dir as that config
	  file

2011-05-10 10:32  sloot

	* [r9845] tests/bug0072.nl.tok.V, tests/bug0072.nl.txt,
	  tests/bug0075.nl.tok.V, tests/bug0075.nl.txt: wrong name for this
	  bug

2011-05-10 08:38  sloot

	* [r9844] src/tokenize.cxx: KNOWN Abbreviations are case sensitive
	  now. Fixes bug 72

2011-05-10 08:19  sloot

	* [r9843] tests/bug0072.nl.tok.V, tests/bug0072.nl.txt: added bug72

2011-04-27 16:50  mvgompel

	* [r9732] configure.ac: version bump after release

2011-04-27 16:38  mvgompel

	* [r9730] NEWS: updated NEWS

2011-04-27 16:38  mvgompel

	* [r9729] src/tokenize.cxx: Implemented sentenceperlineinput
	  support directly in DetectSentenceBounds and PassTru (previously
	  unsupported)

2011-04-19 15:15  sloot

	* [r9591] include/ucto/tokenize.h, src/tokenize.cxx: also %include
	  for [EOSMARKERS]

2011-04-19 15:15  sloot

	* [r9590] config/Makefile.am, config/exotic-eos.eos,
	  config/exotic-quotes.quote, config/ligatures.filter,
	  config/standard-eos.eos, config/standard-quotes.quote,
	  config/tokconfig-de, config/tokconfig-nl: added common files for
	  inclusion in config

2011-04-19 14:54  sloot

	* [r9589] config/Makefile.am, config/tokconfig-nl,
	  include/ucto/tokenize.h, src/tokenize.cxx: now [FILTER] and
	  [QUOTES] also honour %include requests in the config
	  For [FILTER] the file extension MUST be .filter
	  For [QUOTES] the file extension MUST be .quote

2011-04-19 13:55  sloot

	* [r9582] config/tokconfig-nl, src/tokenize.cxx: fixed bug 67.
	  Now it is possible to add an empty replacement in the [FILTER}
	  block

2011-04-19 13:12  sloot

	* [r9581] tests/bug0067.nl.tok.V, tests/bug0067.nl.txt: added a new
	  bug test

2011-04-07 14:38  mvgompel

	* [r9350] src/tokenize.cxx: no schemalocation for FoLiA

2011-04-07 09:35  sloot

	* [r9313] include/ucto/tokenize.h, src/tokenize.cxx: arghh

2011-04-07 09:18  sloot

	* [r9311] include/ucto/tokenize.h: hmpff.

2011-04-07 09:07  sloot

	* [r9310] NEWS, include/ucto/tokenize.h, src/ucto.cxx: added
	  functions to display version info.
	  Updated NEWS

2011-04-07 08:42  sloot

	* [r9309] src/ucto.cxx: return conforming exit status.

2011-04-04 12:08  sloot

	* [r9271] configure.ac: bumped version after release

2011-04-04 12:08  sloot

	* [r9270] NEWS: news!

2011-04-04 09:47  mvgompel

	* [r9268] src/tokenize.cxx, tests/bug0065.nl.tok.V,
	  tests/bug0065b.nl.tok.V, tests/bug0065b.nl.txt,
	  tests/bug0065c.nl.tok.V, tests/bug0065c.nl.txt: fixed bug0065

2011-04-04 08:09  mvgompel

	* [r9267] tests/bug0065.nl.txt: new bug

2011-03-31 15:57  sloot

	* [r9261] NEWS, src/tokenize.cxx, src/unicode.cxx: Ok, now ALL
	  unicode strings are output as UTF8.
	  Just to be sure.

2011-03-31 14:56  sloot

	* [r9258] src/tokenize.cxx: Attempt to fix output encoding
	  problems. We want it to be UTF8.

2011-03-31 12:33  sloot

	* [r9240] moved ucto to its final destination

2011-03-19 17:13  joostvb

	* [r8975] new upstream checksum

2011-03-19 17:11  joostvb

	* [r8974] new upstream, new homepage

2011-03-19 10:49  joostvb

	* [r8970] start 0.4.4 development

2011-03-19 10:44  joostvb

	* [r8969] release 0.4.3

2011-03-19 10:44  joostvb

	* [r8968] record changes

2011-03-17 15:48  sloot

	* [r8936] fixed --passthru problem

2011-03-17 15:48  sloot

	* [r8935] test now works

2011-03-17 09:31  sloot

	* [r8896] bumped version after release

2011-03-16 13:41  sloot

	* [r8877] passes -pedantic

2011-03-16 12:59  sloot

	* [r8871] some cleanup
	  bumped version

2011-03-14 11:12  sloot

	* [r8843] added (hidden) --passthru option

2011-03-14 11:12  sloot

	* [r8842] added a passthru test.
	  has t0 be tested :)

2011-03-14 10:07  sloot

	* [r8839] initialize your variables, please.

2011-03-14 10:01  sloot

	* [r8838] make compiler more happy

2011-03-14 09:16  sloot

	* [r8837] added description

2011-03-14 09:02  sloot

	* [r8836] smal update

2011-03-12 15:48  joostvb

	* [r8811] new upstream

2011-03-11 16:53  mvgompel

	* [r8796] fixed passthruline (skip=t) bug

2011-03-08 16:01  sloot

	* [r8751] fixed regexp and error messag

2011-03-08 15:54  sloot

	* [r8750] added the possiblity to ste the order of RULES in the
	  config file

2011-03-08 15:52  sloot

	* [r8749] strange a space has left the building

2011-03-08 13:47  mvgompel

	* [r8748] fix

2011-03-08 13:47  mvgompel

	* [r8747] override

2011-03-08 13:42  mvgompel

	* [r8746] fix bug0063

2011-03-08 12:59  sloot

	* [r8745] added a test for bug63
	  Not sure about the 'correct' solution

2011-03-08 10:02  sloot

	* [r8742] updated man page

2011-03-07 16:05  mvgompel

	* [r8726] FoLiA XSL has to be local unfortunately

2011-03-07 14:42  mvgompel

	* [r8721] fix

2011-03-07 14:38  mvgompel

	* [r8720] fix

2011-03-04 13:26  mvgompel

	* [r8687] added logo

2011-03-04 11:44  mvgompel

	* [r8683] fix

2011-03-03 15:23  mvgompel

	* [r8678] oops

2011-03-03 15:22  mvgompel

	* [r8677] no body in FoLiA anymore

2011-03-01 13:59  sloot

	* [r8637] bumped version. seem a good moment for 0.4

2011-03-01 13:43  sloot

	* [r8636] these tests give a different outcome now.

2011-02-28 15:36  mvgompel

	* [r8620] oops: the sequel

2011-02-28 15:35  mvgompel

	* [r8619] oops

2011-02-28 15:32  mvgompel

	* [r8618] sentence per line input and output: two modes

2011-02-28 15:11  mvgompel

	* [r8617] Fixed bug 54

2011-02-28 13:46  mvgompel

	* [r8615] fixes

2011-02-28 13:46  mvgompel

	* [r8614] FoLiA improvement

2011-02-27 15:34  joostvb

	* [r8593]

2011-02-27 15:06  joostvb

	* [r8592] first shot at ucto rpm

2011-02-27 10:35  joostvb

	* [r8576] split, in order to be able to build frog

2011-02-27 08:45  joostvb

	* [r8565] shlibs stuff. ship it

2011-02-26 15:31  joostvb

	* [r8563] first shot at building ucto for fink/macosx

2011-02-14 13:35  sloot

	* [r8318] added experimental code to use the -n option ( output one
	  sentence per line) also to process the input one sentence per
	  line

2011-02-14 13:18  sloot

	* [r8317] testcase for bug0054

2011-02-12 18:09  joostvb

	* [r8299] 0.3.6 is released. start 0.3.7

2011-02-12 18:04  joostvb

	* [r8298] ship it

2011-02-12 18:00  joostvb

	* [r8297] record more changes

2011-02-12 17:42  joostvb

	* [r8296] record some changes

2011-02-10 16:59  sloot

	* [r8291] missing file added

2011-02-10 16:33  sloot

	* [r8289] added a test for the -e option

2011-02-10 16:18  sloot

	* [r8288] ok, accept any encoding string, now.
	  Throw up when not known.

2011-02-10 16:03  sloot

	* [r8286] adapted tests to the reality

2011-02-10 15:43  sloot

	* [r8283] added possibility to set the inputEncoding
	  breaks ucto user interface!

2011-02-10 11:51  mvgompel

	* [r8280] version bump

2011-02-10 11:51  mvgompel

	* [r8279] Fixed quote problems bug0052, bug0051

2011-02-09 12:27  mvgompel

	* [r8274] debug fix

2011-02-09 12:04  mvgompel

	* [r8273] fixed tests

2011-02-09 11:55  mvgompel

	* [r8272] added more debug

2011-02-09 11:43  mvgompel

	* [r8271] updated error messages

2011-02-09 11:08  sloot

	* [r8270] better error message

2011-02-08 17:14  sloot

	* [r8264] more useful naming

2011-02-08 17:07  sloot

	* [r8263] ok, this is what is meant.

2011-02-08 16:43  sloot

	* [r8259] make the test fail again. this is dead wrong

2011-02-08 16:27  sloot

	* [r8258] added a demo for bug52

2011-02-08 10:27  sloot

	* [r8251] fixed bug51. Not realy happy

2011-02-08 10:24  sloot

	* [r8250] this is what we want

2011-02-08 09:39  sloot

	* [r8249] added a demo of bug51

2011-02-07 16:17  sloot

	* [r8243] New test. OK file is NOT OK :)

2011-02-07 16:09  sloot

	* [r8242] cleanup

2011-02-07 14:13  sloot

	* [r8237] bit of cleanup

2011-02-03 15:31  sloot

	* [r8216] fix memory leak

2011-02-03 15:24  sloot

	* [r8215] - testall now tests all.
	  - hacked around in python without real knowledge to support
	  Valgrind testing

2011-02-03 13:00  sloot

	* [r8212] removed strange stray '\'

2011-02-03 12:55  sloot

	* [r8211] last "elegant" solution was dead wrong (but passed all
	  tests)
	  this one is much better (and passes all tests)
	  Maybe the tests are wrong?

2011-02-03 11:00  sloot

	* [r8206] more elegant ;)

2011-02-03 10:57  sloot

	* [r8205] .

2011-02-03 10:00  sloot

	* [r8204] added a test for invalid combination of options

2011-02-03 09:59  sloot

	* [r8203] make sure -L and -c don't intermix

2011-02-02 15:42  sloot

	* [r8196] props set

2011-02-02 14:14  sloot

	* [r8186] ID ==> Id

2011-02-02 14:12  sloot

	* [r8184] tag

2011-02-02 14:11  sloot

	* [r8183] tags

2011-02-02 14:08  sloot

	* [r8182] more copyright

2011-02-02 14:05  sloot

	* [r8181] added copyright notice

2011-02-02 11:53  sloot

	* [r8171] we want these fuction in the std namespace, i think

2011-02-02 11:35  sloot

	* [r8170] more clean implementation. hide normalization in a Class

2011-02-02 11:34  sloot

	* [r8169] changed spelling

2011-02-02 10:17  sloot

	* [r8168] put some usefull unicode stuff in separate module

2011-02-01 15:41  sloot

	* [r8155] this one was missing

2011-02-01 15:34  mvgompel

	* [r8154] nog een poging

2011-02-01 15:30  mvgompel

	* [r8153] fix

2011-02-01 15:24  mvgompel

	* [r8152] back geport

2011-02-01 15:12  sloot

	* [r8151] oesp

2011-02-01 14:26  sloot

	* [r8148] renamed. use same number as in Bugzilla:
	  https://ilk.uvt.nl/bugzilla3//show_bug.cgi?id=48

2011-02-01 14:15  sloot

	* [r8146] Ok, it is a bit harsh, but strip all path info, to make
	  the tests worl "everywhere"

2011-02-01 11:42  sloot

	* [r8136] testresults have changed

2011-02-01 11:11  sloot

	* [r8135] better naming

2011-02-01 11:09  sloot

	* [r8134] added a test for several Normalisation types

2011-02-01 10:45  sloot

	* [r8133] now actualy use -e values for normalisation

2011-02-01 10:25  sloot

	* [r8132] -f option now works

2011-02-01 10:25  sloot

	* [r8131] better test

2011-01-31 16:52  sloot

	* [r8126] cleaned up non-dutch eosmarker
	  attempt to add 0x0a (\n) as eosmarker failed

2011-01-31 15:43  sloot

	* [r8123] new test added (.V is empty, to gat a nice diff)

2011-01-31 15:12  sloot

	* [r8122] test more

2011-01-31 15:04  sloot

	* [r8121] this is the desired result (NFC)

2011-01-31 15:01  sloot

	* [r8120] more to-the-point naming

2011-01-31 14:47  sloot

	* [r8119] added support for ligature filtering and Unicode
	  normalizing.
	  a bit rough still

2011-01-31 13:40  sloot

	* [r8117] now uses ucto-icu.pc

2011-01-31 10:46  sloot

	* [r8091] to please Maarten even more

2011-01-31 10:45  sloot

	* [r8090] zo beter dan Maarten?

2011-01-31 10:32  sloot

	* [r8086] our first officila bug!

2011-01-27 16:28  joostvb

	* [r8060] cosmetics

2011-01-27 16:18  sloot

	* [r8059] props set

2011-01-27 16:14  joostvb

	* [r8058] release 0.3.4, record changes

2011-01-27 16:13  sloot

	* [r8057] props

2011-01-27 16:08  sloot

	* [r8056] propset

2011-01-27 16:08  sloot

	* [r8055] propset

2011-01-27 16:06  sloot

	* [r8054] more tests
	  forgot some files

2011-01-27 16:06  joostvb

	* [r8053] rename icu.pc to ucto-icu.pc: be sure we wont suffer from
	  filename clashes in the future

2011-01-27 16:01  sloot

	* [r8052] more tests added

2011-01-27 16:00  mvgompel

	* [r8051] Updates in FoLiA support

2011-01-27 15:50  sloot

	* [r8050] better?

2011-01-27 15:46  sloot

	* [r8049] more tests

2011-01-27 15:38  sloot

	* [r8047] added some scripts to test illegal parameters etc.

2011-01-27 13:51  sloot

	* [r8043] een interessante test. De .V is NU wel ok, maar ik vind
	  dat ie fout moet worden ;)

2011-01-27 13:44  sloot

	* [r8042] added a test with combining vs precombined characters

2011-01-27 12:23  joostvb

	* [r8040] release 0.3.3

2011-01-27 12:18  joostvb

	* [r8039] record changes and releases

2011-01-27 11:57  joostvb

	* [r8038] tags

2011-01-27 11:57  joostvb

	* [r8037] honor --sysconfdir in ./configure call

2011-01-27 11:56  joostvb

	* [r8036] tags

2011-01-27 11:45  sloot

	* [r8034] added missing output operator declaration

2011-01-27 11:13  sloot

	* [r8030] bumped version after "release" of 0.3.1

2011-01-26 17:17  sloot

	* [r8019] install makefile too

2011-01-26 17:08  sloot

	* [r8018] fixed bug, bumped version, added Makefile and such

2011-01-18 15:34  sloot

	* [r7866] some formatting, comment and such

2011-01-18 15:09  sloot

	* [r7865] leaner ande meaner

2011-01-18 15:04  sloot

	* [r7864] refactoring detectSentenceBounds()
	  tricky!

2011-01-18 14:35  sloot

	* [r7863] much simpler. No limit to length of "opening quote"
	  string.

2011-01-18 12:46  sloot

	* [r7854] added << operator for TokenRole (and use is)

2011-01-18 12:37  sloot

	* [r7853] get rid of unused function

2011-01-18 12:34  sloot

	* [r7852] made clear() private.

2011-01-18 12:21  sloot

	* [r7851] make stream parameters a bit more C++ like

2011-01-18 10:57  sloot

	* [r7849] cleaned up TokenRole stuff

2011-01-18 10:32  sloot

	* [r7848] refactoring...

2011-01-18 10:05  sloot

	* [r7847] some refactoring going on

2011-01-13 16:46  sloot

	* [r7831] comment added

2011-01-13 16:45  sloot

	* [r7830] all configs include the same URL and E-MAIL rules now

2011-01-13 16:34  sloot

	* [r7829] minor version bump

2011-01-13 16:33  sloot

	* [r7828] ignore sysconfdir when filename contains a '/' ( so for
	  ../rel/ati/ve and /abs/olute/path )

2011-01-13 16:27  sloot

	* [r7827] barf louder when no config found

2011-01-13 16:22  sloot

	* [r7826] nu wel

2011-01-13 16:20  sloot

	* [r7825] uninclude

2011-01-13 15:53  sloot

	* [r7824] e-mail rule created

2011-01-13 15:48  sloot

	* [r7823] added simpel include mechanism. Not very robust.
	  Dont forget to add .rule files to config/Makefile.am !

2011-01-13 14:57  mvgompel

	* [r7822] fix

2011-01-13 14:41  mvgompel

	* [r7821] fixes and test update

2011-01-13 14:32  mvgompel

	* [r7820] added detectQuoteBounds

2011-01-13 14:24  sloot

	* [r7819] adde a few more sensible default

2011-01-13 14:23  sloot

	* [r7818] added a file to store useful tips for the upcoming manual

2011-01-13 14:10  mvgompel

	* [r7817] added test text from NU.nl

2011-01-13 14:07  mvgompel

	* [r7816] renamed

2011-01-13 14:03  mvgompel

	* [r7815] Added lots of tests

2011-01-13 13:52  sloot

	* [r7814] overthrown

2011-01-13 13:41  sloot

	* [r7813] less exit, more throw

2011-01-13 13:22  sloot

	* [r7811] added a setter voor the Error logfile
	  otherwise logging/debugging is impossible when ucto is included
	  in a daemon
	  (which as we all now, closes stderr)
	  This was the sole reason for having theErrLog anyway

2011-01-13 12:04  mvgompel

	* [r7810] fix

2011-01-13 12:02  mvgompel

	* [r7809] test wrapper done

2011-01-13 12:02  mvgompel

	* [r7808] added quote tests, updated test script

2011-01-13 12:02  mvgompel

	* [r7807] fix

2011-01-13 12:01  sloot

	* [r7806] use our own exceptions

2011-01-13 11:44  mvgompel

	* [r7805] fixes

2011-01-13 11:44  mvgompel

	* [r7804] added test wrapper

2011-01-13 11:38  sloot

	* [r7803] beterst

2011-01-13 11:29  sloot

	* [r7802] beterder

2011-01-13 11:26  sloot

	* [r7801] throw more sensible exceptions

2011-01-12 15:52  sloot

	* [r7798] propset

2011-01-12 15:43  sloot

	* [r7797] quotation separator is now whitespace (spaces and/or
	  tabs)

2011-01-12 14:49  sloot

	* [r7796] added changed testresults

2011-01-12 14:48  sloot

	* [r7795] better handle the 'multiple opening, single closing'
	  quotation problem
	  added test

2011-01-12 14:24  sloot

	* [r7794] the test was changed. results are not OK, I think.
	  (nested quotes)

2011-01-12 14:22  sloot

	* [r7793] first attempt to read QUOTES from config file
	  syntax undecided yet
	  what about reasonable defaults?

2011-01-12 13:37  sloot

	* [r7792] started enhancin quote handling

2011-01-12 13:29  mvgompel

	* [r7791] small adaptation to FoLiA format

2011-01-11 17:32  sloot

	* [r7788] test.nl now seems to work
	  added something naughty to qtest.spec

2011-01-11 17:18  mvgompel

	* [r7787] trailing space fix

2011-01-11 16:45  sloot

	* [r7786] first step in more clean quote handling

2011-01-11 15:59  sloot

	* [r7785] added a simple test for 'special' quotes.

2011-01-11 15:27  sloot

	* [r7784] somewaht better results, but still strange!

2011-01-11 15:01  mvgompel

	* [r7783] fix

2011-01-11 14:38  mvgompel

	* [r7781] oops, forgot )

2011-01-11 14:32  mvgompel

	* [r7780] omgekeerd is beter denk ik

2011-01-11 14:32  mvgompel

	* [r7779] fixes

2011-01-11 14:29  sloot

	* [r7778] added test for quotation

2011-01-11 14:24  mvgompel

	* [r7777] Important bugfixes!

2011-01-11 13:26  mvgompel

	* [r7776] extra debug level

2011-01-11 13:02  sloot

	* [r7775] created a tests directory.
	  added tokenized ans Verbose Tokenized output for comparision

2011-01-11 11:59  sloot

	* [r7774] fixed all warnings.
	  Not very smart yet.

2011-01-11 09:16  sloot

	* [r7773] updatet Man page

2011-01-10 18:03  mvgompel

	* [r7771] removed some obsolete things

2011-01-10 18:01  mvgompel

	* [r7770] fixes, quote output in FoLiA works now

2011-01-10 17:45  mvgompel

	* [r7769] fixes

2011-01-10 17:33  mvgompel

	* [r7768] bugfixes

2011-01-10 17:15  mvgompel

	* [r7767] improved debug mode + bugfixes

2011-01-10 15:44  mvgompel

	* [r7765] Added a new passthru mode for pre-tokenised text

2011-01-10 14:58  mvgompel

	* [r7762] oops 2, removed debug line

2011-01-10 14:57  mvgompel

	* [r7761] oops, fixed segfault

2011-01-10 14:29  mvgompel

	* [r7760] major refactoring. Improved buffering, less unnecessary
	  storing of token/sentence vectors in memory. Improved quote
	  support.

2011-01-10 11:42  mvgompel

	* [r7759] set BOS in sentences in quotes

2011-01-10 11:36  mvgompel

	* [r7758] fixed EOS detection before punctuation

2011-01-10 11:12  mvgompel

	* [r7756] working on proper output with quote support

2011-01-07 17:49  mvgompel

	* [r7754] updated some options

2011-01-07 15:54  mvgompel

	* [r7753] version bump

2011-01-07 15:49  mvgompel

	* [r7752] Ucto now remembers if a token was spaced or not in the
	  original. Enabling ucto to recontruct the original text exactly.

2011-01-07 14:04  mvgompel

	* [r7749] Added quote detection support

2011-01-06 18:25  mvgompel

	* [r7746] update?

2011-01-06 18:04  mvgompel

	* [r7745] more debug update

2011-01-06 18:02  mvgompel

	* [r7744] debug updates

2011-01-06 17:52  mvgompel

	* [r7743] updated help

2011-01-06 17:49  mvgompel

	* [r7742] more sensible debug messages

2011-01-06 17:35  mvgompel

	* [r7741] I think that all changes merit a version bump now

2011-01-06 17:35  mvgompel

	* [r7740] added my testset

2011-01-06 16:57  mvgompel

	* [r7739] important fixes in paragraph/sentence handling

2011-01-06 15:57  mvgompel

	* [r7738] some fixes in sentence/paragraph detection (not enough
	  yet)

2011-01-06 15:31  sloot

	* [r7737] now we can debug init() too

2011-01-06 15:28  sloot

	* [r7736] fixing

2011-01-06 15:04  sloot

	* [r7735] refuse to run when inputfile is bad

2011-01-06 14:58  mvgompel

	* [r7734] consistency updates

2011-01-06 14:51  mvgompel

	* [r7733] set docid to string, assuming utf-8

2011-01-06 14:46  mvgompel

	* [r7732] paragraph fix in XML output

2011-01-06 14:29  mvgompel

	* [r7731] fixed beginofsentence bug

2011-01-06 13:55  mvgompel

	* [r7730] merge fix

2011-01-06 13:50  mvgompel

	* [r7729] Added preliminary FoLiA XML output support in ucto

2011-01-06 13:50  mvgompel

	* [r7728] fix

2011-01-06 13:12  sloot

	* [r7727] added a simple 'man' page

2011-01-06 12:53  sloot

	* [r7726] terrible and destructive bug!

2011-01-06 12:52  sloot

	* [r7725] to do

2011-01-06 11:42  sloot

	* [r7724] added al -p switch to disable paragraph detection
	  how do I test that?

2011-01-06 11:14  sloot

	* [r7723] -d option now works

2011-01-05 15:40  mvgompel

	* [r7716] version update

2011-01-05 15:30  mvgompel

	* [r7714] Updated version

2011-01-05 15:07  pberck

	* [r7713] Small fix.

2011-01-05 15:05  mvgompel

	* [r7712] Big API overhaul

2011-01-05 13:43  sloot

	* [r7707] fix missing ChangeLog message while bootstrapping

2011-01-05 13:36  pberck

	* [r7706] Renamed Swedish tokconfig to tokconfig-sv according to
	  ISO-639-1.

2011-01-05 13:31  pberck

	* [r7705] First attempt at Swedish tokconfig added.

2010-12-27 12:22  mvgompel

	* [r7615] typo

2010-12-27 12:22  mvgompel

	* [r7614] updated

2010-12-27 12:22  mvgompel

	* [r7613] upaded README

2010-12-25 13:05  joostvb

	* [r7605] use sysconfdir variable, not hardcoded etc/

2010-12-25 12:59  joostvb

	* [r7604] do not install changelog, packagers take care of that

2010-12-25 12:23  joostvb

	* [r7600] release 0.0.1 before releasing 1.0

2010-12-25 12:18  joostvb

	* [r7599] ChangeLog is autogenerated, see bootstrap

2010-12-16 10:23  sloot

	* [r7443] '==" is illegal, but works on GNU system

2010-12-16 09:18  sloot

	* [r7440] adapting pberck's MacOSX adaptations

2010-12-16 09:18  sloot

	* [r7439] hmm

2010-12-16 06:50  pberck

	* [r7435] Fix for compiling on OS X.

2010-12-15 21:19  antalb

	* [r7434] changed Tokenizer to ucto

2010-12-15 17:10  sloot

	* [r7433] now the EOSMARKERS are realy fixed, i hope.

2010-12-15 16:00  sloot

	* [r7430] hmm, less const, otherwise FrogNG chokes

2010-12-15 15:54  sloot

	* [r7429] oesp

2010-12-15 15:53  sloot

	* [r7428] used a more clean UCI function to extract unicode numbers
	  to real characters

2010-12-15 15:49  sloot

	* [r7427] fixed other languages too
	  al lot of duplication, can't we do better?

2010-12-15 15:44  sloot

	* [r7425] fixed dutch too

2010-12-15 15:40  sloot

	* [r7422] two characters were wrong
	  some unicode entries were non-standard

2010-12-15 14:58  mvgompel

	* [r7418] renamed Tokenizer --> ucto

2010-12-15 14:19  sloot

	* [r7413] more const clean

2010-12-15 14:02  sloot

	* [r7412] more C++ like

2010-12-15 13:49  sloot

	* [r7411] some reshuffling. make local things more local

2010-12-15 11:57  sloot

	* [r7408] hmpfff, wrong messages

2010-12-15 11:42  sloot

	* [r7406] automagicly create an icu.pc file.
	  ICU doesn't provide one (it will do so starting with 4.6)

2010-12-14 16:59  sloot

	* [r7398] added more output fields

2010-12-14 13:58  sloot

	* [r7394] another attempt to fix it for MacOSX too

2010-12-13 21:28  mvgompel

	* [r7386] Added default configuration if no -c is specified, also
	  added -L shortcut

2010-12-13 21:00  mvgompel

	* [r7385] Added features + fixes

2010-12-13 18:20  mvgompel

	* [r7380] preparing for extra functions

2010-12-13 17:14  sloot

	* [r7379] reverted change. Mac only supports archaic ICU versions

2010-12-13 17:00  sloot

	* [r7378] attempt. but does it compile on a mac?

2010-12-13 16:49  sloot

	* [r7377] unused fuction removed

2010-12-13 16:40  sloot

	* [r7376] more standard ICU solution

2010-12-13 15:53  sloot

	* [r7367] merged with some new stuff from FrogNG

2010-12-13 15:07  sloot

	* [r7364] properties set

2010-12-13 14:59  sloot

	* [r7363] build a pkgconfig file

2010-12-13 13:20  sloot

	* [r7351] added

2010-12-13 13:16  sloot

	* [r7350] build an use libucto.so

2010-12-13 12:02  sloot

	* [r7346] did some cherry picking in unicode_utils.* and included
	  them in tokenize.*

2010-12-09 16:45  sloot

	* [r7260] oeps i did it again

2010-12-09 16:44  sloot

	* [r7258] oeps³

2010-12-09 16:42  sloot

	* [r7257] oeps²

2010-12-09 16:40  sloot

	* [r7256] oesp

2010-12-09 16:37  sloot

	* [r7255] let's be verbose

2010-12-09 16:09  sloot

	* [r7254] needed for build system

2010-12-09 16:07  sloot

	* [r7253] it works!
	  inclusive a small demo

2010-12-09 14:21  sloot

	* [r7250] more essential files

2010-12-09 14:16  sloot

	* [r7249] add some files

2010-12-09 14:14  sloot

	* [r7248] created basic dir structure

2010-12-09 11:59  sloot

	* [r7242] new dir

