package uunf

Unicode text normalization for OCaml


v15.0.0 2022-09-15 Zagreb

  • Unicode 15.0.0 support.

v14.0.0 2021-09-17 Zagreb

  • Unicode 14.0.0 support.

v13.0.0 2020-03-11 La Forclaz (VS)

  • Unicode 13.0.0 support.

  • Require OCaml >= 4.03.0.

v12.0.0 2019-03-08 La Forclaz (VS)

  • Unicode 12.0.0 support.

v11.0.0 2018-06-06 Lausanne

  • Unicode 11.0.0 support.

  • Fix bug when normalizer flushes at the end of stream: a spurious Await was returned before the final End.

v10.0.0 2017-06-20 Cambridge (UK)

  • Unicode 10.0.0 support.

  • Fix bug in canonical composition algorithm (used by NFC and NFKC forms). Thanks to Stephen Dolan for the report.

  • Fix regression of Uucp.ccc introduced by f4c0363 which went into v2.0.{0,1}.

v2.0.1 2017-03-07 La Forclaz (VS)

  • OCaml 4.05.0 compatibility (removal of Uchar.dump).

v2.0.0 2016-11-23 Zagreb

  • Support for Unicode 9.0.0.

  • OCaml standard library Uchar.t support.

    • Replaces the type Uunf.uchar = int with the (abstract) Uchar.t type. Uchar.{of,to}_int allow recovering the previous representation.

    • Removes Uunf.is_scalar_value. Uchar.is_valid can be used instead.

  • Safe string support.

  • Build depends on topkg.

  • Relicense from BSD3 to ISC.
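For code that still stores scalar values as plain integers, the Uchar.t migration noted in this release can be sketched as follows. This is a minimal illustration that uses only the standard library Uchar module; the helper names uchar_of_legacy and legacy_of_uchar are hypothetical and not part of Uunf.

```ocaml
(* Hypothetical helpers for migrating code written against the old
   [Uunf.uchar = int] representation. Only the stdlib [Uchar] module
   is used. *)

(* Convert a legacy integer to [Uchar.t]. [Uchar.is_valid] takes over
   the role of the removed [Uunf.is_scalar_value]. *)
let uchar_of_legacy (i : int) : Uchar.t option =
  if Uchar.is_valid i then Some (Uchar.of_int i) else None

(* Recover the previous integer representation. *)
let legacy_of_uchar (u : Uchar.t) : int = Uchar.to_int u
```

Returning an option keeps surrogates and out-of-range integers from raising Invalid_argument in Uchar.of_int.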

v1.0.0 2015-06-17 Cambridge (UK)

  • Updated for Unicode 8.0.0.

  • topkg support.

  • Uunf.add now eventually returns `End once `End was encoded and the character stream was entirely output. In most existing programs this simply entails adding `End to the existing `Await case in pattern matches on the result of Uunf.add.

  • Adds the Uunf_string library, which allows directly normalizing UTF-X encoded OCaml strings. This library depends on Uutf.

  • Rewrote the utility unftrip to use Cmdliner, which is now an optional dependency of the package. The command line interface is incompatible with previous versions. Support for random Unicode scalar value generation was removed; use utftrip from the Uutf package for that.

  • Rewrote the module's data generation to essentially match what is done in Uucp. Much less ugly, no source file sedding.
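The Uunf.add change in this release affects how client loops match on its result. The sketch below shows one way to handle `End together with `Await, assuming the uunf library is installed and a version that uses Uchar.t (v2.0.0 or later). The function name normalize is illustrative and not part of the library.

```ocaml
(* Normalize a list of scalar values with the given normalization form,
   for example `NFC or `NFD. *)
let normalize nf (us : Uchar.t list) : Uchar.t list =
  let n = Uunf.create nf in
  (* Feed [v] to the normalizer and collect every character it outputs.
     After a `Uchar result, `Await must be added until the normalizer
     answers `Await or `End. *)
  let rec add acc v =
    match Uunf.add n v with
    | `Uchar u -> add (u :: acc) `Await
    | `Await | `End -> acc   (* `End is handled like `Await here *)
  in
  let acc = List.fold_left (fun acc u -> add acc (`Uchar u)) [] us in
  (* Signal the end of input so that buffered characters are flushed. *)
  List.rev (add acc `End)
```

For example, normalize `NFC applied to U+0065 followed by U+0301 yields the single precomposed character U+00E9.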

v0.9.3 2014-06-16 Cambridge (UK)

  • Updated for Unicode 7.0.0.

v0.9.2 2013-10-01 Lausanne

  • Updated for Unicode 6.3.0.

  • OPAM friendly workflow and drop OASIS support.

v0.9.1 2013-01-04 La Forclaz (VS)

  • Updated for Unicode 6.2.0.

  • Fix Uunf.is_scalar_value always returning false.

  • Make the module completely safe for the client.

  • Change command line help of unftrip.

v0.9.0 2012-09-07 Lausanne

First release.