package unidecode

  1. Overview
  2. Docs
Convert unicode strings into its ASCII representation

Install

Dune Dependency

Authors

Maintainers

Sources

v0.4.0.tar.gz
md5=88090f06c51464a0d829081640be3f3d
sha512=a411685ddb1d52585251eff3e2cd56fc9b4e91dcfceff424d7cfd5f721be73561d7a8e9223ecaeeda6178b02e5f6df74d50a49d4ce61d72a7d3fa1e8aefe5cf3

Description

Published: 30 Oct 2019

README

README.MD

# unidecode

Convert unicode strings into its ASCII representation.

The purpose of this library is the same as python's
[unidecode](https://pypi.org/project/Unidecode/) library (version
1.1.1).

Code of the initial release of this library has been extracted from
[GeneWeb](https://github.com/geneweb/geneweb) and adapted to be
released in an independent library.

## Installation

```bash
opam install unidecode
```

## License

Released under the terms of the [GNU GENERAL PUBLIC LICENSE](LICENSE).

## Limitations

- Only supports NFC normalization form.
- Transliteration targets french language
  (i.e. russian `у` gives `ou` while `u` could be expected).
  This will eventually be parameterizable.
- Transliteration might produce strange casing
  (e.g. `У` produce `OU` while `Ou` could be expected).
  Choosing between default (current) behavior, lower casing, upper
  casing, and capitalization will eventually be an option.

## Instructions for developpers

```bash
dune build            # build the library
dune install          # install the built library
dune clean            # clean compilation artifacts
dune runtest          # run unit tests
dune build @runbench  # compare with other libs
```

Dependencies (2)

  1. ocaml >= "4.05"
  2. dune >= "1.10"

Dev Dependencies

None

Used by

None

Conflicts

None