Skip to content
@unicode-org

The Unicode Consortium

The standards body for character encoding and internationalization.

Welcome to the Unicode Consortium on GitHub

The Unicode Consortium is the standards body for character encoding and internationalization of software and services. Read more about us. Repositories here are maintained by various technical committees, please see here for information about each commitee.

Copyright © 1991-2024 Unicode, Inc. Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the United States and other countries. Unicode Github repositories and their contents are subject to the Unicode Terms of Use. A CLA is required to contribute to Unicode projects - please refer to the CONTRIBUTING.md file (or start a Pull Request) for more information.

Pinned Loading

  1. icu icu Public

    The home of the ICU project source code.

    C++ 3.3k 832

  2. cldr cldr Public

    The home of the Unicode Common Locale Data Repository

    Java 1k 409

  3. message-format-wg message-format-wg Public

    Developing a standard for localizable message strings

    JavaScript 280 35

  4. icu4x icu4x Public

    Solving i18n for client-side and resource-constrained environments.

    Rust 1.6k 230

Repositories

Showing 10 of 38 repositories
  • icu Public

    The home of the ICU project source code.

    unicode-org/icu’s past year of commit activity
    C++ 3,277 832 0 91 Updated Oct 27, 2025
  • icu4x Public

    Solving i18n for client-side and resource-constrained environments.

    unicode-org/icu4x’s past year of commit activity
    Rust 1,641 230 577 (128 issues need help) 38 Updated Oct 27, 2025
  • message-format-wg Public

    Developing a standard for localizable message strings

    unicode-org/message-format-wg’s past year of commit activity
    JavaScript 280 35 25 4 Updated Oct 27, 2025
  • cldr Public

    The home of the Unicode Common Locale Data Repository

    unicode-org/cldr’s past year of commit activity
    Java 1,030 409 0 201 Updated Oct 27, 2025
  • cldr-staging Public

    Proposed production data for CLDR data

    unicode-org/cldr-staging’s past year of commit activity
    HTML 29 14 0 4 Updated Oct 27, 2025
  • cldr-json Public

    JSON Data from the Unicode CLDR Project

    unicode-org/cldr-json’s past year of commit activity
    Shell 616 83 1 3 Updated Oct 27, 2025
  • unicodetools Public

    home of unicodetools and https://util.unicode.org JSPs

    unicode-org/unicodetools’s past year of commit activity
    HTML 67 58 165 59 Updated Oct 25, 2025
  • lstm_word_segmentation Public

    Python code for training an LSTM model for word segmentation in Thai, Burmese, and similar languages.

    unicode-org/lstm_word_segmentation’s past year of commit activity
    Python 25 14 7 4 Updated Oct 23, 2025
  • icu4x-docs Public

    ICU4X Docs

    unicode-org/icu4x-docs’s past year of commit activity
    HTML 4 7 1 2 Updated Oct 21, 2025
  • icu-perf Public

    ICU performance test results. Maintained by ICU-TC

    unicode-org/icu-perf’s past year of commit activity
    JavaScript 4 2 0 0 Updated Oct 16, 2025