Generic i18n library managing the fullstack interaction in a CI/CD pace. The dictionaries are stored in a DB edited by the translators through a(/the same) web application - managing translation errors, missing keys, ...
- 1.1.x - beta
The main documentation is in the repository
After re-implementing again and again internationalization as I was happy with none, I built up the minimum requirements I'd ask an 18n library to do:
- Synchronous translation
- Having a json-file (or alike) per language is perfect when the website/web-app is done. Nowadays, applications are never done, there is always a bug to fix, an error to add, a new functionality to implement, ... Most go through a shared document with "new keys to translate" or, worst, use the english sentence as a key and, after some typos, the sentence is translated in some language only after a user has reported it. A complete i18n library ought to offer a centralized way for coders to manage keys that allow translators to know what has to be translated, what has been modified (imagine the fuss when a dev changes a bit the meaning of an entry)
- Even if interpolation is based on a regular language and not a context-free one, it should allow wide customization, even JS(/TS) processors to be implemented by the developer. Also, some interpolation should be done once for all (like durations, ...) and use as much as possible the native Intl capacities
- Changing the displayed language should just retrieve the smallest needed dictionary, not the whole page, and re-translate the page dynamically - especially with modern frameworks like svelte, vue, ... (but not only)
Zones
(key-spaces?) is something I never found in any i18n library: allowing the translator to translate all the texts (emails sent by the server, messages, admin pages, ...) a centralized way, while the programer specifies which text has to be downloaded (or not) by whom (minimum needed dictionary transferred)
The library is composed of a server part and a client part.
The server:
- takes an object containing a
list
function that will query the DB - exposes a
condense
function that retrieve a condensed (processed) version of the dictionary for a locale (completely json-able).
The client part is a I18nClient
that will remember a locale and manage the queries to a server and language changes.
This client will produce Translator
s who make some wizardry like a proxy on a function that can be a string to describe text retrieval.
ℹ️ For some obscure reasons, the library is completely working on static applications
import { I18nServer, I18nClient } from 'omni18n'
const server = new I18nServer(myDBinterface)
const client = new I18nClient(['en-US'], server.condense)
// Here is where the actual DB-query occurs
const T = await client.enter()
// Will all display the entry `msg.hello` for the `en-US` (or `en`) locale
console.log(`${T.msg.hello}, ...`)
console.log(T.msg['hello'] + ', ...')
console.log(T('msg.hello') + ', ...')
The full-stack case will insert the http protocol between client
and server
. The condense
function takes few json-able arguments and return a (promise of) json-able object so can go through an http request.
The "Omni" part is that it can be integrated for various asynchronous scenarios and in many frameworks.
I18nServer
will never be instantiated in the browser, only a condense
function that will be linked to an HTTP request
import { I18nClient, type Condense } from 'omni18n'
const fetchCondensed: Condense = async (locales: Locale[], zones: string[])=> {
...
return result as CondensedDictionary[]
}
const client = new I18nClient(['en-US'], fetchCondensed)
// With many frameworks, dictionary data might be available on page load
client.usePartial(preloadedData)
// Here is where the actual download occurs if needed
const T = await client.enter()
The usePartial
usage is described here
In interactive mode (using InteractiveServer
), the DB interface contains modification functions and the server exposes modification function, that will modify the DB but also raise events. In this case, an InteractiveServer
instance has to be created for every client, with an interface toward the DB and a callback for event raising.
Two interfaces allow to implement an interface to any database: DB
(who basically just has a list
) and InteractiveDB
who has some modification access
Two are provided: a MemDB
who is basically an "in-memory database" and its descendant, a FileDB
who allows:
- reading from a file
- maintaining the files when changes are brought
The FileDB
uses a human-accessible js-like format for custom types) and based on \t
indentation file format only proper for this usage.
Having the translators managing translations in the UI while the devs have to access the file to add/remove keys, change their zone, ... and all this to go through git commits (so, to have local changes that will be integrated in the project after push/merge) can be done with FileDB
- for this, just interface a PUT
to a call on InteractiveServer::modify
(while that server has a FileDB
as a source) then the new file will be saved soon with the modified values.
Text keys are used as path, mostly beginning with a type (fld, msg, err, cmd, ...) and more sub-specification if needed.
ℹ️ I personally use such shortcuts, but feel free to use "field", "message", ...
- A key path can contain a translation AND sub-keys
- In such case, the most precise translation is used even if the sub-key does not exist
Example:
{
"fld.bday": "Birth date",
"fld.bday.short": "B-dt",
"fld.name": "Name"
}
In this case, both T.fld.name
and T.fld.name.short
will retrieve "Name"
so that, if the project uses shortened notations, it can display T.fld[field].short
without demanding all the fields to have a short
version in all languages
Rule of the thumb: No value should be given as root keys. Every meaningful text has a category and should therefore be a sub-key. Also, some helpers function detect if there is a dot to identify text-keys vs. other kind of designations.
If we take the examples of en-GB
and en-US
, four locales are going to be used: en-GB
and en-US
of course, en
who will take care of all the common english texts and ''
(the empty-named local) who contains technical things common to all languages.
So, downloading en-US
will download ''
overwritten with en
then overwritten with en-US
.
Common things are formats for example: format.price: '{number::$2|style: currency, currency: $1}'
for prices allowing T.format.price(currency, amount)
An application can therefore only define an
en
locale and useen-GB
anden-US
in order to have different number/date formatting.
I18nClient
is constructed with an array of locales. These are the locales "most preferred first". One can easily use the user's settings (often the interface propose "fallbacks") and add hard-coded the language(s) used by the developers.
Zones are "software zones". Each user don't need the whole dictionary. Some texts for example are only used in administration pages and should not be downloaded by everyone. A good way to divide zones for example is with a user's rights. Another way is even to have a zone per page/user-control. If zones are well entered, the whole needed dictionary will be loaded for the loaded page and complement added along browsing.
A special zone is server
who will contain texts never downloaded by the client, like registration emails and other texts used server-side only
Zones are in trees. admin.teams
will use the vocabulary of 3 zones: admin.teams
, admin
and the root zone ``.
Root zone that will contain all the common "Yes", "No", internals
, ....
In case of PoC, only the root zone can be used.
ℹ️ The library is optimized to download only the missing parts through a user's browsing experience.
⚠️ Zones are not different name spaces for text keys, each key is unique and has an associated zone.
Interpolation occurs to all texts (even if no arguments are given) and basically convert {...}
values using JS processors and/or the arguments.
It heavily relies on the hard-coded Intl mechanism, especially for date/number formats as well as plural and ordinal formations.
Examples:
Hello {$1|here}
There {plural::$1|is|are} {number::$1} {plural::$1|entry|entries}
{number:: $price | style: currency, currency: $currency}
Error reporting can be done eby overriding the report(key: string, error: string, spec?: object): void
function of I18nClient
- SvelteKit - Svelte4
- Nuxt - Vue3
- Editor - To edit
FileDB
dictionaries and give an idea of "translation page"
- tests:
- interpolation errors
- deserialization errors
- using client.onModification
- UMD E2E
The best way to report a bug is to PR a failing unit test.
Any DB or framework adaptation can be published separately, just report it and the reference will be added here.