Changeset 28a5ebd in mainline for uspace/app/bdsh/tok.c


Timestamp:
2020-06-18T15:39:50Z
Author:
Martin Decky <martin@…>
Branches:
lfn, master, serial, ticket/834-toolchain-update, topic/msim-upgrade, topic/simplify-dev-export
Children:
ce52c333
Parents:
4f663f3e
Message:

Use char32_t instead of wchar_t to represent UTF-32 strings

The intention of the native HelenOS string API has always been to
support Unicode in the UTF-8 and UTF-32 encodings as the sole character
representations and to ignore the obsolete mess of older single-byte and
multibyte character encodings. Before C11, the wchar_t type was
slightly misused for UTF-32 strings. The newer char32_t type is
obviously a much more suitable option. The standard defines char32_t as
uint_least32_t, thus we can take the liberty of fixing it to uint32_t.
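
As a rough illustration of the point above, the following sketch uses the standard C11 `<uchar.h>` header and checks at compile time that `char32_t` is exactly as wide as `uint32_t`. The helper `utf32_length` is a hypothetical name used only for this example and is not part of the HelenOS string API.

```c
#include <assert.h>   /* static_assert (C11) */
#include <stddef.h>
#include <stdint.h>
#include <uchar.h>    /* char32_t */

/*
 * Assumption: on the target platform uint_least32_t is exactly 32 bits
 * wide, so char32_t and uint32_t have the same size.
 */
static_assert(sizeof(char32_t) == sizeof(uint32_t),
    "char32_t is expected to be exactly 32 bits wide");

/* Hypothetical helper: count code points in a NUL-terminated UTF-32 string. */
static size_t utf32_length(const char32_t *s)
{
    size_t n = 0;

    while (s[n] != 0)
        n++;

    return n;
}
```

Because every element of a UTF-32 string is one code point, the length in characters is simply the number of elements before the terminator, which is not true of a `wchar_t` string on platforms where `wchar_t` is 16 bits wide.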

To maintain compatibility with the C Standard, the putwchar(wchar_t)
functions have been replaced by our custom putuchar(char32_t) functions
where appropriate.
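
For readers unfamiliar with the idea, a character-output function that takes `char32_t` could look roughly like the sketch below, written in portable C. The names `encode_utf8` and `sketch_putuchar` are hypothetical; this is not the HelenOS `putuchar` implementation, only an assumption of how a UTF-32 code point might be written to a UTF-8 byte stream.

```c
#include <stddef.h>
#include <stdio.h>
#include <uchar.h>

/*
 * Hypothetical helper: encode one code point as UTF-8 into buf (at least
 * 4 bytes). Returns the number of bytes written, or 0 if ch is not a
 * valid Unicode scalar value (a surrogate or above U+10FFFF).
 */
static size_t encode_utf8(char32_t ch, unsigned char *buf)
{
    if (ch < 0x80) {
        buf[0] = (unsigned char) ch;
        return 1;
    }
    if (ch < 0x800) {
        buf[0] = (unsigned char) (0xC0 | (ch >> 6));
        buf[1] = (unsigned char) (0x80 | (ch & 0x3F));
        return 2;
    }
    if (ch >= 0xD800 && ch <= 0xDFFF)
        return 0;   /* surrogates are not encodable */
    if (ch < 0x10000) {
        buf[0] = (unsigned char) (0xE0 | (ch >> 12));
        buf[1] = (unsigned char) (0x80 | ((ch >> 6) & 0x3F));
        buf[2] = (unsigned char) (0x80 | (ch & 0x3F));
        return 3;
    }
    if (ch <= 0x10FFFF) {
        buf[0] = (unsigned char) (0xF0 | (ch >> 18));
        buf[1] = (unsigned char) (0x80 | ((ch >> 12) & 0x3F));
        buf[2] = (unsigned char) (0x80 | ((ch >> 6) & 0x3F));
        buf[3] = (unsigned char) (0x80 | (ch & 0x3F));
        return 4;
    }
    return 0;
}

/*
 * Hypothetical stand-in for a putuchar-like function: write one code
 * point to stdout as UTF-8. Returns 0 on success, -1 on failure.
 */
static int sketch_putuchar(char32_t ch)
{
    unsigned char buf[4];
    size_t len = encode_utf8(ch, buf);

    if (len == 0)
        return -1;

    return fwrite(buf, 1, len, stdout) == len ? 0 : -1;
}
```

Taking `char32_t` rather than `wchar_t` keeps the argument width independent of the platform's wide-character type, which is the motivation the message gives for the replacement.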

File:
1 edited

  • uspace/app/bdsh/tok.c

--- uspace/app/bdsh/tok.c	(r4f663f3e)
+++ uspace/app/bdsh/tok.c	(r28a5ebd)
@@ -36,7 +36,7 @@
 
 /* Forward declarations of static functions */
-static wchar_t tok_get_char(tokenizer_t *);
-static wchar_t tok_look_char(tokenizer_t *);
-static errno_t tok_push_char(tokenizer_t *, wchar_t);
+static char32_t tok_get_char(tokenizer_t *);
+static char32_t tok_look_char(tokenizer_t *);
+static errno_t tok_push_char(tokenizer_t *, char32_t);
 static errno_t tok_push_token(tokenizer_t *);
 static bool tok_pending_chars(tokenizer_t *);
@@ -92,5 +92,5 @@
 {
         errno_t rc;
-        wchar_t next_char;
+        char32_t next_char;
 
         /* Read the input line char by char and append tokens */
@@ -182,5 +182,5 @@
 {
         errno_t rc;
-        wchar_t next_char;
+        char32_t next_char;
 
         while ((next_char = tok_look_char(tok)) != 0) {
@@ -214,5 +214,5 @@
 
 /** Get a char from input, advancing the input position */
-wchar_t tok_get_char(tokenizer_t *tok)
+char32_t tok_get_char(tokenizer_t *tok)
 {
         tok->in_char_offset++;
@@ -221,9 +221,9 @@
 
 /** Get a char from input, while staying on the same input position */
-wchar_t tok_look_char(tokenizer_t *tok)
+char32_t tok_look_char(tokenizer_t *tok)
 {
         size_t old_offset = tok->in_offset;
         size_t old_char_offset = tok->in_char_offset;
-        wchar_t ret = tok_get_char(tok);
+        char32_t ret = tok_get_char(tok);
         tok->in_offset = old_offset;
         tok->in_char_offset = old_char_offset;
@@ -232,5 +232,5 @@
 
 /** Append a char to the end of the current token */
-errno_t tok_push_char(tokenizer_t *tok, wchar_t ch)
+errno_t tok_push_char(tokenizer_t *tok, char32_t ch)
 {
         return chr_encode(ch, tok->outbuf, &tok->outbuf_offset, tok->outbuf_size);
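
The get/look pair in the diff peeks at the next character by reading it and then restoring both offsets. A minimal standalone sketch of that pattern follows. The type `sketch_tokenizer_t`, the functions `sketch_get_char` and `sketch_look_char`, and the simplified decoder `decode_utf8` are all hypothetical stand-ins, since the body of the real `tok_get_char` and the HelenOS decoding routine are not shown in this changeset.

```c
#include <stddef.h>
#include <uchar.h>

/* Hypothetical, reduced tokenizer state: UTF-8 input plus two positions. */
typedef struct {
    const char *in;          /* NUL-terminated UTF-8 input */
    size_t in_offset;        /* position in bytes */
    size_t in_char_offset;   /* position in characters */
} sketch_tokenizer_t;

/*
 * Simplified UTF-8 decoder (assumption: no overlong or range checks).
 * Returns 0 at end of input without advancing, U+FFFD on a malformed
 * or truncated sequence.
 */
static char32_t decode_utf8(const char *s, size_t *offset)
{
    unsigned char b0 = (unsigned char) s[*offset];
    size_t extra;
    char32_t ch;

    if (b0 == 0)
        return 0;

    (*offset)++;
    if (b0 < 0x80) {
        return b0;
    } else if ((b0 & 0xE0) == 0xC0) {
        extra = 1;
        ch = b0 & 0x1F;
    } else if ((b0 & 0xF0) == 0xE0) {
        extra = 2;
        ch = b0 & 0x0F;
    } else if ((b0 & 0xF8) == 0xF0) {
        extra = 3;
        ch = b0 & 0x07;
    } else {
        return 0xFFFD;
    }

    for (size_t i = 0; i < extra; i++) {
        unsigned char b = (unsigned char) s[*offset];

        if ((b & 0xC0) != 0x80)
            return 0xFFFD;   /* continuation byte missing */
        ch = (ch << 6) | (b & 0x3F);
        (*offset)++;
    }

    return ch;
}

/* Read one character and advance both positions. */
static char32_t sketch_get_char(sketch_tokenizer_t *tok)
{
    tok->in_char_offset++;
    return decode_utf8(tok->in, &tok->in_offset);
}

/* Peek: read one character, then restore both positions. */
static char32_t sketch_look_char(sketch_tokenizer_t *tok)
{
    size_t old_offset = tok->in_offset;
    size_t old_char_offset = tok->in_char_offset;
    char32_t ret = sketch_get_char(tok);

    tok->in_offset = old_offset;
    tok->in_char_offset = old_char_offset;
    return ret;
}
```

Both offsets have to be saved because one character can span several bytes in UTF-8, so the byte position and the character position advance by different amounts.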