diff options
author | Denys Vlasenko | 2010-02-01 22:35:30 +0100 |
---|---|---|
committer | Denys Vlasenko | 2010-02-01 22:35:30 +0100 |
commit | 4875e7148b0512ee3c255526a484503da984935a (patch) | |
tree | bc931a92ab19716b86809d1c06a38f1c7c88ec7c /docs/unicode.txt | |
parent | 698dca5805117f470ef19488428c8a5f795b9e0c (diff) | |
download | busybox-4875e7148b0512ee3c255526a484503da984935a.zip busybox-4875e7148b0512ee3c255526a484503da984935a.tar.gz |
docs/unicode.txt: added more TODOs
Signed-off-by: Denys Vlasenko <vda.linux@googlemail.com>
Diffstat (limited to 'docs/unicode.txt')
-rw-r--r-- | docs/unicode.txt | 31 |
1 files changed, 23 insertions, 8 deletions
diff --git a/docs/unicode.txt b/docs/unicode.txt index 019d12f..32df24d 100644 --- a/docs/unicode.txt +++ b/docs/unicode.txt @@ -26,30 +26,45 @@ But we also need to handle the following problematic moments: * Bidirectional handling. If user wants to echo a phrase in Hebrew, he types: echo "srettel werbeH" - Editors + Editors (vi, ed) This case is a bit similar to "shell input", but unlike shell, editors may encounder many more unexpected unicode sequences -(try to load a random binry file...), and they need to preserve +(try to load a random binary file...), and they need to preserve them, unlike shell which can afford to drop bogus input. - more, less -. +Need to correctly display any input file. Ideally, with +ASCII/unicode/filtered_unicode option or keyboard switch. +Note: need to handle tabs and backspaces specially +(bksp is for manpage compat). + + cut, fold, watch + +May need ability to cut unicode string to specified number of wchars +and/or to specified screen width. Need to handle tabs specially. + + sed, awk, grep + +Handle unicode-aware regexp match ls (multi-column display) -. +ls will fail to line up columnar output if it will not account +for character widths (and maybe filter out some of them, see +above). OTOH, non-columnar views (ls -1, ls -l, ls | car) +should NOT filter out bad unicode (but need to filter out +control chars (coreutils does that). Note that unlike more/less, +tabs and backspaces need not special handling. top, ps -. +Need to perform filtering similar to ls. Filename display (in error messages and elsewhere) -. - +Need to perform filtering similar to ls. TODO: write an email to Asmus Freytag (asmus@unicode.org), |