Feature reporting revamp

~~@gnachman Could you please copy over your comment from [the old location]? Then I'll delete that gist.~~ It's done now.

mentioned in issue #7

This is a fairly large and complex question. I'll just make a couple of comments:

No matter what we choose to do, for it to be viable it has to get buy-in from ssh clients, particularly OpenSSH. The simplest change for OpenSSH to make would be to copy the existing terminfo database to the remote computer automatically when connecting. See https://sw.kovidgoyal.net/kitty/faq.html#i-get-errors-about-the-terminal-being-unknown-or-opening-the-terminal-failing-when-sshing-into-a-different-computer for an example of how to do that today with a one-liner.

It is true that terminfo is a pretty badly designed database, but continuing to use it is the simplest way forward.

Unlike in the past, terminal emulator developers nowadays are mostly in good communication with each other, thanks to the internet. This means that it is viable to have central standards and simply have whatever database we use indicate whether or not a terminal supports the feature as defined in the standard. So all we need is a boolean.

Thank you for putting this document together. This feature is overdue, to say the least. I appreciate how much thought you've put into it. I have one concern, and that is getting ssh to work soon enough that we can gain widespread adoption.

OpenSSH is popular, but there are many other SSH clients and servers. There is also telnet and mosh. I am concerned that getting this file from one place to another is going to be too difficult because it requires so many pieces of software to be changed before it can be useful. At a minimum, you'd need to update your:

terminal emulator
ssh client
ssh server
applications you run on the server

Moving a file around automatically is a little scary from a security POV. It's a new attack surface. Users don't expect ssh to do this, so it violates the principle of least astonishment.

My preference is to put this in an environment variable in a very compact encoding. ssh already has a mechanism for moving environment variables around. Environment variables do not have the same problems that files do. Filesystems are terrible:

Network file systems can hang indefinitely on any action
File system permissions can be broken
File may be visible to other users
Files can possibly be modified by other users
Machines may have read-only file systems
The file system is global state. Global state is bad.
Files in /tmp can be deleted at any time
File systems can be full

Files have two benefits over environment variables:

They can be big.
They can change if you change terminal emulators (as could happen with tmux)

I would argue that being big is an anti-feature, since users may bear the cost of copying this thing around. For the most part we're communicating booleans (we support features x, y, and z). The number of features is on the order of hundreds. A base-64 encoded array of 100 bits takes ~20 bytes. This could be further reduced with a better encoding.
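To make the size estimate concrete, here is a rough sketch of packing 100 boolean flags into a base64 string. The function name and the bit order (most significant bit first) are my own assumptions for illustration, not part of any proposal in this thread.

```python
import base64

def encode_bits(bits):
    """Pack a list of 0/1 flags into bytes (MSB first) and base64-encode them."""
    n_bytes = (len(bits) + 7) // 8
    value = 0
    for bit in bits:
        value = (value << 1) | (1 if bit else 0)
    # Left-align the flags so the unused padding bits end up in the last byte.
    value <<= n_bytes * 8 - len(bits)
    return base64.b64encode(value.to_bytes(n_bytes, "big")).decode("ascii")

flags = [1] * 100            # hypothetical terminal that supports all 100 features
encoded = encode_bits(flags)
print(len(encoded))          # 20 characters: 100 bits -> 13 bytes -> 20 base64 chars
```

The 20 characters include base64 padding; dropping the trailing `=` characters would save a couple more bytes.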

Using a file does not completely solve the problem of attaching to tmux with a different terminal emulator. You can still be attached to the same session with two different terminal emulators at the same time. Applications would need to be relaunched to pick up changes to the file. This is an advantage for a file over an environment variable, but I don't think it's a large one.

Unlike in the past, terminal emulator developers nowadays are mostly in good communication with each other, thanks to the internet. This means that it is viable to have central standards and simply have whatever database we use indicate whether or not a terminal supports the feature as defined in the standard. So all we need is a boolean.

My problem with terminfo is that it's too slow to change. People will update their applications much more often than their terminfo databases. If I ship a new feature and neovim supports it in their next release, users should be able to use it immediately after upgrading the terminal and neovim.

You have a good point about the various ssh implementations. I have to admit I assumed OpenSSH was the de facto standard used pretty much everywhere, and didn't take other implementations into account. In fact, it's the ssh protocol that needs to be extended first (input needed on whether it's actually extensible the way I imagined).

I'm not worried about security implications and the principle of least astonishment. If it becomes a widespread feature, it becomes widespread knowledge that this happens. I believe the value of TERM is already forwarded even without mentioning it in Send/AcceptEnv. To be safe, there could be new options Send/AcceptTermdesc for this feature. That would allow a development (unstable) phase first, where the feature isn't yet enabled by default, so we can see if it becomes popular, or deprecate it gracefully if our project fails. The server could throttle the frequency at which it accepts modifications to this data. Things like safely creating a file with a unique name are a piece of cake.
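As a small illustration of the "unique name" point, this is roughly what safe creation looks like with Python's standard library. The function name `write_termdesc` and the `termdesc-` prefix are hypothetical; nothing here is an actual ssh interface.

```python
import os
import stat
import tempfile

def write_termdesc(data: bytes, directory=None) -> str:
    """Write data to a freshly created, uniquely named file and return its path."""
    # mkstemp creates the file exclusively (it never reuses an existing name)
    # and, on POSIX, with permissions that only allow the owner to read/write.
    fd, path = tempfile.mkstemp(prefix="termdesc-", dir=directory)
    with os.fdopen(fd, "wb") as f:
        f.write(data)
    return path

path = write_termdesc(b"example payload")
print(path, oct(stat.S_IMODE(os.stat(path).st_mode)))
os.remove(path)
```

Who cleans these files up afterwards is a separate question that this sketch does not answer.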

Anyway, if we aim to go for this approach, we should seek input from various ssh folks early on about feasibility and indeed possible security aspects.

Most of the issues you mention with files are in my opinion either addressable (e.g. being readable by others) or a non-issue (like NFS, read-only etc., each system should have a writable local place), or not our problem (e.g. if the file system is full, pretty much everything fails big time anyway).

As for being modifiable, which was my main design goal: One use case is tmux, if you change the underlying terminal emulator. It's in fact more complicated, since there can be zero or more than one emulator underneath at any time. I think it would be up to tmux to decide on a case-by-case basis whether it proxies the features actually supported by its current underlying emulator (or the lowest common set in case of multiple emulators, and maybe the properties of the last one if currently unattached), or whether it always reports the most that tmux itself understands and then degrades the behavior towards the underlying emulator. If I were tmux's developer, I'd probably generally go for the latter.
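If capabilities are plain booleans, the "lowest common set" option boils down to a bitwise AND. A minimal sketch, assuming each emulator's features are held as an integer bitmask (the function and parameter names are made up for illustration and are not tmux code):

```python
from functools import reduce

def proxied_capabilities(attached, last_seen=0):
    """Return the capabilities common to all attached emulators.

    attached  -- list of integer bitmasks, one per attached emulator
    last_seen -- bitmask of the most recently detached emulator, used
                 as a fallback when nothing is currently attached
    """
    if not attached:
        return last_seen
    return reduce(lambda a, b: a & b, attached)

# Two emulators that only share the middle two features.
print(bin(proxied_capabilities([0b1110, 0b0111])))   # 0b110
```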

Another use case I had in mind for modifiable data is primarily the current colors (to fix COLORFGBG), but also any other user-modifiable variable. These might include the character encoding and data related to the width calculation of glyphs (see in gnome-terminal the user preference "Ambiguous-width characters: Narrow/Wide"), and probably a few others that don't occur to me right now. These are probably a relatively small subset of the entire set of features we'd report; the question is how important this set is.

For example, looking at the current problem set around width computation that triggered me writing down my thoughts (VTE 767529): Would the final design retain a user preference like "Ambiguous-width characters" or "Preferred Unicode version" or such, or would the final design be one that's fully escape sequence driven and has no user-toggleable feature? Because if keeping the graphical settings, there's no way we could synchronously convey that message inside an environment variable. Wouldn't dropping the modifiability of data undermine our attempts with fixing width calculation?

I don't expect this new feature to get adopted quickly. It'll take time. It's not urgent. I'd rather get it right (that is in my eyes: full featured) this time than having something that can be adopted more quickly, but still doesn't solve all our needs.

Indeed, dropping the runtime modifiability of data would significantly simplify everything. If we take this path, do we end up anywhere other than $TERMCAP? Wouldn't the proper solution then be just to resurrect this variable (with some new attributes added)?

screen/tmux with 2+ underlying emulators is a damn troublesome case. What if screen/tmux knows that the two underlying graphical emulators don't have the same colors, don't have the same font size etc.? Which one should it report towards the application running inside? What if the two emulators' character width methods differ, e.g. tmux knows that one uses Unicode 8 and the other one uses Unicode 9, that is, outputting the high voltage sign will advance by 1 column in one and 2 columns in the other? Which behavior should it report upwards, and how could it make sure not to break the look in any of the underlying ones?

I guess it's fair to say that addressing these are non-goals for us :) (which might contradict my earlier words of trying to get things right this time).

continuing to use [terminfo] is the simplest way forward.

I'm not looking for the simplest way here. I'm looking for the best (simplicity is sure a factor in that).

it is viable to have central standards [and re: gnachman's response, too]

Central standards, like here on terminal-wg, and referring to them as booleans (e.g. maybe even using the feature's tracking number as identifier) – I guess it's okay.

Central repository describing what's supported by which terminal and what's not, like current terminfo (in case that's what you meant) – IMO absolutely no. There are many technical details to be addressed here, e.g. proper versioning, how to make sure that terminfo (or whichever central replacement copied locally) is updated on my computer, how to make sure it's updated on beta testers' computers, how to make sure it's updated in all the stable distros that'll ever be released so that it's at least consistent with the terminal emulators it ships... How to make sure it's updated on any server I'll ssh to...

There's also a human side to the story. I don't want to have to talk to anyone external in order to implement and promote a feature (it could boil down to plenty of human factors, e.g. me or anyone else feeling shy to repeatedly ask such a favor from someone else), I wish to do that solely inside the realm of the terminal emulator I'm developing. I don't want any single person or group of persons having the responsibility of keeping the descriptions fresh, nor handle in the future the case if that project gets abandoned, or is just short on resources and cannot handle the requests in a timely manner.

@gnachman The problem of slow terminfo updates is addressed by shipping terminfo with the emulator, and setting the TERMINFO var in all child processes. This is what kitty does. If ssh can be convinced to auto-copy terminfo, then it works everywhere, otherwise you need something like:

But I agree, in general, that the first step is to get feedback from ssh people on what they are willing to implement.

@egmontkob Have you reached out to any openSSH maintainers? If not, let's work up a proposal and send it to them.

mentioned in issue #9

mentioned in issue #11

I think we should not try to have runtime changing capability. The world is a lot larger than OpenSSH, and there are more channels that terminal command streams travel over. And it makes everything very much more complicated. For example, "su" is just not able to do anything in many normal setups, because it is simply no longer running once it has arranged for the privileged process to be started (it execs its command).

I think we should aim at something simple enough not to turn out to be a security nightmare. After all it will travel over a lot of security boundaries.

I think we should aim to make this very compact too. This would allow it to travel as environment variable, be pasted manually when some transport does not have support, etc.

I think we should mostly be ok with booleans or very small enums. A compact but flexible way could be to have sequential bits allocated by terminal-wg and encoded as base64, plus an additional section of namespaced key-value pairs for experimental features and features that have not yet been assigned one of the sequential bits.

So something as compact as TERMDESC="//////////8:vte.urls=1:iterm.sync=1" (variable length base64 plus : key = value) could represent 64 bits of centrally assigned capabilities and 2 capabilities assigned by terminal implementers. Common capabilities supported by many terminals could then just migrate into the centrally assigned bits over time, keeping the key-value part manageable.
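For what it's worth, reading such a value on the application side is only a few lines. This is a sketch under my own assumptions: the padding of the base64 part is restored before decoding, bit 0 is taken to be the most significant bit of the first byte, and the function names are invented for the example.

```python
import base64

def parse_termdesc(value):
    """Split a TERMDESC-style string into (bitmap bytes, dict of extra keys)."""
    head, *pairs = value.split(":")
    # The example value omits base64 padding, so restore it before decoding.
    padded = head + "=" * (-len(head) % 4)
    bitmap = base64.b64decode(padded)
    extras = {}
    for pair in pairs:
        key, _, val = pair.partition("=")
        extras[key] = val
    return bitmap, extras

def has_capability(bitmap, index):
    """Test bit `index`, assuming bit 0 is the top bit of the first byte."""
    byte, offset = divmod(index, 8)
    return byte < len(bitmap) and bool(bitmap[byte] & (0x80 >> offset))

bitmap, extras = parse_termdesc("//////////8:vte.urls=1:iterm.sync=1")
print(len(bitmap) * 8, extras)   # 64 bits plus the two namespaced keys
```

Bits beyond the end of the bitmap simply read as "not supported", so an older, shorter value stays valid when new bits are allocated.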

We need a good survey of what capabilities should be initially specified. Let's move thinking about what capabilities we need to #11

Here also I think we should specify a sequence-based fallback for interactive applications and for bootstrapping in a shell if the information was not delivered by whatever forwarded a terminal connection. (Of course having it available synchronously is preferable.)

We'll need a list of capabilities, or at the very least a rough guess at the number of capabilities we're interested in supporting. Has anyone got a link to LeoNerd's big spreadsheet of all known terminal capabilities?

I found a few ideas on coding sets of integers compactly I'd like not to lose. I think a RLE scheme would compress well if capability numbers are ordered from most commonly implemented to least-commonly implemented, and then with new ones tacked on at the end (which will of course be quite uncommon at first).
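To show the kind of thing I mean by RLE, here is a generic run-length sketch over a bit string. It is only an illustration of the idea, with made-up function names, and it says nothing about how the run lengths themselves would then be encoded compactly.

```python
def run_lengths(bits):
    """Encode a '0'/'1' string as (first bit, lengths of alternating runs)."""
    if not bits:
        return "", []
    runs = []
    current, count = bits[0], 0
    for bit in bits:
        if bit == current:
            count += 1
        else:
            runs.append(count)
            current, count = bit, 1
    runs.append(count)
    return bits[0], runs

def from_run_lengths(first, runs):
    """Rebuild the bit string from the output of run_lengths."""
    out = []
    bit = first
    for n in runs:
        out.append(bit * n)
        bit = "0" if bit == "1" else "1"
    return "".join(out)

print(run_lengths("1111100100"))   # ('1', [5, 2, 1, 2])
```

With capabilities ordered from most to least commonly implemented, a typical bitmap starts with one long run of ones and ends with one long run of zeros, which is where such a scheme would gain the most.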

After a bit of research I think we should simply base85-encode the bitmap. I tried a bunch of fancy stuff and none of it helped much. Here's my code if you're interested:

bitmap_coding_comparison.py

Here are the results for various distributions of capabilities:

Length=250, t=1
Bits:      0011110111011001000010001110001010110100110110111001111011010100010101110011000000110100000010000101001011010110100001000110110111001111101101101100010010011011010011010101111011100001101111010011010101100000010000100100101101001100101001100010010111
Bitmap:    =^0G4kb$c&Vcx8^NZWkvh+5Kw3j;ce-ge7T#3t_m
Fibonacci: -Lr0H+cP%Jvu4KDZMMwY8!@{ZZL>DbnX_(e*|VFoX6$X7Y}+>2+S@gmyKTE|n;E+{X4$seyRox2>|g
Golomb 2X: 7!Y6p@L&+|VDJ#&5MThX;NS>g_=u?Z;NS>g@Ce{w0C4c|6ln1901%+?;P4P&kYE6y2mo+s;NSpY;P3=!@Cab=;NSpg01#ks=mcN@
Length=250, t=5
Bits:      1111111111111111001011011111001111111111110110111011111011110101111111111111000000110100010110000111001011010110100101000110110111001111101101101100010010011001000010010100101000000000001100000000000100100000000000000100000001001000001001000000000011
Bitmap:    zzGC^fB_%_02HJqNZWkvidxf#iv$1b-**4=?LYs(
Fibonacci: %{FZ8*lb%ixvtwb*|D=TX587cXE$ce+h*Ce&9iZvZrQ9mY;5h>ux55`nQ#
Golomb 2X: h=8DAfM8$%0AK(BpkM%iFaUrc5MU4xVDR8z0C4c|6ln190ALVc@Zj(e7_i^~U<d%F;P~X|@CIb?5MY1
Length=250, t=10
Bits:      1111111111111111111111111111111111111111111111111011111111111101111111111111100100111110010111000111101011010110110101000110110111001011101101101000010010000001000000000100100000000000001000000000000100000000000000000000000001000000000000000000000000
Bitmap:    0001h001BX00e*_NLzI6irUwXpC|wC|9}7g|NsBM
Fibonacci: #n~GM!?R_zvtw-8w`|>;H*K?K+cwzU+cTRnY#T{{
Golomb 2X: zyQEN0H6RMfB>LiK%f9%5TNkz=n!b|@Bm=o@bKX9;P?P&5ai%?`taZ
Length=250, t=20
Bits:      1111111111111111111111111111111111111111111111111111111111111111111111111111111111111111110111001111111111110111110101111110110111001011100101100000000010000000000000000100000000000000001000000000000000000000000000000000000000000000000000000000000000
Bitmap:    0000000001004jhKpJ%I`uqRR|NsC0|NsC0|NsBM
Fibonacci: #CBbmWt(R0o3m?eor5=EEC
Golomb 2X: 00000003a%z(6n%Xb@=d^x)(MXzKU
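For anyone who wants to reproduce the plain bitmap approach without the comparison script, a minimal sketch with Python's `base64.b85encode` follows. The bit order used here (first bit in the string is the most significant bit of the first byte) is an assumption, so the output will not necessarily match the "Bitmap" rows above character for character; only the length does.

```python
import base64

def encode_bitmap(bits):
    """Base85-encode a '0'/'1' string, zero-padded on the right to whole bytes."""
    padded = bits.ljust((len(bits) + 7) // 8 * 8, "0")
    raw = int(padded, 2).to_bytes(len(padded) // 8, "big") if padded else b""
    return base64.b85encode(raw).decode("ascii")

def decode_bitmap(text, length):
    """Inverse of encode_bitmap; `length` is the original number of bits."""
    raw = base64.b85decode(text)
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits[:length]

bits = "1" * 90 + "0" * 160      # hypothetical 250-capability bitmap
encoded = encode_bitmap(bits)
print(len(encoded))              # 40: 250 bits -> 32 bytes -> 40 base85 chars
```

The length is fixed by the number of capabilities, whatever the distribution of set bits.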

@egmontkob Have you reached out to any openSSH maintainers? If not, let's work up a proposal and send it to them.

Nope I haven't; let's have some proposal first :)

The problem of slow terminfo updates is addressed by shipping terminfo with the emulator, and setting the TERMINFO var in all child processes. This is what kitty does. If ssh can be convinced to auto-copy terminfo, then it works everywhere, otherwise you need something like:

Did you mean TERM, or is TERMINFO also a standard variable that I'm unaware of?

Where would ssh place this file? ncurses (well, libtinfo probably) seems to look at ~/.terminfo first and then the global location, but can we count on all other curses implementations and all non-libtinfo-based apps (there really shouldn't be any of the latter kind, though, maybe slang) doing so? Is it okay for ssh just to place the terminfo file here?

What if it overrides a file the user wishes to maintain manually, can we just ignore this use case?

What if someone accesses the server from various different Kitty versions? Shall we encourage ssh to give these files a unique name (e.g. kitty-$RANDOM), but then who/when will clean them up, and won't it break apps that strcmp($TERM, "some-fixed-value")? Or is it good enough just to place the file there using the actual value of TERM?

What about su, sudo; how will the descriptor be found and read across them?

(Kovid, your post ends prematurely, I'm not sure how you meant to finish it.)

I think we should not try to have runtime changing capability.

I guess I'm convinced here; it brings quite some complexity with marginal benefits (practically the color scheme only).

Feature reporting revamp

Feature Reporting in Terminal Emulators

The current methods

Asynchronous escape sequences

Environment variables

TERM

TERMCAP

Other environment variables

Draft recommendation

The termdesc file

Notification of change

ssh

screen, tmux

su, sudo

Asynchronous setting

File format

Privacy

Security

Miscellaneous

Knowledge split

Reporting of disabled attributes

Roadmap
