In 2007, I was spending my northern-hemisphere summer experimenting with UI automation. Born of those efforts, xdotool came into being when I separated it from another project. The goal was modest: write some scripts that execute common keyboard, mouse, and window management tasks.
The first commit had only a few commands: basic mouse and keyboard actions, plus a few window management actions like movement, focus, and searching. Xdotool sprouted new features as time rolled on. Today, the project is 18 years old, and still going!
I definitely feel this frustration. Unfortunately, I don’t see a good way to bypass this period of fragmentation while protocols are developed. There are a lot of protocols that were a higher priority than these, but they will still receive support. I do think maybe the remote desktop protocol could’ve been a more generic desktop control protocol, which would help with the weird error messages.
It feels like a lot of this is framed as unnecessary security theater, which I wholeheartedly disagree with. Apps should absolutely not by default have the ability to control desktop input. The current lack of support is annoying, but that doesn’t mean it should be implemented insecurely, and a secure implementation takes time. X11 is still supported by Plasma, so if you need these features, that is still an option until they’re fully implemented by your preferred compositor.
Honestly, the fragmentation of input automation doesn’t look that bad to me. It seems as if the big compositors support the remote desktop protocol, even if the popups are a little unintuitive and GNOME doesn’t support saving the permission.