For the driver interface (i.e., this MR), I'm still hoping to get some feedback from someone on the Mesa side. After all, if NVIDIA is the only driver that implements it, then that rather defeats the purpose of having a vendor-neutral interface.
Note that GLX is the only part that actually requires changes to the vendor library interface. EGL requires a couple of extensions which would need to be implemented. For Vulkan, it's implemented in a separate layer, so no driver changes would be necessary.
For EGL, it just uses EGL_EXT_device_persistent_id and EGL_EXT_explicit_device. Mesa already implements EGL_EXT_explicit_device, but would need to add EGL_EXT_device_persistent_id.
The __EGL_VENDOR_OFFLOAD_NAME query could be made optional, since that's really just a performance optimization -- calling eglQueryDevicesEXT can be really expensive if it causes a driver to power on its GPUs. That said, adding the __EGL_VENDOR_OFFLOAD_NAME query would take all of two lines of code in Mesa -- one line if we change it to use a switch statement instead of an if/else sequence.
So, I think the main challenge in Mesa would be to find the device UUID before display init (rather than after you've created a GL context). For EGL, we'd need that to implement EGL_EXT_device_persistent_id, and for GLX, we'd need to plumb that UUID through loader_get_user_preferred_fd. In both cases, that UUID would still need to match GL_DEVICE_UUID_EXT and VkPhysicalDeviceIDProperties::deviceUUID.
Alas, I ran those use cases with X11 at the time. Initially just X11 without a reboot -- same result. Then, after rebooting and checking for updates, I reran my examples. No change. Whatever the cause, it is underneath those top layers (I guess). I'd simply be satisfied not to need EGL; if I could turn that off or side-step it, I could carry on.
For me the killer is the core dumps from different EGL tooling, e.g.
$ eglinfo -B
core dumps every time, on both Wayland and X11, with the message "corrupted size vs. prev_size" ?!
$ eglinfo -B
GBM platform:
EGL API version: 1.5
EGL vendor string: NVIDIA
EGL version string: 1.5
EGL client APIs: OpenGL_ES OpenGL
OpenGL core profile vendor: NVIDIA Corporation
OpenGL core profile renderer: NVIDIA GeForce RTX 2080/PCIe/SSE2
OpenGL core profile version: 4.6.0 NVIDIA 535.129.03
OpenGL core profile shading language version: 4.60 NVIDIA
OpenGL compatibility profile vendor: NVIDIA Corporation
OpenGL compatibility profile renderer: NVIDIA GeForce RTX 2080/PCIe/SSE2
OpenGL compatibility profile version: 4.6.0 NVIDIA 535.129.03
OpenGL compatibility profile shading language version: 4.60 NVIDIA
OpenGL ES profile vendor: NVIDIA Corporation
OpenGL ES profile renderer: NVIDIA GeForce RTX 2080/PCIe/SSE2
OpenGL ES profile version: OpenGL ES 3.2 NVIDIA 535.129.03
OpenGL ES profile shading language version: OpenGL ES GLSL ES 3.20
corrupted size vs. prev_size
Aborted (core dumped)
Getting the same behavior with Nouveau is expected -- if it's not able to load or use the NVIDIA client-side driver for some reason, then it'll end up with Mesa, which is the same place it would be with Nouveau.
One thing that could be worth trying is running a normal X11 session instead of Wayland. As I noted above, the NVIDIA driver doesn't support EGL on XWayland yet.
Thank you @kbrenneman ... Yes, that makes sense. Something driver-related, then, that barfs on the EGL side (gulp!). I am also getting this same problem when using a native Rust library on Ubuntu 23.10 (Wayland AND X11).
It rather seems that the problem lies outside Flatpak/Flatseal's context. To me at least -- though I don't know what I'm talking about; I'm just muddling through, trying to make sense of stuff.
Just wondering, then: since upgrading from Kubuntu 23.04 to Kubuntu 23.10, the NVIDIA driver I'm using was NOT changed. The thing that changed was 23.04 ==> 23.10. Everything always worked on Wayland under 23.04 (as far as I know), and Flatseal worked fantastically. Now, though, I have multiple Flatpak apps and one or two native apps locking up with this same EGL message.
$ flatpak list | grep nvidia
nvidia-535-129-03 org.freedesktop.Platform.GL.nvidia-535-129-03 1.4 system
The currently in-use system driver is: nvidia-535 (metapackage)
Would it perhaps help to go back to the Nouveau driver, just to see if that works around the issue? I think I did try that, with no change; but I'm not sure, so it may be worthwhile giving it another go.
Something that came up when discussing another bug is that I think this could be used for something like the inverse of the usual GPU offloading arrangement.
With X11 and Wayland, the client-side EGL driver can tell which device the display server is running on using DRI3Open or wl_drm. By default, the dGPU's driver would skip a native display if the server is running on the iGPU.
But, if an application calls eglGetDisplay(NULL) or the eglGetPlatformDisplay equivalent with EGL_EXT_platform_device or EGL_MESA_platform_surfaceless, then there is no display server involved, so the driver can't make any such distinction. As a result, the dGPU driver would respond to that call, possibly waking up the dGPU to do so.
To avoid that, something (possibly a startup script for a desktop environment) could generate a config file with a default profile that specifies whatever device the desktop is running on. Then, any application that calls eglGetDisplay(NULL) would end up with that device.
To do that, we'd need to make sure that any application-specific configurations take priority over that, which would be tricky to do using only the directory search order. Also, you'd want to put that in a per-session (rather than per-user) directory, and I don't know of any standard place for such a thing.
We'd also need to be able to limit the eglQueryDevicesEXT calls that libglvnd makes internally, to avoid unnecessarily waking up any GPUs. It would be pretty easy to add a name for each driver like we have with GLX, which would be enough to limit the eglQueryDevicesEXT call to that driver. For any finer granularity than that, though, we'd need a new query of some sort.
I haven't yet, though !224 would be the more relevant change for Mesa. By design, the details of the config file are opaque to the vendor libraries.
I don't have a Fedora system handy at the moment, but I just tested on a minimal Ubuntu installation, which has /usr/bin/python3, but not /usr/bin/python or /usr/bin/python2, and it works as expected: The configure script finds /usr/bin/python3, and the makefiles all explicitly run /usr/bin/python3 to run any Python scripts.
@paulthomas100199 -- Can you attach the config.log file from when you see the build fail?
Also, if you manually tell it where to find the Python interpreter, does that help? You can do that by adding PYTHON=/usr/bin/python3 as an argument to configure.
That tells me that automake isn't finding the Python executable for some reason, or else isn't using it.
If python doesn't exist in $PATH, then the AM_PATH_PYTHON call should find /usr/bin/python3, and then anything that uses Python should explicitly run it as /usr/bin/python3 <script>.py.
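To illustrate the fallback, here is a rough sketch of what that search amounts to. Note this is only an approximation -- AM_PATH_PYTHON's actual candidate list and ordering vary by automake version:

```shell
# Walk candidate interpreter names in order and report the first one on PATH,
# roughly how AM_PATH_PYTHON falls back when "python" is absent
find_python() {
    for py in python python3 python2; do
        if command -v "$py" >/dev/null 2>&1; then
            command -v "$py"   # print the full path, e.g. /usr/bin/python3
            return 0
        fi
    done
    echo "no python interpreter found" >&2
    return 1
}
find_python
```

On a system like the minimal Ubuntu install described above, this prints the path to python3 even though plain python doesn't exist.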
Do you have the NVIDIA package installed in flatpak?
You can check that with flatpak list. On my system, for example:
$ flatpak list | grep nvidia
nvidia-535-129-03 org.freedesktop.Platform.GL.nvidia-535-129-03 1.4 flathub user
nvidia-535-129-03 org.freedesktop.Platform.GL32.nvidia-535-129-03 1.4 flathub user
Flatpak will map the Wayland and X11 sockets through to the container along with any necessary device nodes, and the runtime package will include libglvnd and Mesa. However, the NVIDIA driver has a separate extension package. Also, since the user-space and client-side NVIDIA libraries (in the container) have to match the kernel-space and server-side libraries (in the host system), flatpak has separate packages for each NVIDIA driver version.
So, if you don't have the flatpak extension for whichever version of the NVIDIA driver you have in the host, then the container won't be able to use the NVIDIA driver. When that happens, libglvnd will move on to try Mesa, which in turn will try to run with its software renderer.
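As a concrete illustration of that version matching: the flatpak extension name just encodes the host driver version with the dots replaced by dashes (this helper is hypothetical, and the version string is only an example -- use whatever your host actually reports, e.g. via nvidia-smi):

```shell
# Hypothetical helper: build the flatpak GL extension name that should match
# a given host NVIDIA driver version (dots become dashes in the name)
driver_to_extension() {
    echo "org.freedesktop.Platform.GL.nvidia-$(echo "$1" | tr . -)"
}

# Example host driver version, as reported by e.g.:
#   nvidia-smi --query-gpu=driver_version --format=csv,noheader
driver_to_extension "535.129.03"
# prints: org.freedesktop.Platform.GL.nvidia-535-129-03
```

That's the ID you'd pass to flatpak install if the extension matching your host driver turns out to be missing.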
Also, note that the NVIDIA driver does not support EGL with XWayland yet. Using EGL_EXT_platform_wayland (i.e., talking to Wayland directly) should work fine, but EGL_KHR_platform_x11 currently only works with a regular Xorg server.
I wonder if we could somehow bump it...
Did you ever write to one of the mesa mailings lists? https://docs.mesa3d.org/lists.html
I was compiling from the spec file on Fedora, using autotools. We can see that there is a patch in Fedora's spec file that changes python to python3.
Hmmm ... Flatseal is an app under flatpak, and flatpak encapsulates or containerises each application -- it was unexpected to see this error in a containerised context. Yes/no? I don't think it is my hardware -- it might also be something to do with the NVIDIA driver in use:
I'm happy to attach logs, console output, run experiments, or such. I have no idea whether I can apply Valgrind from user space for this error/context. I am only running a desktop app -- I doubt I have much in the way of debug info. I was using this exercise to "learn" Rust; there is probably potential to get a stack trace in a future library patch release(????). The flatpak situation seems to point to my environment -- it might be a good thing if there were a diagnostic flatpak module we could all load to check on this kind of thing. Wooo wooo!
Which build system are you using, autotools or Meson?
Either one is supposed to find and use the correct Python executable, regardless of whether it's installed as python, python2, or python3. If that's not working, then we should figure out why.
Currently, /usr/bin/python is provided by a separate package in Fedora. If that package is not installed, the compilation will fail.
[root@fedora ~]# cat /etc/fedora-release
Fedora release 38 (Thirty Eight)
[root@fedora ~]# which python
/usr/bin/which: no python in (/root/.local/bin:/root/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin)
[root@fedora ~]# repoquery -q -f "/usr/bin/python"
python-unversioned-command-0:3.11.2-1.fc38.noarch
python-unversioned-command-0:3.11.6-1.fc38.noarch
[root@fedora ~]# repoquery -q -l python-unversioned-command
/usr/bin/python
/usr/share/man/man1/python.1.gz
[root@fedora ~]# repoquery -q -i python-unversioned-command
Name         : python-unversioned-command
Version      : 3.11.2
Release      : 1.fc38
Architecture : noarch
Size         : 11 k
Source       : python3.11-3.11.2-1.fc38.src.rpm
Repository   : fedora
Summary      : The "python" command that runs Python 3
URL          : https://www.python.org/
License      : Python-2.0.1
Description  : This package contains /usr/bin/python - the "python"
             : command that runs Python 3.
Name         : python-unversioned-command
Version      : 3.11.6
Release      : 1.fc38
Architecture : noarch
Size         : 12 k
Source       : python3.11-3.11.6-1.fc38.src.rpm
Repository   : updates
Summary      : The "python" command that runs Python 3
URL          : https://www.python.org/
License      : Python-2.0.1
Description  : This package contains /usr/bin/python - the "python"
             : command that runs Python 3.
[root@fedora ~]# echo $PATH
/root/.local/bin:/root/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin
The Python scripts are written such that they'll work with Python 2 or 3.
@paulthomas100199 -- Is there some place you've found where the scripts don't work with Python 2, or where python isn't available in $PATH?
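As an aside, the usual way such scripts stay compatible with both interpreters is to stick to syntax both accept, e.g. the __future__ print import (I'm not claiming libglvnd's scripts use exactly this idiom -- it's just the common pattern). This smoke-tests it with whichever python3 is on PATH:

```shell
# Run a 2-and-3 compatible snippet; under Python 2 the __future__ import makes
# print a function, and under Python 3 it is a harmless no-op
python3 -c 'from __future__ import print_function
print("works on 2 and 3")'
```

The same snippet run under a Python 2 interpreter would print the same thing, which is the whole point.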
modify python to python3
Hi! I was debugging a related issue and this was very helpful.
Since mesa 23.3, zink is queried before swrast, so EGL_EXT_platform_xcb applications will go through it. Unfortunately, also since mesa 23.3, zink crashes on top of the NVIDIA Vulkan driver. The result is that applications using xcb + EGL now just crash... (namely: vlc, xwaylandvideobridge)
Hi! Sorry if this question is solely vendor-specific or should be created somewhere else!
I am exploring an X11 OpenGL app that would use only xcb and EGL, without any dependency on Xlib or GLX. I went with EGL because it seemingly works on both Wayland and X11, and I don't want more dependencies than I absolutely need. I found an example of an Xlib + EGL app (https://gist.github.com/mmozeiko/911347b5e3d998621295794e0ba334c4) and ported it to xcb + EGL (https://gist.github.com/valignatev/60fdd91fefabd131a0e53fe2e3ef0ec7), where I used eglGetPlatformDisplayEXT with EGL_PLATFORM_XCB_EXT.
It works on my machine (with an integrated Intel GPU), but people with NVIDIA report that the Xlib version works well while the xcb version falls back to software rasterization and produces a few warnings, such as:
libEGL warning: DRI3: Screen seems not DRI3 capable
libEGL warning: DRI2: failed to authenticate
Their eglinfo advertises EGL_EXT_platform_xcb in the client extensions.
So my question is: did I do something wrong while porting from Xlib to xcb, or is this a case where NVIDIA doesn't handle EGL_EXT_platform_xcb correctly in their vendor implementation? I've tried using Xlib only to get the EGLDisplay, then getting the xcb connection and using that for everything else, and that works as well, but I'd keep it as my last resort; I'd really rather do pure xcb if possible.
Do you know where I should be looking to fix it (or to find out why NVIDIA falls back to swrast)? And, again, I'm sorry if this is the wrong place for such questions; let me know where I should direct it if that's the case.