I actually tried it and it was reproducible on my system using sssd 1.11.6 ( ad and ldap config ).
[root@dirac linux]# python initgroups.py cmp12 119549 95 24 200 wrongs number of secondary groups in process 4363 : 5 instead of 95 (sleep 78ms) wrongs number of secondary groups in process 4366 : 5 instead of 95 (sleep 95ms) wrongs number of secondary groups in process 4353 : 5 instead of 95 (sleep 90ms) wrongs number of secondary groups in process 4362 : 5 instead of 95 (sleep 108ms) wrongs number of secondary groups in process 4358 : 5 instead of 95 (sleep 110ms) wrongs number of secondary groups in process 4371 : 5 instead of 95 (sleep 121ms)
I’ve been following the thread because i see this same behavior on our linux cluster which uses sssd for authentication.
When a lot of jobs hit the cluster, sometimes we’ll get failures because of authentication: "failed assumedly before job:can't get password entry for user "wfb6". Either the user does not exist or NIS error!"
Presumably the authentication mechanism could not keep up with the count of requests ( or large number of groups per user in the domain ).
-Chris
On Apr 21, 2015, at 5:37 PM, Jean-Baptiste Denis jbdenis@pasteur.fr wrote:
I think I've got a test case without involving slurm. It is quite reproductible on my machine. Since it looks like a race, you may need to tweak the parameter of the python script.
Hi,
does anyone from the sssd team (or not ;)) had a chance to reproduce the bug with the attached script in the previous message ?
Jean-Baptiste
sssd-users mailing list sssd-users@lists.fedorahosted.org https://lists.fedorahosted.org/mailman/listinfo/sssd-users