Issue #1948: [abrt] sssd-1.9.5-1.fc18: sss_mmap_cache_gr_invalidate_gid: Process /usr/libexec/sssd/sssd_nss was killed by signal 11 (SIGSEGV) - sssd

SSSD / sssd

#1948 [abrt] sssd-1.9.5-1.fc18: sss_mmap_cache_gr_invalidate_gid: Process /usr/libexec/sssd/sssd_nss was killed by signal 11 (SIGSEGV)

Closed: Duplicate None Opened 10 years ago by jhrozek.

Ticket was cloned from Red Hat Bugzilla (product Fedora): Bug 967012

Description of problem:
reboot system

Version-Release number of selected component:
sssd-1.9.5-1.fc18

Additional info:
reporter:       libreport-2.1.4
backtrace_rating: 4
cmdline:        /usr/libexec/sssd/sssd_nss --debug-to-files
crash_function: sss_mmap_cache_gr_invalidate_gid
executable:     /usr/libexec/sssd/sssd_nss
kernel:         3.9.2-200.fc18.x86_64
runlevel:       N 3
uid:            0

Truncated backtrace:
Thread no. 1 (10 frames)
 #0 sss_mmap_cache_gr_invalidate_gid at
src/responder/nss/nsssrv_mmap_cache.c:735
 #1 nss_update_initgr_memcache at src/responder/nss/nsssrv_cmd.c:3498
 #2 nss_memcache_initgr_check at src/responder/nss/nsssrv.c:332
 #3 sbus_message_handler at src/sbus/sssd_dbus_connection.c:430
 #4 _dbus_object_tree_dispatch_and_unlock at dbus-object-tree.c:862
 #6 sbus_dispatch at src/sbus/sssd_dbus_connection.c:104
 #7 tevent_common_loop_timer_delay at ../tevent_timed.c:254
 #8 std_event_loop_once at ../tevent_standard.c:560
 #9 _tevent_loop_once at ../tevent.c:507
 #10 tevent_common_loop_wait at ../tevent.c:608

dpal commented 10 years ago

Fields changed

blockedby: =>
blocking: =>
coverity: =>
design: =>
design_review: => 0
feature_milestone: =>
fedora_test_page: =>
milestone: NEEDS_TRIAGE => SSSD 1.10.0
priority: major => critical
review: True => 0
selected: =>
testsupdated: => 0

dpal commented 10 years ago

Fields changed

rhbz: [https://bugzilla.redhat.com/show_bug.cgi?id=967012 967012] => [https://bugzilla.redhat.com/show_bug.cgi?id=967012 967012] todo

mzidek commented 10 years ago

Fields changed

changelog: =>
owner: somebody => mzidek
status: new => assigned

jhrozek commented 10 years ago

Not a regression, not blocking the 1.10.0 release.

milestone: SSSD 1.10.0 => SSSD 1.10.1

lslebodn commented 10 years ago

725         slot = mcc->hash_table[hash];
726         if (slot > MC_SIZE_TO_SLOTS(mcc->dt_size)) {
727             ret = ENOENT;
728             goto done;
729         }
730
731         while (slot != MC_INVALID_VAL) {
732             rec = MC_SLOT_TO_PTR(mcc->data_table, slot, struct sss_mc_rec);
733             data = (struct sss_mc_grp_data *)(&rec->data);
734
735             if (gid == data->gid) {

From backtrace:

sssd_nss crashed at line 735
rec is not NULL (line 732, rec <- 0x7f77d8f58e98)
rec != NULL therefore data could not be NULL (line 733, data <- 0x7f77d8f58eb8)

The main problem is that pointer "data" has very big value. Reason is count of slots (variable with name slot) is also too big [[BR]]
slot <- 100,689,267. I can not understand why is this value so big, because there is a check on line 726.

Value of some variables:

can be directly find in backtrace
another values was indirectly calculated from cpu registers

*some values can not be retrieved, because values was stored only in registers (optimized out) and later overridden with another values (ex. mcc->dt_size)

Possible explanation:

mcc->dt_size is also very big, therefore ENOENT was not returned
size of allocated data (mcc->data_table) is not the same as mcc->dt_size.
mcc must be corrupted

It is impossible to find real reason without core file or log files.

_comment0: {{{
725 slot = mcc->hash_table[hash];
726 if (slot > MC_SIZE_TO_SLOTS(mcc->dt_size)) {
727 ret = ENOENT;
728 goto done;
729 }
730
731 while (slot != MC_INVALID_VAL) {
732 rec = MC_SLOT_TO_PTR(mcc->data_table, slot, struct sss_mc_rec);
733 data = (struct sss_mc_grp_data *)(&rec->data);
734
735 if (gid == data->gid) {
}}}

From backtrace:
sssd_nss crashed at line 735
rec is not NULL (line 732, rec <- 0x7f77d8f58e98)
* rec != NULL therefore data could not be NULL (line 733, data <- 0x7f77d8f58eb8)

The main problem is that pointer "data" has very big value. Reason is count of slots (variable with name slot) is also too big [[BR]]
slot <- 10,689,267. I can not understand why is this value so big, because there is a check on line 726.

Value of some variables:
can be directly find in backtrace
another values was indirectly calculated from cpu registers
*some values can not be retrieved, because values was stored only in registers (optimized out) and later overridden with another values (ex. mcc->dt_size)

Possible explanation:
mcc->dt_size is also very big, therefore ENOENT was not returned
size of allocated data (mcc->data_table) is not the same as mcc->dt_size.
* mcc must be corrupted

It is impossible to find real reason without core file or log files. => 1375720857977492

lslebodn commented 10 years ago

Fields changed

cc: => lslebodn@redhat.com

jhrozek commented 10 years ago

Moving tickets that didn't make 1.10.1 to the 1.10.2 bucket.

jhrozek commented 10 years ago

Moving tickets that didn't make 1.10.1 to 1.10.2

milestone: SSSD 1.10.1 => SSSD 1.10.2

jhrozek commented 10 years ago

This is a duplicate of #2018 that has better information, so I'm going to fold them into one.

resolution: => duplicate
status: assigned => closed

jhrozek commented 10 years ago

Fields changed

rhbz: [https://bugzilla.redhat.com/show_bug.cgi?id=967012 967012] todo => [https://bugzilla.redhat.com/show_bug.cgi?id=967012 967012]

Metadata Update from @jhrozek:
- Issue assigned to mzidek
- Issue set to the milestone: SSSD 1.10.2

7 years ago

pbrezina commented 4 years ago

SSSD is moving from Pagure to Github. This means that new issues and pull requests
will be accepted only in SSSD's github repository.

This issue has been cloned to Github and is available here:
- https://github.com/SSSD/sssd/issues/2990

If you want to receive further updates on the issue, please navigate to the github issue
and click on subscribe button.

Thank you for understanding. We apologize for all inconvenience.

Metadata

Assignee

mzidek

Tags

None

Blocking

None

Depending on

None

Priority

critical

Milestone

SSSD 1.10.2

type

defect

component

SSSD

version

None

selected

None

testsupdated

patch

rhbz

https://bugzilla.redhat.com/show_bug.cgi?id=967012

design_review

review

changelog

None

keywords

None

coverity

None

mark

None

blocking

None

design

None

sensitive

None

lslebodn@redhat.com

blockedby

None

feature_milestone

None

SSSD / sssd

Source Code

Documentation

#1948 [abrt] sssd-1.9.5-1.fc18: sss_mmap_cache_gr_invalidate_gid: Process /usr/libexec/sssd/sssd_nss was killed by signal 11 (SIGSEGV) Closed: Duplicate None Opened 10 years ago by jhrozek.

Metadata

#1948 [abrt] sssd-1.9.5-1.fc18: sss_mmap_cache_gr_invalidate_gid: Process /usr/libexec/sssd/sssd_nss was killed by signal 11 (SIGSEGV)

Closed: Duplicate None Opened 10 years ago by jhrozek.