What steps will reproduce the problem? 1. SLES 11.3 with slightly patched 3.16 kernel Linux memcached9 3.16.3-4.1.100-default #1 SMP Thu Sep 18 06:32:16 UTC 2014 (d2bbe7f) x86_64 x86_64 x86_64 GNU/Linux
64 Gig total memory
memcached 1.4.24, built from source with CFLAGS="-g3 -O3" + --disable-coverage
memcached started as /usr/sbin/memcached -d -F -m 60000 -k -u nobody -p 11211 -c 65535 -t 12 -o lru_maintainer lru_crawler hash_algorithm=murmur3 slab_reassign
crashes with [1565425.309019] traps: memcached[17639] general protection ip:4123a7 sp:7feade1ece70 error:0 in memcached-1.4.24-33[400000+20000]
$ addr2line -e /usr/sbin/memcached 4123a7 memcached-1.4.24/items.c:1183
we use lru_crawler quite a bit and have custom scripts that use "slabs reassign" to move memory between slabs based on slabs stats (i.e. in house custom slab reassign).
As an anecdote, 1.4.21 occasionally falls into infinite loop under these conditions as well, and we were trying to fix that by upgrading.
Comment #1
Posted on Jul 5, 2015 by Helpful Birdcrap, I wish I saw this sooner.
What is your items.c:1183?
It looks like: if (it->prev) { it->prev->next = it; } ^ middle line.
I'd love to know how that's possible. any chance of getting a backtrace?
Comment #2
Posted on Jul 6, 2015 by Helpful WombatYeah, that code. But no backtrace, sorry. Can't reproduce in a lab, and scary to "test on production" :-|
Status: New
Labels:
Type-Defect
Priority-Medium