[Users] Kernel 2.6.18-openvz-13-39.1d1-amd64 oops
E Frank Ball III
efball at efball.com
Wed Oct 10 16:04:20 EDT 2007
I'm had a similar crash using the latest 686 kernel from
download.openvz.org:
linux-image-2.6.18-openvz-13-39.1d2-686_028.39.1d2_i386.deb
vzctl --version
vzctl version 3.0.11
Two boxes are using it, one is fine, the other hung once.
All the logs had was this:
Oct 5 19:15:01 penguin /USR/SBIN/CRON[31518]: (root) CMD (/usr/share/vzctl/scripts/vpsnetclean)
Oct 8 10:34:35 penguin syslogd 1.4.1#18: restart.
There was some spew on the console, but I didn't know how to capture that.
E Frank Ball efball at efball.com
On Wed, Oct 10, 2007 at 07:54:11AM +0200, Martin Trtusek wrote:
> I installed kernel 2.6.18-openvz-13-39.1d1-amd64 from
> http://download.openvz.org/debian on Debian Etch one week ago and
> experienced kernel oops (complete freezing, off/on necessary) after 2-3
> days of running (3 times). Oops is always after cron.daily scripts (in
> my case 06:25) but not everyday. Yesterday I configured netconsole for
> capturing useful info, enclosed.
>
> Hardware was tested very strong on installation. With stock Debian
> kernel (initrd.img-2.6.18-5-amd64) server does not have any problem (3
> months of operation). There are 3 VPS running, without really using.
>
> Enclosed last entry in syslog (before crash). Looks like problem is
> invoking by /usr/share/vzctl/scripts/vpsnetclean
> or /usr/share/vzctl/scripts/vpsreboot. Booth scripts are from vzctl
> package, I installed it from http://debian.systs.org/
>
> # vzctl --version
> vzctl version 3.0.18-1dso1
>
> I am leaving office now, additional info (if necessary) I can send
> tomorrow.
>
> Martin Trtusek
> netconsole: network logging started
> Warning: /proc/ide/hd?/settings interface is obsolete, and will be removed soon!
> st: Version 20050830, fixed bufsize 32768, s/g segs 256
> sd 0:0:0:0: Attached scsi generic sg0 type 0
> sd 0:0:1:0: Attached scsi generic sg1 type 0
> sd 1:0:0:0: Attached scsi generic sg2 type 0
> sd 1:0:1:0: Attached scsi generic sg3 type 0
> BIOS EDD facility v0.16 2004-Jun-25, 4 devices found
> ----------- [cut here ] --------- [please bite here ] ---------
> Kernel BUG at kernel/sched.c:3798
> invalid opcode: 0000 [1] SMP
> CPU: 0
> Modules linked in: edd joydev sg st sr_mod netconsole vzethdev vznetdev simfs vzrst vzcpt vzdquota vzmon vzdev button ac battery ip6table_filter ip6_tables iptable_raw xt_policy xt_multiport ipt_ULOG ipt_TTL ipt_ttl ipt_TOS ipt_tos ipt_TCPMSS ipt_SAME ipt_REJECT ipt_REDIRECT ipt_recent ipt_owner ipt_NETMAP ipt_MASQUERADE ipt_LOG ipt_iprange ipt_hashlimit ipt_ECN ipt_ecn ipt_DSCP ipt_dscp ipt_CLUSTERIP ipt_ah ipt_addrtype ip_nat_tftp ip_nat_snmp_basic ip_nat_pptp ip_nat_irc ip_nat_ftp ip_nat_amanda ip_conntrack_tftp ip_conntrack_pptp ip_conntrack_netbios_ns ip_conntrack_irc ip_conntrack_ftp ts_kmp ip_conntrack_amanda xt_tcpmss xt_pkttype xt_physdev bridge xt_NFQUEUE xt_MARK xt_mark xt_mac xt_limit xt_length xt_helper xt_dccp xt_conntrack xt_CONNMARK xt_connmark xt_CLASSIFY xt_tcpudp xt_state iptable_nat ip_nat ip_conntrack iptable_mangle nfnetlink iptable_filter ip_tables x_tables ipv6 dummy aes_x86_64 sha512 sha256 loop evdev psmouse i2c_i801 shpchp pci_hotplug serio_raw i2c_core pcspkr floppy ext3 jbd mbcache raid10 ide_generic sd_mod ide_cd cdrom ata_piix libata generic piix ide_core ehci_hcd e1000 uhci_hcd thermal processor fan cciss scsi_mod dm_snapshot dm_mirror dm_crypt dm_mod raid456 xor raid1 md_mod
> Pid: 0, comm: swapper Not tainted 2.6.18-openvz-13-39.1d1-amd64 #1
> RIP: 0060:[<ffffffff8027fd35>] [<ffffffff8027fd35>] rebalance_tick+0x391/0x57a
> RSP: 0068:ffffffff804c4b18 EFLAGS: 00010046
> RAX: ffffffff804e1980 RBX: ffffffff804e1980 RCX: 0000000000000020
> RDX: 0000000000000020 RSI: ffffffff804e17d0 RDI: 0000000000000001
> RBP: ffffffff804c4bb8 R08: ffffffff804c4b68 R09: ffffffff804c4b68
> R10: 0000000000000000 R11: 0000000000000002 R12: ffff810001020340
> R13: ffff81011b0c2000 R14: ffff81011b0c2000 R15: ffffffff804e2c80
> FS: 0000000000000000(0000) GS:ffffffff80526000(0000) knlGS:0000000000000000
> CS: 0060 DS: 0068 ES: 0068 CR0: 000000008005003b
> CR2: 0000000000a10048 CR3: 000000010196d000 CR4: 00000000000006e0
> Process swapper (pid: 0, veid=0, threadinfo ffffffff80534000, task ffffffff80449be0)
> Stack: 0000000000000002 ffff810101161240 0000000000000000 ffffffff804e2c80
> 0000000202555ed8 ffff81011b0c2000 ffffffff8027b532 0000000000000001
> 0000000000000003 0000000000000082 00000000ffffffff 000047783a3fde8a
> Call Trace:
> <IRQ> [<ffffffff8027b532>] vcpu_attach+0x7e/0xc3
> [<ffffffff8028a68f>] update_process_times+0x5c/0x68
> [<ffffffff8026c8e1>] smp_local_timer_interrupt+0x23/0x47
> [<ffffffff8026ce7f>] smp_apic_timer_interrupt+0x99/0x9f
> [<ffffffff8025bdda>] apic_timer_interrupt+0x66/0x6c
> [<ffffffff8024e78d>] bio_fs_destructor+0x0/0xc
> [<ffffffff88124830>] :libata:ata_scsi_rw_xlat+0x0/0x37e
> [<ffffffff8022d780>] mempool_free+0x10/0x74
> [<ffffffff8023f729>] bio_free+0x33/0x43
> [<ffffffff8023f176>] end_bio_bh_io_sync+0x37/0x3b
> [<ffffffff88042255>] :dm_mod:dec_pending+0xab/0xce
> [<ffffffff88042392>] :dm_mod:clone_endio+0x7f/0x9b
> [<ffffffff88162aa6>] :raid10:raid_end_bio_io+0x2c/0x80
> [<ffffffff881645b2>] :raid10:raid10_end_read_request+0x66/0xe9
> [<ffffffff803010b1>] elv_next_request+0x141/0x151
> [<ffffffff8022b938>] __end_that_request_first+0x153/0x49e
> [<ffffffff803117ce>] swiotlb_unmap_sg+0x9c/0xed
> [<ffffffff8806b597>] :scsi_mod:scsi_delete_timer+0x12/0x59
> [<ffffffff8806cbf9>] :scsi_mod:scsi_end_request+0x27/0xcb
> [<ffffffff8806cdf3>] :scsi_mod:scsi_io_completion+0x156/0x334
> [<ffffffff881203fd>] :libata:ata_hsm_move+0x642/0x661
> [<ffffffff881584a3>] :sd_mod:sd_rw_intr+0x217/0x244
> [<ffffffff8806d091>] :scsi_mod:scsi_device_unbusy+0x67/0x81
> [<ffffffff80236488>] blk_done_softirq+0x5f/0x6d
> [<ffffffff8021030f>] __do_softirq+0x98/0x138
> [<ffffffff8025c43c>] call_softirq+0x1c/0x28
> [<ffffffff802661c3>] do_softirq+0x2c/0x7d
> [<ffffffff802662d6>] do_IRQ+0xc2/0xcb
> [<ffffffff80255128>] mwait_idle+0x0/0x4a
> [<ffffffff8025b761>] ret_from_intr+0x0/0xa
> <EOI> [<ffffffff8025515e>] mwait_idle+0x36/0x4a
> [<ffffffff8024703e>] cpu_idle+0x60/0x7f
> [<ffffffff8053e7be>] start_kernel+0x23b/0x240
> [<ffffffff8053e288>] _sinittext+0x288/0x28c
>
>
> Code: 0f 0b 68 df 6e 40 80 c2 d6 0e 4c 39 eb 48 89 5d 88 0f 84 57
> RIP [<ffffffff8027fd35>] rebalance_tick+0x391/0x57a
> RSP <ffffffff804c4b18>
> Kernel panic - not syncing: Aiee, killing interrupt handler!
> Oct 10 06:15:01 vochomurka /USR/SBIN/CRON[14743]: (root) CMD (if [ -x /etc/munin/plugins/apt_all ]; then /etc/munin/plugins/apt_all update 7200 12 >/dev/null; elif [ -x /etc/munin/plugins/apt ]; then /etc/munin/plugins/apt update 7200 12 >/dev/null;
> fi)
> Oct 10 06:15:01 vochomurka /USR/SBIN/CRON[14745]: (munin) CMD (if [ -x /usr/bin/munin-cron ]; then /usr/bin/munin-cron; chmod -R o+r /var/www/munin; fi)
> Oct 10 06:15:01 vochomurka /USR/SBIN/CRON[14747]: (root) CMD (/usr/share/vzctl/scripts/vpsreboot)
> Oct 10 06:15:01 vochomurka /USR/SBIN/CRON[14749]: (root) CMD (/usr/share/vzctl/scripts/vpsnetclean)
> Oct 10 06:17:01 vochomurka /USR/SBIN/CRON[16783]: (root) CMD ( cd / && run-parts --report /etc/cron.hourly)
> Oct 10 06:17:10 vochomurka ntpdate[16786]: adjust time server 195.113.144.238 offset -0.007761 sec
> Oct 10 06:20:01 vochomurka /USR/SBIN/CRON[16788]: (root) CMD (if [ -x /etc/munin/plugins/apt_all ]; then /etc/munin/plugins/apt_all update 7200 12 >/dev/null; elif [ -x /etc/munin/plugins/apt ]; then /etc/munin/plugins/apt update 7200 12 >/dev/null;
> fi)
> Oct 10 06:20:01 vochomurka /USR/SBIN/CRON[16790]: (munin) CMD (if [ -x /usr/bin/munin-cron ]; then /usr/bin/munin-cron; chmod -R o+r /var/www/munin; fi)
> Oct 10 06:20:01 vochomurka /USR/SBIN/CRON[16792]: (root) CMD (/usr/share/vzctl/scripts/vpsreboot)
> Oct 10 06:20:01 vochomurka /USR/SBIN/CRON[16794]: (root) CMD (/usr/share/vzctl/scripts/vpsnetclean)
> Oct 10 06:25:01 vochomurka /USR/SBIN/CRON[18831]: (root) CMD (test -x /usr/sbin/anacron || ( cd / && run-parts --report /etc/cron.daily ))
> Oct 10 06:25:01 vochomurka /USR/SBIN/CRON[18833]: (root) CMD (if [ -x /etc/munin/plugins/apt_all ]; then /etc/munin/plugins/apt_all update 7200 12 >/dev/null; elif [ -x /etc/munin/plugins/apt ]; then /etc/munin/plugins/apt update 7200 12 >/dev/null;
> fi)
> Oct 10 06:25:01 vochomurka /USR/SBIN/CRON[18839]: (munin) CMD (if [ -x /usr/bin/munin-cron ]; then /usr/bin/munin-cron; chmod -R o+r /var/www/munin; fi)
> Oct 10 06:25:01 vochomurka /USR/SBIN/CRON[18840]: (root) CMD (/usr/share/vzctl/scripts/vpsreboot)
> Oct 10 06:25:01 vochomurka /USR/SBIN/CRON[18842]: (root) CMD (/usr/share/vzctl/scripts/vpsnetclean)
> Oct 10 07:19:13 vochomurka syslogd 1.4.1#18: restart.
> _______________________________________________
> Users mailing list
> Users at openvz.org
> https://openvz.org/mailman/listinfo/users
--
E Frank Ball efball at efball.com
More information about the Users
mailing list