[Oracleasm-devel] kernel BUG at include/asm/spinlock.h:146!

Joel Becker Joel.Becker at oracle.com
Thu Aug 25 18:31:28 CDT 2005


On Fri, Aug 26, 2005 at 08:34:36AM +1000, Han Xie wrote:
> I have a 3 node cluster on Firewire, and got this kernel BUG in the /var/log/messages file.  It seems to suggest that Oracle ASM introduces a kernel bug.  Is this true?  After this error, my cluster failed miserably.

	No, the BUG was in sbp2, the firewire disk driver.  Note the
abort of the sbp2 command in your log, and that the stack trace contains
only sbp2 and SCSI error-handler code, with no oracleasm frames.  Did you
perhaps unplug your disk in the middle of operations?
	Anyway, this isn't an oracleasm issue.  You might want to take
it up with your vendor.
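
	For anyone who wants to check this kind of attribution mechanically,
here is a rough sketch that tallies which module each call-trace frame
belongs to.  The regex and the helper names (frames, modules_in_trace)
are made up for illustration; it assumes the 2.6-era oops format shown
in the log, where a frame ends in "[module]" and frames without that
suffix are core kernel code.

```python
import re
from collections import Counter

# A frame looks like: " [<f8a5a4d4>] sbp2scsi_abort+0x2e/0x7a [sbp2]"
# The trailing "[module]" is optional; core kernel frames do not have it.
FRAME_RE = re.compile(
    r"\[<(?P<addr>[0-9a-f]+)>\]\s+(?P<sym>[\w.]+)\+0x[0-9a-f]+/0x[0-9a-f]+"
    r"(?:\s+\[(?P<mod>[\w-]+)\])?"
)


def frames(log_text):
    """Return (symbol, module) pairs for every call-trace frame found."""
    return [
        (m.group("sym"), m.group("mod") or "kernel")
        for m in FRAME_RE.finditer(log_text)
    ]


def modules_in_trace(log_text):
    """Count how many frames each module contributes to the trace."""
    return Counter(mod for _, mod in frames(log_text))


# Call trace copied from the report (syslog prefix dropped for brevity;
# the regex ignores any prefix anyway).
SAMPLE = """\
Call Trace:
 [<f8a58261>] sbp2util_find_command_for_SCpnt+0x12/0x6a [sbp2]
 [<f8a5a4d4>] sbp2scsi_abort+0x2e/0x7a [sbp2]
 [<f88470c1>] scsi_try_to_abort_cmd+0x3f/0x58 [scsi_mod]
 [<f88471f3>] scsi_eh_abort_cmds+0x52/0xbc [scsi_mod]
 [<f8847dc6>] scsi_unjam_host+0x147/0x16b [scsi_mod]
 [<f8844d60>] __scsi_iterate_devices+0x50/0x58 [scsi_mod]
 [<f8847efc>] scsi_error_handler+0x112/0x15a [scsi_mod]
 [<f8847dea>] scsi_error_handler+0x0/0x15a [scsi_mod]
 [<c01041f1>] kernel_thread_helper+0x5/0xb
"""

if __name__ == "__main__":
    for module, count in modules_in_trace(SAMPLE).most_common():
        print(f"{module}: {count} frame(s)")
```

	On this trace the top two frames land in sbp2, the rest are in
scsi_mod or the core kernel, and oracleasm does not appear at all.  Being
listed under "Modules linked in" only means a module was loaded, not that
it was on the stack.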

Joel

> 
> The environment is: Red Hat AS 4, kernel 2.6.9-11.ELsmp
> 
> ASM installed: oracleasm-2.6.9-11.ELsmp-2.0.0-1
> 		oracleasmlib-2.0.0-1
> 		oracleasm-support-2.0.0-1
> 
> Firewire rpm installed: oracle-firewire-modules-2.6.9-11.ELsmp-1286-1
> 
> Here is the full text of the error in the /var/log/messages:
> 
> Aug 25 02:05:58 melclul14 kernel: ieee1394: sbp2: aborting sbp2 command
> Aug 25 02:05:58 melclul14 kernel: scsi2 : destination target 0, lun 0
> Aug 25 02:05:58 melclul14 kernel:         command = Read (10) 00 0e 28 79 33 00 00 a0 00
> Aug 25 02:05:58 melclul14 kernel: eip: f8a58261
> Aug 25 02:05:58 melclul14 kernel: ------------[ cut here ]------------
> Aug 25 02:05:58 melclul14 kernel: kernel BUG at include/asm/spinlock.h:146!
> Aug 25 02:05:58 melclul14 kernel: invalid operand: 0000 [#1]
> Aug 25 02:05:58 melclul14 kernel: SMP
> Aug 25 02:05:58 melclul14 kernel: Modules linked in: oracleasm(U) parport_pc lp parport autofs4 i2c_dev i2c_core sunrpc dm_mod button battery ac sbp2(U) md5 ipv6 ohci1394(U) ieee1394(U) uhci_hcd ehci_hcd hw_random e1000 e100 mii ext3 jbd ata_piix libata sd_mod scsi_mod
> Aug 25 02:05:58 melclul14 kernel: CPU:    0
> Aug 25 02:05:58 melclul14 kernel: EIP:    0060:[<c02c5fa5>]    Not tainted VLI
> Aug 25 02:05:58 melclul14 kernel: EFLAGS: 00010046   (2.6.9-11.ELsmp)
> Aug 25 02:05:58 melclul14 kernel: EIP is at _spin_lock_irqsave+0x20/0x45
> Aug 25 02:05:58 melclul14 kernel: eax: f8a58261   ebx: 00000082   ecx: c02d972a   edx: c02d972a
> Aug 25 02:05:58 melclul14 kernel: esi: c21f4924   edi: c22b2500   ebp: f751dfa4   esp: f751df50
> Aug 25 02:05:58 melclul14 kernel: ds: 007b   es: 007b   ss: 0068
> Aug 25 02:05:58 melclul14 kernel: Process scsi_eh_2 (pid: 1492, threadinfo=f751d000 task=c22f8cb0)
> Aug 25 02:05:58 melclul14 kernel: Stack: c22b2500 c21f4880 f8a58261 c22b2500 c21f4880 c22b2500 f8a5a4d4 c22b2500
> Aug 25 02:05:58 melclul14 kernel:        00000202 f88470c1 c22b2500 f751dfac c22b2698 f88471f3 f7499048 f7499000
> Aug 25 02:05:58 melclul14 kernel:        00000296 f751dfac f8847dc6 f7499000 f8844d60 f751dfa4 f751dfa4 c22b2518
> Aug 25 02:05:58 melclul14 kernel: Call Trace:
> Aug 25 02:05:58 melclul14 kernel:  [<f8a58261>] sbp2util_find_command_for_SCpnt+0x12/0x6a [sbp2]
> Aug 25 02:05:58 melclul14 kernel:  [<f8a5a4d4>] sbp2scsi_abort+0x2e/0x7a [sbp2]
> Aug 25 02:05:58 melclul14 kernel:  [<f88470c1>] scsi_try_to_abort_cmd+0x3f/0x58 [scsi_mod]
> Aug 25 02:05:58 melclul14 kernel:  [<f88471f3>] scsi_eh_abort_cmds+0x52/0xbc [scsi_mod]
> Aug 25 02:05:58 melclul14 kernel:  [<f8847dc6>] scsi_unjam_host+0x147/0x16b [scsi_mod]
> Aug 25 02:05:58 melclul14 kernel:  [<f8844d60>] __scsi_iterate_devices+0x50/0x58 [scsi_mod]
> Aug 25 02:05:58 melclul14 kernel:  [<f8847efc>] scsi_error_handler+0x112/0x15a [scsi_mod]
> Aug 25 02:05:58 melclul14 kernel:  [<f8847dea>] scsi_error_handler+0x0/0x15a [scsi_mod]
> Aug 25 02:05:58 melclul14 kernel:  [<c01041f1>] kernel_thread_helper+0x5/0xb
> Aug 25 02:05:58 melclul14 kernel: Code: 81 00 00 00 00 01 c3 f0 ff 00 c3 56 89 c6 53 9c 5b fa 81 78 04 ad 4e ad de 74 18 ff 74 24 08 68 2a 97 2d c0 e8 33 ba e5 ff 59 58 <0f> 0b 92 00 4f 88 2d c0 f0 fe 0e 79 13 f7 c3 00 02 00 00 74 01
> Aug 25 02:05:58 melclul14 kernel:  <0>Fatal exception: panic in 5 seconds
> 
> Cheers.
> Han
> 
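
The "Code:" line in the oops can be cross-checked against the BUG
location as well.  The sketch below assumes the i386 BUG() layout of
that kernel generation (a ud2 instruction followed by a 16-bit line
number and a 32-bit pointer to the file name) and assumes the spinlock
debug magic is 0xdead4ead; both are assumptions about the kernel build,
not something the log states.  The function name decode_bug_marker is
hypothetical.

```python
import struct

# "Code:" bytes copied from the oops; <..> marks the byte EIP pointed at.
CODE = (
    "81 00 00 00 00 01 c3 f0 ff 00 c3 56 89 c6 53 9c 5b fa 81 78 04 ad 4e "
    "ad de 74 18 ff 74 24 08 68 2a 97 2d c0 e8 33 ba e5 ff 59 58 <0f> 0b "
    "92 00 4f 88 2d c0 f0 fe 0e 79 13 f7 c3 00 02 00 00 74 01"
)

# Assumed value of the spinlock debug magic on this kernel.
SPINLOCK_MAGIC = 0xDEAD4EAD


def to_bytes(code_line):
    """Convert the hex dump to bytes and return (bytes, index of EIP)."""
    tokens = code_line.split()
    eip_index = next(i for i, t in enumerate(tokens) if t.startswith("<"))
    raw = bytes(int(t.strip("<>"), 16) for t in tokens)
    return raw, eip_index


def decode_bug_marker(code_line):
    """Return (line_number, file_name_pointer) stored after the ud2."""
    raw, eip_index = to_bytes(code_line)
    if raw[eip_index:eip_index + 2] != b"\x0f\x0b":
        raise ValueError("faulting instruction is not ud2")
    # Little-endian: 16-bit line number, then 32-bit file-name pointer.
    return struct.unpack_from("<HI", raw, eip_index + 2)


if __name__ == "__main__":
    line, file_ptr = decode_bug_marker(CODE)
    print(f"BUG line number: {line}")
    print(f"file name pointer: {file_ptr:#010x}")
    raw, _ = to_bytes(CODE)
    if struct.pack("<I", SPINLOCK_MAGIC) in raw:
        print("code compares against the assumed spinlock magic")
```

Under those assumptions the decoded line number is 146, which agrees
with "kernel BUG at include/asm/spinlock.h:146!", and the magic constant
appears in the bytes just before the trap.  That is consistent with a
lock sanity check failing on the lock handed to _spin_lock_irqsave, but
it does not by itself say why the lock was bad.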



-- 

"I think it would be a good idea."  
        - Mahatma Gandhi, when asked what he thought of Western
          civilization

			http://www.jlbec.org/
			jlbec at evilplan.org