J. Peters | 1 May 2006 17:40
Picon

Re: [Q] FC5 and JFS crash

3 more crashes this weekend:

[root <at> XXXhp360-XX log]# last -f /var/log/wtmp.1
reboot   system boot  2.6.16-1.2080_FC Sun Apr 30 09:55          (23:15)
XXXXXXX  pts/2        :1.0             Sun Apr 30 09:33 - crash  (00:21)
XXXXXXX  pts/1        :1.0             Sun Apr 30 09:33 - crash  (00:21)
XXXXXXX  pts/0        XXX.XX.XX.XXX    Sun Apr 30 09:32 - crash  (00:22)

I am leaning towards hardware issues at this point vs. software. This system was
setup with FC5 out of the box.

> ----- Original Message -----
> From: "Dave Kleikamp" <shaggy <at> austin.ibm.com>
> To: "J. Peters" <japeters <at> mail.com>
> Subject: Re: [Jfs-discussion] [Q] FC5 and JFS crash
> Date: Sat, 29 Apr 2006 09:02:57 -0500
> 
> 
> On Fri, 2006-04-28 at 19:10 -0700, J. Peters wrote:
> > Has anyone kicked the tires on FC5 and JFS?
> >
> > I am getting a crash using this configuration:
> >
> > rpm -q kernel-smp
> > kernel-smp-2.6.15-1.2054_FC5
> > rpm -q jfsutils
> > jfsutils-1.1.10-4

No Oops! in dmesg or /var/log/messages.

(Continue reading)

J. Peters | 5 May 2006 16:15
Picon

Re: [Q] FC5 and JFS crash

Hardware passed diagnostics.

The reason why there is no Kdump "vmcore" is that CONFIG_PROC_VMCORE has to
be set to configure the pseudo filesystem "/proc/vmcore" in the kernel.  This
is not set by default in FC5 as given by "cat /proc/filesystems":

cat /proc/filesystems
nodev   sysfs
nodev   rootfs
nodev   bdev
nodev   proc
nodev   cpuset
nodev   binfmt_misc
nodev   debugfs
nodev   securityfs
nodev   sockfs
nodev   usbfs
nodev   pipefs
nodev   futexfs
nodev   tmpfs
nodev   inotifyfs
nodev   eventpollfs
nodev   devpts
        ext2
nodev   ramfs
nodev   hugetlbfs
        iso9660
nodev   mqueue
        ext3
        jfs
(Continue reading)

Ingo | 21 May 2006 19:37
Picon

JFS added uid, gid and umask options, added officially?

Hi Dave,

in February you provided me with a patch for SUSE 10.0 to enable
the use of JFS-partititions created by OS/2, adding uid, gid and umask.
It really works fine for me, but now SUSE-update proposes to also
update the kernel (which I refused for the time).

My question is therefore whether your patch has been incorporated
in the official distribution (and thus updates), so I can safely update
the kernel and the associated modules?

Best regards,
Ingo

P.S.: any news regarding the 'directory size problem'?

-------------------------------------------------------
Using Tomcat but need to do more? Need to support web services, security?
Get stuff done quickly with pre-integrated technology to make your job easier
Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=120709&bid=263057&dat=121642
Dave Kleikamp | 22 May 2006 14:49
Picon
Favicon

Re: JFS added uid, gid and umask options, added officially?

On Sun, 2006-05-21 at 19:37 +0200, Ingo wrote:
> Hi Dave,
> 
> in February you provided me with a patch for SUSE 10.0 to enable
> the use of JFS-partititions created by OS/2, adding uid, gid and umask.
> It really works fine for me, but now SUSE-update proposes to also
> update the kernel (which I refused for the time).
> 
> My question is therefore whether your patch has been incorporated
> in the official distribution (and thus updates), so I can safely update
> the kernel and the associated modules?

No, I'm pretty sure SuSE hasn't incorporated that patch.  SuSE's kernel
is based on 2.6.16, and the uid,gid,umask patch was first included in
the 2.6.17-rc* mainline kernel.  I would assume the same patch would
apply to the newer SuSE kernel.

> Best regards,
> Ingo
> 
> P.S.: any news regarding the 'directory size problem'?

Sorry, I've put this off.  I'll try to look at it again soon.  It helps
to keep bugging me.  :-)
--

-- 
David Kleikamp
IBM Linux Technology Center

-------------------------------------------------------
Using Tomcat but need to do more? Need to support web services, security?
(Continue reading)

Paul Drynoff | 30 May 2006 10:39
Picon

jfs: What may be wrong?

I have 2.6.16-gentoo-r7, compiled without any debug support,
but recently I found in my log such strange messages, 
what may be wrong?

May 30 12:20:05 localhost [<c01b3026>] __get_metapage+0x3c/0x3a1
May 30 12:20:05 localhost [<e18680a1>] _nv002668rm+0x1d/0x2c [nvidia]
May 30 12:20:05 localhost [<e18680a1>] _nv002668rm+0x1d/0x2c [nvidia]
May 30 12:20:05 localhost [<c01ae86c>] dtSearch+0x15b/0x7b3
May 30 12:20:05 localhost [<c0156005>] __d_lookup+0x85/0xbc
May 30 12:20:05 localhost [<c01ac5dd>] get_UCSname+0x3d/0xe9
May 30 12:20:05 localhost [<c019f5fa>] jfs_lookup+0xa2/0x15e
May 30 12:20:05 localhost [<c01521de>] poll_freewait+0x35/0x3e
May 30 12:20:05 localhost [<c0159359>] mntput_no_expire+0x14/0x59
May 30 12:20:05 localhost [<c014fdac>] link_path_walk+0xb0/0xbb
May 30 12:20:05 localhost [<c014efd7>] __lookup_hash+0xad/0xc9
May 30 12:20:05 localhost [<c015037c>] do_unlinkat+0x5a/0x122
May 30 12:20:05 localhost [<c01199c5>] sys_gettimeofday+0x23/0x51
May 30 12:20:05 localhost [<c01029e3>] sysenter_past_esp+0x54/0x75
May 30 12:20:05 localhost [<c01b3026>] __get_metapage+0x3c/0x3a1
May 30 12:20:05 localhost [<e18680a1>] _nv002668rm+0x1d/0x2c [nvidia]
May 30 12:20:05 localhost [<e18680a1>] _nv002668rm+0x1d/0x2c [nvidia]
May 30 12:20:05 localhost [<c01ae86c>] dtSearch+0x15b/0x7b3
May 30 12:20:05 localhost [<c0156005>] __d_lookup+0x85/0xbc
May 30 12:20:05 localhost [<c01ac5dd>] get_UCSname+0x3d/0xe9
May 30 12:20:05 localhost [<c019f5fa>] jfs_lookup+0xa2/0x15e
May 30 12:20:05 localhost [<c01521de>] poll_freewait+0x35/0x3e
May 30 12:20:05 localhost [<c0159359>] mntput_no_expire+0x14/0x59
May 30 12:20:05 localhost [<c014fdac>] link_path_walk+0xb0/0xbb
May 30 12:20:05 localhost [<c011263b>] do_page_fault+0x168/0x4ae
May 30 12:20:05 localhost [<c014efd7>] __lookup_hash+0xad/0xc9
(Continue reading)

Dave Kleikamp | 30 May 2006 17:13
Picon
Favicon

Re: jfs: What may be wrong?

On Tue, 2006-05-30 at 12:39 +0400, Paul Drynoff wrote:
> I have 2.6.16-gentoo-r7, compiled without any debug support,
> but recently I found in my log such strange messages, 
> what may be wrong?

This could be caused by a corrupted directory on disk.  This patch
should do a better job of detecting what the problem is.  You should be
able to fix the problem by running "fsck -f" against the partition.

I don't know what the cause of the corruption may be.  I'd be interested
if it is reproducible after fixing it with fsck (assuming fsck does fix
it).

> May 30 12:20:05 localhost [<c01b3026>] __get_metapage+0x3c/0x3a1
> May 30 12:20:05 localhost [<e18680a1>] _nv002668rm+0x1d/0x2c [nvidia]
> May 30 12:20:05 localhost [<e18680a1>] _nv002668rm+0x1d/0x2c [nvidia]
> May 30 12:20:05 localhost [<c01ae86c>] dtSearch+0x15b/0x7b3
> May 30 12:20:05 localhost [<c0156005>] __d_lookup+0x85/0xbc
> May 30 12:20:05 localhost [<c01ac5dd>] get_UCSname+0x3d/0xe9
> May 30 12:20:05 localhost [<c019f5fa>] jfs_lookup+0xa2/0x15e
> May 30 12:20:05 localhost [<c01521de>] poll_freewait+0x35/0x3e
> May 30 12:20:05 localhost [<c0159359>] mntput_no_expire+0x14/0x59
> May 30 12:20:05 localhost [<c014fdac>] link_path_walk+0xb0/0xbb
> May 30 12:20:05 localhost [<c014efd7>] __lookup_hash+0xad/0xc9
> May 30 12:20:05 localhost [<c015037c>] do_unlinkat+0x5a/0x122
> May 30 12:20:05 localhost [<c01199c5>] sys_gettimeofday+0x23/0x51
> May 30 12:20:05 localhost [<c01029e3>] sysenter_past_esp+0x54/0x75

diff -Nurp linux-2.6.16/fs/jfs/jfs_dtree.c linux/fs/jfs/jfs_dtree.c
--- linux-2.6.16/fs/jfs/jfs_dtree.c	2006-03-19 23:53:29.000000000 -0600
(Continue reading)

Nico -telmich- Schottelius | 31 May 2006 09:38

fsck.jfs segfaults

Hello dear jfs-developers!

Have a look at: 

http://creme.schottelius.org/~nico/linux/debug/fs/fsck.jfs-segmentation-fault-2006-05-31

Afair I had this problem some years ago without dm-crypt, but I cannot
remember the situation.

Is it possible, if the disk (hde) is broken, it could cause fsck.jfs
to segfault, because it gets 'non valid' information from the harddisk?

Please CC-me and tell me how to debug / what could be the fault.

Nico

--

-- 
:x
Dave Kleikamp | 31 May 2006 17:03
Picon
Favicon

Re: fsck.jfs segfaults

On Wed, 2006-05-31 at 09:38 +0200, Nico -telmich- Schottelius wrote:
> Hello dear jfs-developers!
> 
> Have a look at: 
> 
> http://creme.schottelius.org/~nico/linux/debug/fs/fsck.jfs-segmentation-fault-2006-05-31
> 
> Afair I had this problem some years ago without dm-crypt, but I cannot
> remember the situation.
> 
> Is it possible, if the disk (hde) is broken, it could cause fsck.jfs
> to segfault, because it gets 'non valid' information from the harddisk?

It's possible, but that would be a bug.

> Please CC-me and tell me how to debug / what could be the fault.

It would be useful to run jfs_fsck under gdb:

gdb /sbin/jfs_fsck
> run /dev/mapper/nirvana
<jfs_fsck should segfault>
> where

> 
> Nico
> 
--

-- 
David Kleikamp
IBM Linux Technology Center
(Continue reading)


Gmane