Forum: >>> Magnum BBS <<<

watchdog: BUG: soft lockup - CPU#0 stuck for 23s! [systemd:1]

From Dennis Clarke@21:1/5 to All on Fri Mar 12 15:30:01 2021

I have seen this for a few months now. The old old netra machine will
run just fine endlessly but if I attempt to perform a package update
then I am always assured to see :

ceres# apt-get update
Get:1 http://deb.debian.org/debian-ports sid InRelease [55.3 kB]
Get:2 http://deb.debian.org/debian-ports sid/main sparc64 Packages [21.6 MB] Get:3 http://deb.debian.org/debian-ports sid/main all Packages [8,682
kB]
Fetched 30.3 MB in 1min 24s (361 kB/s)

Reading package lists... Done
ceres#

Then try "upgrade" and the machine drops off the network :

Setting up systemd (247.3-1) ...
Timeout, server 172.16.35.61 not responding.

On the serial console we see :

ceres# [2968669.114937] systemd[1]: systemd 247.3-1 running in system
mode. (+PAM +AUDIT +SELINUX +IMA +APPARMOR +SMACK +SYSVINIT +UTMP +LIBCRYPTSETUP +GCRYPT +GNUTLS +ACL +XZ +LZ4 +ZSTD -SECCOMP +BLKID
+ELFUTILS +KMOD +IDN2 -IDN +PCRE2 default-hierarchy=unified)
[2968669.411163] systemd[1]: Detected architecture sparc64.
[2968696.703129] watchdog: BUG: soft lockup - CPU#0 stuck for 23s!
[systemd:1]
[2968696.794780] Modules linked in: drm(E)
drm_panel_orientation_quirks(E) i2c_core(E) sg(E) envctrl(E)
display7seg(E) flash(E) fuse(E) configfs(E) ip_tables(E) x_tables(E)
autofs4(E) ext4(E) crc16(E) mbcache(E) jbd2(E) crc32c_generic(E)
sd_mod(E) t10_pi(E) crc_t10dif(E) crct10dif_generic(E)
crct10dif_common(E) ata_generic(E) pata_cmd64x(E) libata(E) sym53c8xx(E) scsi_transport_spi(E) scsi_mod(E) sunhme(E)
[2968697.265208] CPU: 0 PID: 1 Comm: systemd Tainted: G E
5.10.0-1-sparc64 #1 Debian 5.10.5-1
[2968697.391074] TSTATE: 0000000011001604 TPC: 000000000094c4f0 TNPC: 000000000094c4f4 Y: 00000000 Tainted: G E
[2968697.541033] TPC: <misc_open+0x50/0x180>
[2968697.593712] g0: fffff800065a1c80 g1: 0000000000000098 g2:
0000000000000000 g3: 0000000000000002
[2968697.710488] g4: fffff80004197020 g5: 0000000000e93214 g6:
fffff80004198000 g7: 0000000000500008
[2968697.827256] o0: 0000000000f24960 o1: fffff800049ab110 o2:
0000000000040000 o3: 0000000000000000
[2968697.944022] o4: 0000000000000000 o5: 0000000000000000 sp:
fffff8000419af81 ret_pc: 000000000094c4c0
[2968698.065369] RPC: <misc_open+0x20/0x180>
[2968698.118074] l0: 0000000000f24800 l1: fffff800041ce021 l2:
00000003e775fef2 l3: 00000003e775fef2
[2968698.234848] l4: 0000000000020000 l5: fffff8000419b8f0 l6:
0000000000e12000 l7: 0000000000000001
[2968698.351615] i0: fffff8000b791048 i1: fffff800049ab100 i2:
0000000000f24800 i3: 0000000000f24978
[2968698.468381] i4: 00000000000000eb i5: 0000000010040818 i6:
fffff8000419b031 i7: 0000000000665838
[2968698.585168] I7: <chrdev_open+0x98/0x1e0>
[2968698.638996] Call Trace:
[2968698.673323] [<0000000000665838>] chrdev_open+0x98/0x1e0
[2968698.744355] [<000000000065ae30>] do_dentry_open+0x170/0x420 [2968698.819928] [<000000000065ca68>] vfs_open+0x28/0x40
[2968698.886379] [<0000000000671348>] path_openat+0x988/0x1100
[2968698.959682] [<0000000000673dd0>] do_filp_open+0x50/0x100
[2968699.031837] [<000000000065cd30>] do_sys_openat2+0x70/0x180 [2968699.106284] [<000000000065d268>] sys_openat+0x48/0xc0
[2968699.175027] [<0000000000406174>] linux_sparc_syscall+0x34/0x44
~
Type 'go' to resume
ok ~
[EOT]

This is pretty consistent behavior. If someone has any ideas that would
be great. I realize that the old old Netra X1 or Netra T1 is well past
its prime but it does run very stable. I would love to fire up a big
Oracle M4000 unit to try but I have not heard from anyone anywhere that
knows if that can work at all. So for now these old netra units are all
that I can test with.

--
Dennis Clarke
RISC-V/SPARC/PPC/ARM/CISC
UNIX and Linux spoken
GreyBeard and suspenders optional

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

From Mike Tremaine@21:1/5 to All on Sat Mar 13 19:20:02 2021

On Mar 12, 2021, at 5:56 AM, Dennis Clarke <dclarke@blastwave.org> wrote:

I have seen this for a few months now. The old old netra machine will
run just fine endlessly but if I attempt to perform a package update
then I am always assured to see :

What kernel are you on? I do not have a Netra handy (but I have one in storage, like everyone ;p ). I have an Ultra 5 here so UltraSparc IIi CPU. It does not expect this behavior. Any chance the memory module need to be reseated?

ceres# apt-get update
Get:1 http://deb.debian.org/debian-ports sid InRelease [55.3 kB]
Get:2 http://deb.debian.org/debian-ports sid/main sparc64 Packages [21.6 MB] Get:3 http://deb.debian.org/debian-ports sid/main all Packages [8,682
kB]
Fetched 30.3 MB in 1min 24s (361 kB/s)

Reading package lists... Done
ceres#

Then try "upgrade" and the machine drops off the network :

I have unstable the mix but as point of reference….

mgt@xray:~$ uname -a
Linux xray 5.10.0-3-sparc64 #1 Debian 5.10.13-1 (2021-02-06) sparc64 GNU/Linux mgt@xray:~$ cat /etc/debian_version
bullseye/sid
mgt@xray:~$ cat /proc/cpuinfo
cpu : TI UltraSparc IIi (Sabre)
fpu : UltraSparc IIi integrated FPU
pmu : ultra12
prom : OBP 3.31.0 2001/07/25 20:36
type : sun4u
ncpus probed : 1
ncpus active : 1
D$ parity tl1 : 0
I$ parity tl1 : 0
Cpu0ClkTck : 0000000013d92d40
cpucaps : flush,stbar,swap,muldiv,v9,mul32,div32,v8plus,vis
MMU Type : Spitfire
MMU PGSZs : 8K,64K,512K,4MB

root@xray:/home/users/mgt# apt update
Get:1 http://deb.debian.org/debian-ports sid InRelease [55.3 kB]
Get:2 http://deb.debian.org/debian-ports unreleased InRelease [56.6 kB]
Get:3 http://deb.debian.org/debian-ports sid/main all Packages [9,069 kB]
Get:4 http://deb.debian.org/debian-ports sid/main sparc64 Packages [21.5 MB]
Fetched 30.7 MB in 1min 55s (266 kB/s)
Reading package lists... Done
Building dependency tree... Done
Reading state information... Done
111 packages can be upgraded. Run 'apt list --upgradable' to see them. root@xray:/home/users/mgt# apt list --upgradeable
Listing… Done
.
.

apt upgrade was then run and 111 packages upgraded without issue….

Setting up systemd (247.3-1) ...
Timeout, server 172.16.35.61 not responding.

On the serial console we see :

ceres# [2968669.114937] systemd[1]: systemd 247.3-1 running in system
mode. (+PAM +AUDIT +SELINUX +IMA +APPARMOR +SMACK +SYSVINIT +UTMP +LIBCRYPTSETUP +GCRYPT +GNUTLS +ACL +XZ +LZ4 +ZSTD -SECCOMP +BLKID
+ELFUTILS +KMOD +IDN2 -IDN +PCRE2 default-hierarchy=unified)
[2968669.411163] systemd[1]: Detected architecture sparc64.
[2968696.703129] watchdog: BUG: soft lockup - CPU#0 stuck for 23s! [systemd:1]
[2968696.794780] Modules linked in: drm(E)
drm_panel_orientation_quirks(E) i2c_core(E) sg(E) envctrl(E)
display7seg(E) flash(E) fuse(E) configfs(E) ip_tables(E) x_tables(E) autofs4(E) ext4(E) crc16(E) mbcache(E) jbd2(E) crc32c_generic(E)
sd_mod(E) t10_pi(E) crc_t10dif(E) crct10dif_generic(E)
crct10dif_common(E) ata_generic(E) pata_cmd64x(E) libata(E) sym53c8xx(E) scsi_transport_spi(E) scsi_mod(E) sunhme(E)
[2968697.265208] CPU: 0 PID: 1 Comm: systemd Tainted: G E 5.10.0-1-sparc64 #1 Debian 5.10.5-1
[2968697.391074] TSTATE: 0000000011001604 TPC: 000000000094c4f0 TNPC: 000000000094c4f4 Y: 00000000 Tainted: G E
[2968697.541033] TPC: <misc_open+0x50/0x180>
[2968697.593712] g0: fffff800065a1c80 g1: 0000000000000098 g2: 0000000000000000 g3: 0000000000000002
[2968697.710488] g4: fffff80004197020 g5: 0000000000e93214 g6: fffff80004198000 g7: 0000000000500008
[2968697.827256] o0: 0000000000f24960 o1: fffff800049ab110 o2: 0000000000040000 o3: 0000000000000000
[2968697.944022] o4: 0000000000000000 o5: 0000000000000000 sp: fffff8000419af81 ret_pc: 000000000094c4c0
[2968698.065369] RPC: <misc_open+0x20/0x180>
[2968698.118074] l0: 0000000000f24800 l1: fffff800041ce021 l2: 00000003e775fef2 l3: 00000003e775fef2
[2968698.234848] l4: 0000000000020000 l5: fffff8000419b8f0 l6: 0000000000e12000 l7: 0000000000000001
[2968698.351615] i0: fffff8000b791048 i1: fffff800049ab100 i2: 0000000000f24800 i3: 0000000000f24978
[2968698.468381] i4: 00000000000000eb i5: 0000000010040818 i6: fffff8000419b031 i7: 0000000000665838
[2968698.585168] I7: <chrdev_open+0x98/0x1e0>
[2968698.638996] Call Trace:
[2968698.673323] [<0000000000665838>] chrdev_open+0x98/0x1e0
[2968698.744355] [<000000000065ae30>] do_dentry_open+0x170/0x420 [2968698.819928] [<000000000065ca68>] vfs_open+0x28/0x40
[2968698.886379] [<0000000000671348>] path_openat+0x988/0x1100 [2968698.959682] [<0000000000673dd0>] do_filp_open+0x50/0x100 [2968699.031837] [<000000000065cd30>] do_sys_openat2+0x70/0x180 [2968699.106284] [<000000000065d268>] sys_openat+0x48/0xc0
[2968699.175027] [<0000000000406174>] linux_sparc_syscall+0x34/0x44
~
Type 'go' to resume
ok ~
[EOT]

This is pretty consistent behavior. If someone has any ideas that would
be great. I realize that the old old Netra X1 or Netra T1 is well past
its prime but it does run very stable. I would love to fire up a big
Oracle M4000 unit to try but I have not heard from anyone anywhere that
knows if that can work at all. So for now these old netra units are all
that I can test with.

--
Dennis Clarke
RISC-V/SPARC/PPC/ARM/CISC
UNIX and Linux spoken
GreyBeard and suspenders optional

The Netra’s have few different devices wonder if there is a bug in one of those drivers?

-Mike

<html><head><meta http-equiv="Content-Type" content="text/html; charset=utf-8"></head><body style="word-wrap: break-word; -webkit-nbsp-mode: space; line-break: after-white-space;" class=""><br class=""><div><br class=""><blockquote type="cite" class=""><
div class="">On Mar 12, 2021, at 5:56 AM, Dennis Clarke <<a href="mailto:dclarke@blastwave.org" class="">dclarke@blastwave.org</a>> wrote:</div><br class="Apple-interchange-newline"><div class=""><div class=""><br class="">I have seen this for a
few months now. The old old netra machine will<br class="">run just fine endlessly but if I attempt to perform a package update<br class="">then I am always assured to see :<br class=""><br class=""><br class=""></div></div></blockquote><div><br class="">
</div><div>What kernel are you on? I do not have a Netra handy (but I have one in storage,  like everyone ;p ).  I have an Ultra 5 here so UltraSparc IIi CPU. It does not expect this behavior.  Any chance the memory module need to be
reseated?</div><br class=""><blockquote type="cite" class=""><div class=""><div class="">ceres# apt-get update<br class="">Get:1 <a href="http://deb.debian.org/debian-ports" class="">http://deb.debian.org/debian-ports</a> sid InRelease [55.3 kB]<br class=
"">Get:2 <a href="http://deb.debian.org/debian-ports" class="">http://deb.debian.org/debian-ports</a> sid/main sparc64 Packages [21.6 MB]<br class="">Get:3 <a href="http://deb.debian.org/debian-ports" class="">http://deb.debian.org/debian-ports</a> sid/
main all Packages [8,682<br class="">kB]<br class="">Fetched 30.3 MB in 1min 24s (361 kB/s)<br class=""><br class="">Reading package lists... Done<br class="">ceres#<br class=""><br class="">Then try "upgrade" and the machine drops off the network :<br
class=""><br class=""></div></div></blockquote><div><br class=""></div><div>I have unstable the mix but as point of reference….</div><div><br class=""></div><div><div style="margin: 0px; font-stretch: normal; font-size: 17px; line-height: normal; font-
family: Menlo;" class=""><span style="font-variant-ligatures: no-common-ligatures; color: rgb(47, 180, 29);" class=""><b class="">mgt@xray</b></span><span style="font-variant-ligatures: no-common-ligatures;" class="">:</span><span style="font-variant-
ligatures: no-common-ligatures; color: rgb(64, 11, 217);" class=""><b class="">~</b></span><span style="font-variant-ligatures: no-common-ligatures;" class="">$ uname -a</span></div><div style="margin: 0px; font-stretch: normal; font-size: 17px; line-
height: normal; font-family: Menlo;" class=""><span style="font-variant-ligatures: no-common-ligatures;" class="">Linux xray 5.10.0-3-sparc64 #1 Debian 5.10.13-1 (2021-02-06) sparc64 GNU/Linux</span></div><div style="margin: 0px; font-stretch: normal;
font-size: 17px; line-height: normal; font-family: Menlo;" class=""><span style="font-variant-ligatures: no-common-ligatures; color: rgb(47, 180, 29);" class=""><b class="">mgt@xray</b></span><span style="font-variant-ligatures: no-common-ligatures;"
class="">:</span><span style="font-variant-ligatures: no-common-ligatures; color: rgb(64, 11, 217);" class=""><b class="">~</b></span><span style="font-variant-ligatures: no-common-ligatures;" class="">$ cat /etc/debian_version </span></div><div
style="margin: 0px; font-stretch: normal; font-size: 17px; line-height: normal; font-family: Menlo;" class=""><span style="font-variant-ligatures: no-common-ligatures;" class="">bullseye/sid</span></div><div style="margin: 0px; font-stretch: normal; font-
size: 17px; line-height: normal; font-family: Menlo;" class=""><span style="font-variant-ligatures: no-common-ligatures; color: rgb(47, 180, 29);" class=""><b class="">mgt@xray</b></span><span style="font-variant-ligatures: no-common-ligatures;" class="">
:</span><span style="font-variant-ligatures: no-common-ligatures; color: rgb(64, 11, 217);" class=""><b class="">~</b></span><span style="font-variant-ligatures: no-common-ligatures;" class="">$ cat /proc/cpuinfo </span></div><div style="margin: 0px;
font-stretch: normal; font-size: 17px; line-height: normal; font-family: Menlo;" class=""><span style="font-variant-ligatures: no-common-ligatures;" class="">cpu<span class="Apple-tab-span" style="white-space: pre;"> </span>: TI UltraSparc IIi (Sabre)</
span></div><div style="margin: 0px; font-stretch: normal; font-size: 17px; line-height: normal; font-family: Menlo;" class=""><span style="font-variant-ligatures: no-common-ligatures;" class="">fpu<span class="Apple-tab-span" style="white-space: pre;"> <
/span>: UltraSparc IIi integrated FPU</span></div><div style="margin: 0px; font-stretch: normal; font-size: 17px; line-height: normal; font-family: Menlo;" class=""><span style="font-variant-ligatures: no-common-ligatures;" class="">pmu<span class="Apple-
tab-span" style="white-space: pre;"> </span>: ultra12</span></div><div style="margin: 0px; font-stretch: normal; font-size: 17px; line-height: normal; font-family: Menlo;" class=""><span style="font-variant-ligatures: no-common-ligatures;" class="">prom<
span class="Apple-tab-span" style="white-space: pre;"> </span>: OBP 3.31.0 2001/07/25 20:36</span></div><div style="margin: 0px; font-stretch: normal; font-size: 17px; line-height: normal; font-family: Menlo;" class=""><span style="font-variant-
ligatures: no-common-ligatures;" class="">type<span class="Apple-tab-span" style="white-space: pre;"> </span>: sun4u</span></div><div style="margin: 0px; font-stretch: normal; font-size: 17px; line-height: normal; font-family: Menlo;" class=""><span
style="font-variant-ligatures: no-common-ligatures;" class="">ncpus probed<span class="Apple-tab-span" style="white-space: pre;"> </span>: 1</span></div><div style="margin: 0px; font-stretch: normal; font-size: 17px; line-height: normal; font-family:
Menlo;" class=""><span style="font-variant-ligatures: no-common-ligatures;" class="">ncpus active<span class="Apple-tab-span" style="white-space: pre;"> </span>: 1</span></div><div style="margin: 0px; font-stretch: normal; font-size: 17px; line-height:
normal; font-family: Menlo;" class=""><span style="font-variant-ligatures: no-common-ligatures;" class="">D$ parity tl1<span class="Apple-tab-span" style="white-space: pre;"> </span>: 0</span></div><div style="margin: 0px; font-stretch: normal; font-size:
17px; line-height: normal; font-family: Menlo;" class=""><span style="font-variant-ligatures: no-common-ligatures;" class="">I$ parity tl1<span class="Apple-tab-span" style="white-space: pre;"> </span>: 0</span></div><div style="margin: 0px; font-
stretch: normal; font-size: 17px; line-height: normal; font-family: Menlo;" class=""><span style="font-variant-ligatures: no-common-ligatures;" class="">Cpu0ClkTck<span class="Apple-tab-span" style="white-space: pre;"> </span>: 0000000013d92d40</span></

<div style="margin: 0px; font-stretch: normal; font-size: 17px; line-height: normal; font-family: Menlo;" class=""><span style="font-variant-ligatures: no-common-ligatures;" class="">cpucaps<span class="Apple-tab-span" style="white-space: pre;"> </

span>: flush,stbar,swap,muldiv,v9,mul32,div32,v8plus,vis</span></div><div style="margin: 0px; font-stretch: normal; font-size: 17px; line-height: normal; font-family: Menlo;" class=""><span style="font-variant-ligatures: no-common-ligatures;" class="">
MMU Type<span class="Apple-tab-span" style="white-space: pre;"> </span>: Spitfire</span></div><div style="margin: 0px; font-stretch: normal; font-size: 17px; line-height: normal; font-family: Menlo;" class=""><span style="font-variant-ligatures: no-
common-ligatures;" class="">MMU PGSZs<span class="Apple-tab-span" style="white-space: pre;"> </span>: 8K,64K,512K,4MB</span></div><div style="margin: 0px; font-stretch: normal; font-size: 17px; line-height: normal; font-family: Menlo;" class=""><span
style="font-variant-ligatures: no-common-ligatures;" class=""><br class=""></span></div><div style="margin: 0px; font-stretch: normal; font-size: 17px; line-height: normal; font-family: Menlo;" class=""><span style="font-variant-ligatures: no-common-
ligatures;" class=""><div style="margin: 0px; font-stretch: normal; line-height: normal;" class=""><span style="font-variant-ligatures: no-common-ligatures;" class="">root@xray:/home/users/mgt# apt update</span></div><div style="margin: 0px; font-stretch:
normal; line-height: normal;" class=""><span style="font-variant-ligatures: no-common-ligatures;" class="">Get:1 <a href="http://deb.debian.org/debian-ports" class="">http://deb.debian.org/debian-ports</a> sid InRelease [55.3 kB]</span></div><div style="
margin: 0px; font-stretch: normal; line-height: normal;" class=""><span style="font-variant-ligatures: no-common-ligatures;" class="">Get:2 <a href="http://deb.debian.org/debian-ports" class="">http://deb.debian.org/debian-ports</a> unreleased InRelease [
56.6 kB]</span></div><div style="margin: 0px; font-stretch: normal; line-height: normal;" class=""><span style="font-variant-ligatures: no-common-ligatures;" class="">Get:3 <a href="http://deb.debian.org/debian-ports" class="">http://deb.debian.org/
debian-ports</a> sid/main all Packages [9,069 kB]                                                         &
nbsp;                    </span></div><div style="margin: 0px; font-stretch: normal; line-height: normal;" class=""><span style="font-variant-ligatures: no-common-ligatures;" class="">Get:4 <a href="http:/
/deb.debian.org/debian-ports" class="">http://deb.debian.org/debian-ports</a> sid/main sparc64 Packages [21.5 MB]                                        &
nbsp;                                  </span></div><div style="margin: 0px; font-stretch: normal; line-height: normal; color: rgb(159, 160, 28);" class=""><span style="
font-variant-ligatures: no-common-ligatures; color: rgb(0, 0, 0);" class="">Fetched 30.7 MB in 1min 55s (266 kB/s)</span><span style="font-variant-ligatures: no-common-ligatures;" class="">                  &
nbsp;                                                                       &
nbsp;                      </span></div><div style="margin: 0px; font-stretch: normal; line-height: normal;" class=""><span style="font-variant-ligatures: no-common-ligatures;" class="">Reading
package lists... Done</span></div><div style="margin: 0px; font-stretch: normal; line-height: normal;" class=""><span style="font-variant-ligatures: no-common-ligatures;" class="">Building dependency tree... Done</span></div><div style="margin: 0px; font-
stretch: normal; line-height: normal;" class=""><span style="font-variant-ligatures: no-common-ligatures;" class="">Reading state information... Done</span></div><div style="margin: 0px; font-stretch: normal; line-height: normal;" class=""><span style="
font-variant-ligatures: no-common-ligatures;" class="">111 packages can be upgraded. Run 'apt list --upgradable' to see them.</span></div><div style="margin: 0px; font-stretch: normal; line-height: normal;" class=""><span style="font-variant-ligatures:
no-common-ligatures;" class="">root@xray:/home/users/mgt# apt list --upgradeable</span></div><div style="margin: 0px; font-stretch: normal; line-height: normal;" class=""><span style="font-variant-ligatures: no-common-ligatures;" class="">Listing… Done<
/span></div><div style="margin: 0px; font-stretch: normal; line-height: normal;" class=""><span style="font-variant-ligatures: no-common-ligatures;" class="">.</span></div><div style="margin: 0px; font-stretch: normal; line-height: normal;" class=""><
span style="font-variant-ligatures: no-common-ligatures;" class="">.</span></div><div style="margin: 0px; font-stretch: normal; line-height: normal;" class=""><span style="font-variant-ligatures: no-common-ligatures;" class=""><br class=""></span></div><
div style="margin: 0px; font-stretch: normal; line-height: normal;" class="">apt upgrade was then run and 111 packages upgraded without issue….</div><div style="margin: 0px; font-stretch: normal; line-height: normal;" class=""><br class=""></div></span>
</div></div><blockquote type="cite" class=""><div class=""><div class="">Setting up systemd (247.3-1) ...<br class="">Timeout, server 172.16.35.61 not responding.<br class=""><br class="">On the serial console we see :<br class=""><br class="">ceres# [
2968669.114937] systemd[1]: systemd 247.3-1 running in system<br class="">mode. (+PAM +AUDIT +SELINUX +IMA +APPARMOR +SMACK +SYSVINIT +UTMP<br class="">+LIBCRYPTSETUP +GCRYPT +GNUTLS +ACL +XZ +LZ4 +ZSTD -SECCOMP +BLKID<br class="">+ELFUTILS +KMOD +IDN2 -
IDN +PCRE2 default-hierarchy=unified)<br class="">[2968669.411163] systemd[1]: Detected architecture sparc64.<br class="">[2968696.703129] watchdog: BUG: soft lockup - CPU#0 stuck for 23s!<br class="">[systemd:1]<br class="">[2968696.794780] Modules
linked in: drm(E)<br class="">drm_panel_orientation_quirks(E) i2c_core(E) sg(E) envctrl(E)<br class="">display7seg(E) flash(E) fuse(E) configfs(E) ip_tables(E) x_tables(E)<br class="">autofs4(E) ext4(E) crc16(E) mbcache(E) jbd2(E) crc32c_generic(E)<br
class="">sd_mod(E) t10_pi(E) crc_t10dif(E) crct10dif_generic(E)<br class="">crct10dif_common(E) ata_generic(E) pata_cmd64x(E) libata(E) sym53c8xx(E)<br class="">scsi_transport_spi(E) scsi_mod(E) sunhme(E)<br class="">[2968697.265208] CPU: 0 PID: 1 Comm:
systemd Tainted: G            E<br class=""> 5.10.0-1-sparc64 #1 Debian 5.10.5-1<br class="">[2968697.391074] TSTATE: 0000000011001604 TPC: 000000000094c4f0 TNPC:<br class="">000000000094c4f4 Y:
00000000    Tainted: G            E<br class="">[2968697.541033] TPC: <misc_open+0x50/0x180><br class="">[2968697.593712] g0: fffff800065a1c80 g1: 0000000000000098 g2:<br class=""

0000000000000000 g3: 0000000000000002<br class="">[2968697.710488] g4: fffff80004197020 g5: 0000000000e93214 g6:<br class="">fffff80004198000 g7: 0000000000500008<br class="">[2968697.827256] o0: 0000000000f24960 o1: fffff800049ab110 o2:<br class="">

0000000000040000 o3: 0000000000000000<br class="">[2968697.944022] o4: 0000000000000000 o5: 0000000000000000 sp:<br class="">fffff8000419af81 ret_pc: 000000000094c4c0<br class="">[2968698.065369] RPC: <misc_open+0x20/0x180><br class="">[2968698.
118074] l0: 0000000000f24800 l1: fffff800041ce021 l2:<br class="">00000003e775fef2 l3: 00000003e775fef2<br class="">[2968698.234848] l4: 0000000000020000 l5: fffff8000419b8f0 l6:<br class="">0000000000e12000 l7: 0000000000000001<br class="">[2968698.
351615] i0: fffff8000b791048 i1: fffff800049ab100 i2:<br class="">0000000000f24800 i3: 0000000000f24978<br class="">[2968698.468381] i4: 00000000000000eb i5: 0000000010040818 i6:<br class="">fffff8000419b031 i7: 0000000000665838<br class="">[2968698.
585168] I7: <chrdev_open+0x98/0x1e0><br class="">[2968698.638996] Call Trace:<br class="">[2968698.673323] [<0000000000665838>] chrdev_open+0x98/0x1e0<br class="">[2968698.744355] [<000000000065ae30>] do_dentry_open+0x170/0x420<br class=
"">[2968698.819928] [<000000000065ca68>] vfs_open+0x28/0x40<br class="">[2968698.886379] [<0000000000671348>] path_openat+0x988/0x1100<br class="">[2968698.959682] [<0000000000673dd0>] do_filp_open+0x50/0x100<br class="">[2968699.031837]
[<000000000065cd30>] do_sys_openat2+0x70/0x180<br class="">[2968699.106284] [<000000000065d268>] sys_openat+0x48/0xc0<br class="">[2968699.175027] [<0000000000406174>] linux_sparc_syscall+0x34/0x44<br class="">~<br class="">Type  '
go' to resume<br class="">ok ~<br class="">[EOT]<br class=""><br class="">This is pretty consistent behavior. If someone has any ideas that would<br class="">be great. I realize that the old old Netra X1 or Netra T1 is well past<br class="">its prime but
it does run very stable.  I would love to fire up a big<br class="">Oracle M4000 unit to try but I have not heard from anyone anywhere that<br class="">knows if that can work at all. So for now these old netra units are all<br class="">that I can
test with.<br class=""><br class=""><br class="">-- <br class="">Dennis Clarke<br class="">RISC-V/SPARC/PPC/ARM/CISC<br class="">UNIX and Linux spoken<br class="">GreyBeard and suspenders optional<br class=""></div></div></blockquote><br class=""></div><

The Netra’s have few different devices wonder if there is a bug in one of those drivers?</div><div><br class=""></div><div>-Mike</div><br class=""></body></html>

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

From Mike Tremaine@21:1/5 to All on Sat Mar 13 20:00:03 2021

ADDED: I wonder if it’s systemd specifically that causes this for you based on the console output.

I have this in dmesg which matches the start of your output.

[Mar13 09:26] systemd[1]: systemd 247.3-3 running in system mode. (+PAM +AUDIT +SELINUX +IMA +APPARMOR +SMACK +SYSVINIT +UTMP +LIBCRYPTSETUP +GCRYPT +GNUTLS +ACL +XZ +LZ4 +ZSTD -SECCOMP +BLKID +ELFUTILS +KMOD +IDN2 -IDN +PCRE2 default-hierarchy=unified)
[ +0.000928] systemd[1]: Detected architecture sparc64.
[ +2.019373] systemd[1]: /lib/systemd/system/plymouth-start.service:16: Unit configured to use KillMode=none. This is unsafe, as it disables systemd's process lifecycle management for the service. Please update your service to use a safer KillMode=,
such as 'mixed' or 'control-group'. Support for KillMode=none is deprecated and will eventually be removed.
[ +5.426590] systemd-journald[181]: Received client request to relinquish /var/log/journal/5cbf90a5ea124196a208a4297d97ce25 access.
[ +0.099645] systemd-journald[181]: Received SIGTERM from PID 1 (systemd).
[ +0.016174] systemd[1]: Stopping Journal Service...
[ +0.017883] systemd[1]: systemd-journald.service: Succeeded.
[ +0.010165] systemd[1]: Stopped Journal Service.
[ +0.000494] systemd[1]: systemd-journald.service: Consumed 2.275s CPU time.
[ +0.088359] systemd[1]: Starting Journal Service...
[ +0.181377] systemd[1]: Started Journal Service.
[ +0.359133] systemd-journald[2434]: Received client request to flush runtime journal.

In /var/log/apt/term.log we see it hits the systemd about the right time...

Log started: 2021-03-13 09:25:35
(Reading database ... 150966 files and directories currently installed.) Preparing to unpack .../dash_0.5.11+git20210120+802ebd4-1_sparc64.deb ... Unpacking dash (0.5.11+git20210120+802ebd4-1) over (0.5.11+git20200708+dd9ef66-5) ...
Setting up dash (0.5.11+git20210120+802ebd4-1) ...
(Reading database ... 150966 files and directories currently installed.) Preparing to unpack .../gzip_1.10-4_sparc64.deb ...
Unpacking gzip (1.10-4) over (1.10-3) ...
Setting up gzip (1.10-4) ...
(Reading database ... 150966 files and directories currently installed.) Preparing to unpack .../liblzma5_5.2.5-2_sparc64.deb ...
Unpacking liblzma5:sparc64 (5.2.5-2) over (5.2.5-1.0) ...
Setting up liblzma5:sparc64 (5.2.5-2) ...
(Reading database ... 150966 files and directories currently installed.) Preparing to unpack .../libnss-systemd_247.3-3_sparc64.deb ...
Unpacking libnss-systemd:sparc64 (247.3-3) over (247.3-1) ...
Preparing to unpack .../libsystemd0_247.3-3_sparc64.deb ...
Unpacking libsystemd0:sparc64 (247.3-3) over (247.3-1) ...
Setting up libsystemd0:sparc64 (247.3-3) ...
(Reading database ... 150966 files and directories currently installed.) Preparing to unpack .../systemd-timesyncd_247.3-3_sparc64.deb ...
Unpacking systemd-timesyncd (247.3-3) over (247.3-1) ...
Preparing to unpack .../libpam-systemd_247.3-3_sparc64.deb ...
Unpacking libpam-systemd:sparc64 (247.3-3) over (247.3-1) ...
Preparing to unpack .../systemd_247.3-3_sparc64.deb ...
Unpacking systemd (247.3-3) over (247.3-1) ...
Preparing to unpack .../udev_247.3-3_sparc64.deb ...
Unpacking udev (247.3-3) over (247.3-1) ...
Preparing to unpack .../libudev1_247.3-3_sparc64.deb ...
Unpacking libudev1:sparc64 (247.3-3) over (247.3-1) ...
Setting up libudev1:sparc64 (247.3-3) ...
Setting up systemd-timesyncd (247.3-3) ...
Setting up systemd (247.3-3) ...
(Reading database ... 150966 files and directories currently installed.) Preparing to unpack .../systemd-sysv_247.3-3_sparc64.deb ...
Unpacking systemd-sysv (247.3-3) over (247.3-1) ...

-Mike

On Mar 13, 2021, at 9:29 AM, Mike Tremaine <mgt@stellarcore.net> wrote:

On Mar 12, 2021, at 5:56 AM, Dennis Clarke <dclarke@blastwave.org <mailto:dclarke@blastwave.org>> wrote:

I have seen this for a few months now. The old old netra machine will
run just fine endlessly but if I attempt to perform a package update
then I am always assured to see :

What kernel are you on? I do not have a Netra handy (but I have one in storage, like everyone ;p ). I have an Ultra 5 here so UltraSparc IIi CPU. It does not expect this behavior. Any chance the memory module need to be reseated?

ceres# apt-get update
Get:1 http://deb.debian.org/debian-ports <http://deb.debian.org/debian-ports> sid InRelease [55.3 kB]
Get:2 http://deb.debian.org/debian-ports <http://deb.debian.org/debian-ports> sid/main sparc64 Packages [21.6 MB]
Get:3 http://deb.debian.org/debian-ports <http://deb.debian.org/debian-ports> sid/main all Packages [8,682
kB]
Fetched 30.3 MB in 1min 24s (361 kB/s)

Reading package lists... Done
ceres#

Then try "upgrade" and the machine drops off the network :

I have unstable the mix but as point of reference….

mgt@xray:~$ uname -a
Linux xray 5.10.0-3-sparc64 #1 Debian 5.10.13-1 (2021-02-06) sparc64 GNU/Linux
mgt@xray:~$ cat /etc/debian_version
bullseye/sid
mgt@xray:~$ cat /proc/cpuinfo
cpu : TI UltraSparc IIi (Sabre)
fpu : UltraSparc IIi integrated FPU
pmu : ultra12
prom : OBP 3.31.0 2001/07/25 20:36
type : sun4u
ncpus probed : 1
ncpus active : 1
D$ parity tl1 : 0
I$ parity tl1 : 0
Cpu0ClkTck : 0000000013d92d40
cpucaps : flush,stbar,swap,muldiv,v9,mul32,div32,v8plus,vis
MMU Type : Spitfire
MMU PGSZs : 8K,64K,512K,4MB

root@xray:/home/users/mgt# apt update
Get:1 http://deb.debian.org/debian-ports <http://deb.debian.org/debian-ports> sid InRelease [55.3 kB]
Get:2 http://deb.debian.org/debian-ports <http://deb.debian.org/debian-ports> unreleased InRelease [56.6 kB]
Get:3 http://deb.debian.org/debian-ports <http://deb.debian.org/debian-ports> sid/main all Packages [9,069 kB]
Get:4 http://deb.debian.org/debian-ports <http://deb.debian.org/debian-ports> sid/main sparc64 Packages [21.5 MB]
Fetched 30.7 MB in 1min 55s (266 kB/s)
Reading package lists... Done
Building dependency tree... Done
Reading state information... Done
111 packages can be upgraded. Run 'apt list --upgradable' to see them. root@xray:/home/users/mgt# apt list --upgradeable
Listing… Done
.
.

apt upgrade was then run and 111 packages upgraded without issue….

Setting up systemd (247.3-1) ...
Timeout, server 172.16.35.61 not responding.

On the serial console we see :

ceres# [2968669.114937] systemd[1]: systemd 247.3-1 running in system
mode. (+PAM +AUDIT +SELINUX +IMA +APPARMOR +SMACK +SYSVINIT +UTMP
+LIBCRYPTSETUP +GCRYPT +GNUTLS +ACL +XZ +LZ4 +ZSTD -SECCOMP +BLKID
+ELFUTILS +KMOD +IDN2 -IDN +PCRE2 default-hierarchy=unified)
[2968669.411163] systemd[1]: Detected architecture sparc64.
[2968696.703129] watchdog: BUG: soft lockup - CPU#0 stuck for 23s!
[systemd:1]
[2968696.794780] Modules linked in: drm(E)
drm_panel_orientation_quirks(E) i2c_core(E) sg(E) envctrl(E)
display7seg(E) flash(E) fuse(E) configfs(E) ip_tables(E) x_tables(E)
autofs4(E) ext4(E) crc16(E) mbcache(E) jbd2(E) crc32c_generic(E)
sd_mod(E) t10_pi(E) crc_t10dif(E) crct10dif_generic(E)
crct10dif_common(E) ata_generic(E) pata_cmd64x(E) libata(E) sym53c8xx(E)
scsi_transport_spi(E) scsi_mod(E) sunhme(E)
[2968697.265208] CPU: 0 PID: 1 Comm: systemd Tainted: G E
5.10.0-1-sparc64 #1 Debian 5.10.5-1
[2968697.391074] TSTATE: 0000000011001604 TPC: 000000000094c4f0 TNPC:
000000000094c4f4 Y: 00000000 Tainted: G E
[2968697.541033] TPC: <misc_open+0x50/0x180>
[2968697.593712] g0: fffff800065a1c80 g1: 0000000000000098 g2:
0000000000000000 g3: 0000000000000002
[2968697.710488] g4: fffff80004197020 g5: 0000000000e93214 g6:
fffff80004198000 g7: 0000000000500008
[2968697.827256] o0: 0000000000f24960 o1: fffff800049ab110 o2:
0000000000040000 o3: 0000000000000000
[2968697.944022] o4: 0000000000000000 o5: 0000000000000000 sp:
fffff8000419af81 ret_pc: 000000000094c4c0
[2968698.065369] RPC: <misc_open+0x20/0x180>
[2968698.118074] l0: 0000000000f24800 l1: fffff800041ce021 l2:
00000003e775fef2 l3: 00000003e775fef2
[2968698.234848] l4: 0000000000020000 l5: fffff8000419b8f0 l6:
0000000000e12000 l7: 0000000000000001
[2968698.351615] i0: fffff8000b791048 i1: fffff800049ab100 i2:
0000000000f24800 i3: 0000000000f24978
[2968698.468381] i4: 00000000000000eb i5: 0000000010040818 i6:
fffff8000419b031 i7: 0000000000665838
[2968698.585168] I7: <chrdev_open+0x98/0x1e0>
[2968698.638996] Call Trace:
[2968698.673323] [<0000000000665838>] chrdev_open+0x98/0x1e0
[2968698.744355] [<000000000065ae30>] do_dentry_open+0x170/0x420
[2968698.819928] [<000000000065ca68>] vfs_open+0x28/0x40
[2968698.886379] [<0000000000671348>] path_openat+0x988/0x1100
[2968698.959682] [<0000000000673dd0>] do_filp_open+0x50/0x100
[2968699.031837] [<000000000065cd30>] do_sys_openat2+0x70/0x180
[2968699.106284] [<000000000065d268>] sys_openat+0x48/0xc0
[2968699.175027] [<0000000000406174>] linux_sparc_syscall+0x34/0x44
~
Type 'go' to resume
ok ~
[EOT]

This is pretty consistent behavior. If someone has any ideas that would
be great. I realize that the old old Netra X1 or Netra T1 is well past
its prime but it does run very stable. I would love to fire up a big
Oracle M4000 unit to try but I have not heard from anyone anywhere that
knows if that can work at all. So for now these old netra units are all
that I can test with.

--
Dennis Clarke
RISC-V/SPARC/PPC/ARM/CISC
UNIX and Linux spoken
GreyBeard and suspenders optional

The Netra’s have few different devices wonder if there is a bug in one of those drivers?

-Mike

[continued in next message]

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

From Dennis Clarke@21:1/5 to Mike Tremaine on Sat Mar 13 20:40:01 2021

On 3/13/21 5:29 PM, Mike Tremaine wrote:

On Mar 12, 2021, at 5:56 AM,
Dennis Clarke <dclarke@blastwave.org> wrote:

I have seen this for a few months now. The old old netra machine will
run just fine endlessly but if I attempt to perform a package update
then I am always assured to see :

What kernel are you on?

Let me address that *after* we look at the hardware diagnostics. The old
Netra t1 105 is pretty much indestructible with the exception being that
the internal battery will die. Which mine has. However this affects
nothing as the machine can be left plugged in and powered on for years
and the firmware variables are trivial to setup if needed. I did do a
power down and then left it cold for a day or two. That is a good way to
see the full hardware diagnostics when the power plug is put back in.

Thus :

LOMlite console
Standby

LOM event: LOM reset

poweron

LOM event: power on

ps/2 kbd check: 0000.0000.0000.00fe
LOM event: Fan 1 failed

LOM event: Fault LED 3Hz

Checking Sun KB Done
%o0 = 0000.0000.0055.4001

Executing Power On SelfTest

SPARCengine(tm)Ultra CP 1500 POST 1.17 ME created 03/06/00
WARRNING: NVRAM battery is either bad or just replaced!
Time Stamp [hour:min:sec] 33:30:02

Init POST BSS
Init System BSS

Probing system keyboard : Done
DMMU TLB Tags
DMMU TLB Tag Access Test
DMMU TLB RAM
DMMU TLB RAM Access Test
Ecache Tests
Probe Ecache
ecache_size = 0x00200000
Ecache RAM Addr Test
Ecache Tag Addr Test
Ecache RAM Test
Ecache Tag Test
Invalidate Ecache Tags
All CPU Basic Tests
V9 Instruction Test
CPU Tick and Tick Compare Reg Test
CPU Soft Trap Test
CPU Softint Reg and Int Test
All Basic MMU Tests
DMMU Primary Context Reg Test
DMMU Secondary Context Reg Test
DMMU TSB Reg Test
DMMU Tag Access Reg Test
DMMU VA Watchpoint Reg Test
DMMU PA Watchpoint Reg Test
IMMU TSB Reg Test
IMMU Tag Access Reg Test
IMMU TLB RAM Access Test
IMMU TLB Tag Access Test
All Basic Cache Tests
Dcache RAM Test
Dcache Tag Test
Icache RAM Test
Icache Tag Test
Icache Next Test
Icache Predecode Test
UltraSPARC IIi MCU Control & Status Regs Init and Tests
Init UltraSPARC IIi MCU Control & Status Regs
CPU speed : 440 Mhz, mc1 set : 0x544cb9dd
Memory Probe and Init
Probe Memory
INFO: All the memory Group in 10 bit column mode
Group 0: 256MB
Group 1: 256MB
Group 2: 256MB
Group 3: 256MB
Malloc Post Memory
Init Post Memory
..........
Memory Addr w/ Ecache
Map PROM/STACK/NVRAM in DMMU
Load Post In Memory
Run POST from MEM
..........
loaded POST in memory
Update Master Stack/Frame Pointers
All FPU Basic Tests
FPU Regs Test
FPU State Reg Test
FPU Functional Test
FPU Trap Test
Memory Tests
Init Memory
...............
................
................
................
................
................
................
................
Memory Addr w/ Ecache Test
ECC Memory Addr Test
Block Memory Addr Test
Block Memory Test
...............
...............

................
................

................
................

................
................

................
................

................
................

................
................

................
................

ECC Blk Memory Test
...............
...............

................
................

................
................

................
................

................
................

................
................

................
................

................
................

All Basic UltraSPARC IIi PBM Tests
Init UltraSPARC IIi PBM
PIO Decoder and BCT Test
PCI Byte Enable Test
UltraSPARC IIi IOMMU Regs Test
UltraSPARC IIi IOMMU RAM NTA Test
UltraSPARC IIi IOMMU CAM NTA Test
UltraSPARC IIi IOMMU RAM Address Test
UltraSPARC IIi IOMMU CAM Address Test
IOMMU TLB Compare Test
IOMMU TLB Flush Test
PBM Control/Status Reg Test
PBM Diag Reg Test
UltraSPARC IIi PBM Regs Test
All Advanced CPU Tests
DMMU Hit/Miss Test
DMMU Little Endian Test
IU ASI Access Test
FPU ASI Access Test
Ecache Thrash Test
All CPU Error Reporting Tests
CPU Addr Align Trap Test
DMMU Access Priv Page Test
DMMU Write Protected Page Test
All Advanced UltraSPARC IIi PBM Tests
Init UltraSPARC IIi PBM
Consist DMA Wr, IOMMU hit Ebus Test
All Basic Cheerio Tests
Cheerio Ebus PCI Config Space Test
Cheerio Ethernet PCI Config Space Test
Cheerio Ebus Engine Reg Test
Cheerio Init
All Basic I2c Tests
Init i2c bus
Thermister Reading Test
Thermister Position Readings (in Hex)
CPU 0x70
All Basic PCI-PCI Bridge Tests
PCI-PCI Bridge Config Space Test
All Basic Symbios 875 SCSI controller Tests
Symbios 875 SCSI controller PCI Config Space Test

Extended POST:
Start Extended POST : No EXT POST is found

Power On Selftest Completed
Status = 0000.0000.0000.0000 ffff.ffff.f100.1db0 019f.3333.3a50.0011

Software Power ON

@(#) SPARCengine(tm)Ultra CP 1500 3.10.27 ME created 2000/06/22 16:45
Enter Checking KB
ps/2 kbd check: 0000.0000.0000.00fe
Checking Sun KB
Clearing E$ Tags Done
Clearing I/D TLBs Done
Probing Memory
Group Info[0000.0000.0000.0003] : 0000.0000.0000.0110
Group Info[0000.0000.0000.0002] : 0000.0000.0000.0110
Group Info[0000.0000.0000.0001] : 0000.0000.0000.0110
Group Info[0000.0000.0000.0000] : 0000.0000.0000.0110
Done
Clearing Memory...Done
MEM BASE = 0000.0000.3800.0000
MEM SIZE = 0000.0000.0800.0000
MMUs ON
Copy Done
PC = 0000.01ff.f000.30dc
PC = 0000.0000.0000.3120
Decompressing into Memory Done
Size = 0000.0000.0008.7710
ttya initialized
flashprom flashprom Incorrect configuration checksum;
Setting NVRAM parameters to default values.
Setting diag-switch? NVRAM parameter to true
Reset Control: BXIR:0 BPOR:0 SXIR:0 SPOR:1 POR:0
UltraSPARC-IIi Version 9.1 (E$=2 MB) 2-2 module
Advanced PCI Bridge Version 1.3
Probing Memory Group #0 128 + 128 : 256 Megabytes
Probing Memory Group #1 128 + 128 : 256 Megabytes
Probing Memory Group #2 128 + 128 : 256 Megabytes
Probing Memory Group #3 128 + 128 : 256 Megabytes
Initialise 2nd I2c controller
Environmental monitoring: Enabled
i2c adc gpio gpio
i2c Probing Floppy: No drives detected
Probing /pci@1f,0/pci@1,1 at Device 1 network
Probing /pci@1f,0/pci@1,1 at Device 2 scsi disk tape
Probing /pci@1f,0/pci@1,1 at Device 3 network
Probing /pci@1f,0/pci@1 at Device 1 pci
Probing /pci@1f,0/pci@1/pci@1 at Device 0 Nothing there
Probing /pci@1f,0/pci@1/pci@1 at Device 1 Nothing there
Probing /pci@1f,0/pci@1/pci@1 at Device 2 Nothing there
Probing /pci@1f,0/pci@1/pci@1 at Device 3 Nothing there
Probing /pci@1f,0/pci@1/pci@1 at Device 4 Nothing there
Probing /pci@1f,0/pci@1/pci@1 at Device 5 Nothing there
Probing /pci@1f,0/pci@1/pci@1 at Device 6 Nothing there
Probing /pci@1f,0/pci@1/pci@1 at Device 7 Nothing there
Probing /pci@1f,0/pci@1/pci@1 at Device 8 Nothing there
Probing /pci@1f,0/pci@1/pci@1 at Device 9 Nothing there
Probing /pci@1f,0/pci@1/pci@1 at Device a Nothing there
Probing /pci@1f,0/pci@1/pci@1 at Device b Nothing there
Probing /pci@1f,0/pci@1/pci@1 at Device c Nothing there
Probing /pci@1f,0/pci@1/pci@1 at Device d Nothing there
Probing /pci@1f,0/pci@1/pci@1 at Device e ide disk cdrom
Probing /pci@1f,0/pci@1/pci@1 at Device f Nothing there

Netra t1 (UltraSPARC-IIi 440MHz), No Keyboard
OpenBoot 3.10.27 ME, 1024 MB memory installed, Serial #12731976.
Ethernet address 8:0:20:c2:46:48, Host ID: 80c24648.

Boot device: net File and args:
Using External Transceiver - Link Up.
Timeout waiting for ARP/RARP packet
Timeout waiting for ARP/RARP packet
Timeout waiting for ARP/RARP packet
Timeout waiting for ARP/RARP packet
~
ok

I did sent a BRK to the serial port and that drops us into the firmware
"ok" prompt. There is a failed fan but in fact the fan is entirely not
there. At all. I removed it because it had failed five or six years ago
and getting another one is just annoying. Also it is not really needed.

We can see that there is 1G of ECC memory and the memory passes all the
basic tests.

Now I setup a few of the firmware variables and reset the unit :

ok printenv
Variable Name Value Default Value

tpe-link-test? true true
scsi-initiator-id 7 7
keyboard-click? false false
keymap
watchdog-timeout 65535 65535
watchdog-enable? false false
ttyb-rts-dtr-off false false
ttyb-ignore-cd true true
ttya-rts-dtr-off false false
ttya-ignore-cd true true
ttyb-mode 9600,8,n,1,- 9600,8,n,1,-
ttya-mode 9600,8,n,1,- 9600,8,n,1,- cpci-probe-list 0,1,2,3,4,5,6,7,8,9,a,b, ...
0,1,2,3,4,5,6,7,8,9,a,b, ...
pcia-probe-list 1 1
pcib-probe-list 1,2,3 1,2,3
mfg-mode off off
diag-level max max
d7s-flipped? false false
#power-cycles 1431655766
system-board-serial# 20 00 00 00 00 01 01 10 ...
system-board-date 00 10 00 00 10 14 01 10 ...
last-poweroff-cause 0 0
env-monitor enabled enabled shutdown-temperature 72 72
warning-temperature 67 67
fcode-debug? false false
output-device ttya ttya
input-device ttya ttya
load-base 16384 16384
auto-boot-retry? false false
use-boot-table? false false
boot-command boot boot
auto-boot? true true
watchdog-reboot? false false
diag-file
diag-device net net
boot-file
boot-device disk net disk net local-mac-address? false false
net-timeout 0 0
ansi-terminal? true true
screen-#columns 80 80
screen-#rows 34 34
silent-mode? false false
use-nvramrc? false false
nvramrc
security-mode none
security-password
security-#badlogins 256
oem-logo
oem-logo? false false
oem-banner
oem-banner? false false
hardware-revision 55 55 54 55 15 55 55 45 ...
last-hardware-update UUUUQUUU ...
diag-switch? true false
ok setenv #power-cycles 0
#power-cycles = 0
ok setenv auto-boot? false
auto-boot? = false
ok

Easy. Now a full reset :

ok reset-all
Resetting ...

Software Power ON

@(#) SPARCengine(tm)Ultra CP 1500 3.10.27 ME created 2000/06/22 16:45
Enter Checking KB
ps/2 kbd check: 0000.0000.0000.00fe
Checking Sun KB
Clearing E$ Tags Done
Clearing I/D TLBs Done
Probing Memory
Group Info[0000.0000.0000.0003] : 0000.0000.0000.0110
Group Info[0000.0000.0000.0002] : 0000.0000.0000.0110
Group Info[0000.0000.0000.0001] : 0000.0000.0000.0110
Group Info[0000.0000.0000.0000] : 0000.0000.0000.0110
Done
Clearing Memory...Done
MEM BASE = 0000.0000.3800.0000
MEM SIZE = 0000.0000.0800.0000
MMUs ON
Copy Done
PC = 0000.01ff.f000.30dc
PC = 0000.0000.0000.3120
Decompressing into Memory Done
Size = 0000.0000.0008.7710
ttya initialized
flashprom flashprom Reset Control: BXIR:0 BPOR:0 SXIR:0 SPOR:1 POR:0 UltraSPARC-IIi Version 9.1 (E$=2 MB) 2-2 module
Advanced PCI Bridge Version 1.3
Probing Memory Group #0 128 + 128 : 256 Megabytes
Probing Memory Group #1 128 + 128 : 256 Megabytes
Probing Memory Group #2 128 + 128 : 256 Megabytes
Probing Memory Group #3 128 + 128 : 256 Megabytes
Initialise 2nd I2c controller
Environmental monitoring: Enabled
i2c adc gpio gpio
i2c Probing Floppy: No drives detected
Probing /pci@1f,0/pci@1,1 at Device 1 network
Probing /pci@1f,0/pci@1,1 at Device 2 scsi disk tape
Probing /pci@1f,0/pci@1,1 at Device 3 network
Probing /pci@1f,0/pci@1 at Device 1 pci
Probing /pci@1f,0/pci@1/pci@1 at Device 0 Nothing there
Probing /pci@1f,0/pci@1/pci@1 at Device 1 Nothing there
Probing /pci@1f,0/pci@1/pci@1 at Device 2 Nothing there
Probing /pci@1f,0/pci@1/pci@1 at Device 3 Nothing there
Probing /pci@1f,0/pci@1/pci@1 at Device 4 Nothing there
Probing /pci@1f,0/pci@1/pci@1 at Device 5 Nothing there
Probing /pci@1f,0/pci@1/pci@1 at Device 6 Nothing there
Probing /pci@1f,0/pci@1/pci@1 at Device 7 Nothing there
Probing /pci@1f,0/pci@1/pci@1 at Device 8 Nothing there
Probing /pci@1f,0/pci@1/pci@1 at Device 9 Nothing there
Probing /pci@1f,0/pci@1/pci@1 at Device a Nothing there
Probing /pci@1f,0/pci@1/pci@1 at Device b Nothing there
Probing /pci@1f,0/pci@1/pci@1 at Device c Nothing there
Probing /pci@1f,0/pci@1/pci@1 at Device d Nothing there
Probing /pci@1f,0/pci@1/pci@1 at Device e ide disk cdrom
Probing /pci@1f,0/pci@1/pci@1 at Device f Nothing there

Netra t1 (UltraSPARC-IIi 440MHz), No Keyboard
OpenBoot 3.10.27 ME, 1024 MB memory installed, Serial #12731976.
Ethernet address 8:0:20:c2:46:48, Host ID: 80c24648.

ok

So finally boot the thing to the grub prompt :

ok
ok boot disk0
Boot device: /pci@1f,0/pci@1,1/scsi@2/disk@0,0 File and args:
GRUB Loading kernel...
Welcome to GRUB!

Here I select the "Advanced options for Debian GNU/Linux" and then I
want "Debian GNU/Linux, with Linux 5.10.0-1-sparc64 (recovery mode)" but
I want this to be verbose :

linux /vmlinux-5.10.0-1-sparc64 root=UUID=3e726aa3-b\ 1f1-4d65-be35-1436d9a04164 ro single verbose

Then thankfully CTRL-X does work fine :

[ 0.000150] PROMLIB: Sun IEEE Boot Prom 'OBP 3.10.27 2000/06/22 16:45'
[ 0.000244] PROMLIB: Root node compatible: sun4u
[ 0.000543] Linux version 5.10.0-1-sparc64
(debian-kernel@lists.debian.org) (gcc-10 (Debian 10.2.1-5) 10.2.1
20210108, GNU ld (GNU Binutils for Debian) 2.35.1) #1 Debian 5.10.5-1 (2021-01-09)
[ 0.336935] printk: bootconsole [earlyprom0] enabled
[ 0.396183] ARCH: SUN4U
[ 0.425389] Ethernet address: 08:00:20:c2:46:48
[ 0.479389] MM: PAGE_OFFSET is 0xfffff80000000000 (max_phys_bits == 40)
[ 0.558385] MM: VMALLOC [0x0000000100000000 --> 0x0000060000000000]
[ 0.633248] MM: VMEMMAP [0x0000060000000000 --> 0x00000c0000000000]
[ 0.711344] Kernel: Using 4 locked TLB entries for main kernel image.
[ 0.787508] Remapping the kernel...
[ 0.833324] done.
[ 0.376682] OF stdout device is: /pci@1f,0/pci@1,1/ebus@1/su@14,3803f8
[ 0.453816] PROM: Built device tree with 87107 bytes of memory.
[ 0.524799] Top of RAM: 0x3fec0000, Total RAM: 0x3feb4000
[ 0.588983] Memory hole size: 0MB
[ 0.639857] Allocated 16384 bytes for kernel page tables.
[ 0.703643] Zone ranges:
[ 0.733615] Normal [mem 0x0000000000000000-0x000000003febffff]
[ 0.807437] Movable zone start for each node
[ 0.858386] Early memory node ranges
[ 0.901022] node 0: [mem 0x0000000000000000-0x000000003fdfdfff]
[ 0.975890] node 0: [mem 0x000000003fe00000-0x000000003fe9ffff]
[ 1.050750] node 0: [mem 0x000000003feaa000-0x000000003febffff]
[ 1.125704] Zeroed struct page in unavailable ranges: 166 pages
[ 1.125719] Initmem setup node 0 [mem
0x0000000000000000-0x000000003febffff]
[ 1.344220] Booting Linux...
[ 1.377766] CPU CAPS: [flush,stbar,swap,muldiv,v9,mul32,div32,v8plus]
[ 1.454698] CPU CAPS: [vis]
[ 1.497630] Built 1 zonelists, mobility grouping on. Total pages: 129755
[ 1.577923] Kernel command line: BOOT_IMAGE=/vmlinux-5.10.0-1-sparc64 root=UUID=3e726aa3-b1f1-4d65-be35-1436d9a04164 ro single verbose
[ 1.736779] Dentry cache hash table entries: 131072 (order: 7,
1048576 bytes, linear)
[ 1.833540] Inode-cache hash table entries: 65536 (order: 6, 524288
bytes, linear)
[ 1.923125] Sorting __ex_table...
[ 1.964854] mem auto-init: stack:off, heap alloc:on, heap free:off
[ 2.108513] Memory: 994048K/1047248K available (8078K kernel code,
1425K rwdata, 2152K rodata, 496K init, 405K bss, 53200K reserved, 0K cma-reserved)
[ 2.267971] random: get_random_u64 called from __kmem_cache_create+0x20/0x420 with crng_init=0
[ 2.270696] SLUB: HWalign=32, Order=0-3, MinObjects=0, CPUs=1, Nodes=1
[ 2.451388] ftrace: allocating 25693 entries in 51 pages
[ 2.579233] ftrace: allocated 51 pages with 4 groups
[ 2.669744] NR_IRQS: 2048, nr_irqs: 2048, preallocated irqs: 1
[ 2.739236] clocksource: tick: mask: 0xffffffffffffffff max_cycles: 0x657b4868ff, max_idle_ns: 440795220381 ns
[ 2.858200] clocksource: mult[245cb72] shift[24]
[ 2.913305] clockevent: mult[70a500d6] shift[32]
[ 2.968675] Console: colour dummy device 80x25
[ 3.021504] printk: console [tty0] enabled
[ 3.070323] printk: bootconsole [earlyprom0] disabled
[ 0.000150] PROMLIB: Sun IEEE Boot Prom 'OBP 3.10.27 2000/06/22 16:45'
[ 0.000244] PROMLIB: Root node compatible: sun4u
[ 0.000543] Linux version 5.10.0-1-sparc64
(debian-kernel@lists.debian.org) (gcc-10 (Debian 10.2.1-5) 10.2.1
20210108, GNU ld (GNU Binutils for Debian) 2.35.1) #1 Debian 5.10.5-1 (2021-01-09)
[ 0.336935] printk: bootconsole [earlyprom0] enabled
[ 0.396183] ARCH: SUN4U
[ 0.425389] Ethernet address: 08:00:20:c2:46:48
[ 0.479389] MM: PAGE_OFFSET is 0xfffff80000000000 (max_phys_bits == 40)
[ 0.558385] MM: VMALLOC [0x0000000100000000 --> 0x0000060000000000]
[ 0.633248] MM: VMEMMAP [0x0000060000000000 --> 0x00000c0000000000]
[ 0.711344] Kernel: Using 4 locked TLB entries for main kernel image.
[ 0.787508] Remapping the kernel...
[ 0.833324] done.
[ 0.376682] OF stdout device is: /pci@1f,0/pci@1,1/ebus@1/su@14,3803f8
[ 0.453816] PROM: Built device tree with 87107 bytes of memory.
[ 0.524799] Top of RAM: 0x3fec0000, Total RAM: 0x3feb4000
[ 0.588983] Memory hole size: 0MB
[ 0.639857] Allocated 16384 bytes for kernel page tables.
[ 0.703643] Zone ranges:
[ 0.733615] Normal [mem 0x0000000000000000-0x000000003febffff]
[ 0.807437] Movable zone start for each node
[ 0.858386] Early memory node ranges
[ 0.901022] node 0: [mem 0x0000000000000000-0x000000003fdfdfff]
[ 0.975890] node 0: [mem 0x000000003fe00000-0x000000003fe9ffff]
[ 1.050750] node 0: [mem 0x000000003feaa000-0x000000003febffff]
[ 1.125704] Zeroed struct page in unavailable ranges: 166 pages
[ 1.125719] Initmem setup node 0 [mem
0x0000000000000000-0x000000003febffff]
[ 1.344220] Booting Linux...
[ 1.377766] CPU CAPS: [flush,stbar,swap,muldiv,v9,mul32,div32,v8plus]
[ 1.454698] CPU CAPS: [vis]
[ 1.497630] Built 1 zonelists, mobility grouping on. Total pages: 129755
[ 1.577923] Kernel command line: BOOT_IMAGE=/vmlinux-5.10.0-1-sparc64 root=UUID=3e726aa3-b1f1-4d65-be35-1436d9a04164 ro single verbose
[ 1.736779] Dentry cache hash table entries: 131072 (order: 7,
1048576 bytes, linear)
[ 1.833540] Inode-cache hash table entries: 65536 (order: 6, 524288
bytes, linear)
[ 1.923125] Sorting __ex_table...
[ 1.964854] mem auto-init: stack:off, heap alloc:on, heap free:off
[ 2.108513] Memory: 994048K/1047248K available (8078K kernel code,
1425K rwdata, 2152K rodata, 496K init, 405K bss, 53200K reserved, 0K cma-reserved)
[ 2.267971] random: get_random_u64 called from __kmem_cache_create+0x20/0x420 with crng_init=0
[ 2.270696] SLUB: HWalign=32, Order=0-3, MinObjects=0, CPUs=1, Nodes=1
[ 2.451388] ftrace: allocating 25693 entries in 51 pages
[ 2.579233] ftrace: allocated 51 pages with 4 groups
[ 2.669744] NR_IRQS: 2048, nr_irqs: 2048, preallocated irqs: 1
[ 2.739236] clocksource: tick: mask: 0xffffffffffffffff max_cycles: 0x657b4868ff, max_idle_ns: 440795220381 ns
[ 2.858200] clocksource: mult[245cb72] shift[24]
[ 2.913305] clockevent: mult[70a500d6] shift[32]
[ 2.968675] Console: colour dummy device 80x25
[ 3.021504] printk: console [tty0] enabled
[ 3.070323] printk: bootconsole [earlyprom0] disabled
[ 3.210745] Calibrating delay using timer specific routine.. 882.26
BogoMIPS (lpj=1764532)
[ 3.210796] pid_max: default: 32768 minimum: 301
[ 3.211313] LSM: Security Framework initializing
[ 3.211575] Yama: disabled by default; enable with sysctl kernel.yama.*
[ 3.212365] AppArmor: AppArmor initialized
[ 3.212407] TOMOYO Linux initialized
[ 3.212711] Mount-cache hash table entries: 2048 (order: 1, 16384
bytes, linear)
[ 3.212807] Mountpoint-cache hash table entries: 2048 (order: 1,
16384 bytes, linear)
[ 3.220341] devtmpfs: initialized
[ 3.227023] Performance events: No support for PMU type 'ultra12'
[ 3.228666] clocksource: jiffies: mask: 0xffffffff max_cycles:
0xffffffff, max_idle_ns: 7645041785100000 ns
[ 3.228742] futex hash table entries: 256 (order: -1, 6144 bytes, linear)
[ 3.231307] NET: Registered protocol family 16
[ 3.244320] IRQ pre handler NOT supported.
[ 3.246594] audit: initializing netlink subsys (disabled)
[ 3.248194] thermal_sys: Registered thermal governor 'fair_share'
[ 3.248208] thermal_sys: Registered thermal governor 'step_wise'
[ 3.254717] pci@1f,0: PCI IO [io 0x1fe02000000-0x1fe02ffffff] offset 1fe02000000
[ 3.254798] pci@1f,0: PCI MEM [mem 0x1ff00000000-0x1ffffffffff]
offset 1ff00000000
[ 3.254868] audit: type=2000 audit(0.128:1): state=initialized audit_enabled=0 res=1
[ 3.254940] pci@1f,0: SABRE PCI Bus Module ver[0:0]
[ 3.255006] PCI: Scanning PBM /pci@1f,0
[ 3.256591] sabre f00694e0: PCI host bridge to bus 0000:00
[ 3.256653] pci_bus 0000:00: root bus resource [io 0x1fe02000000-0x1fe02ffffff] (bus address [0x0000-0xffffff])
[ 3.256709] pci_bus 0000:00: root bus resource [mem 0x1ff00000000-0x1ffffffffff] (bus address [0x00000000-0xffffffff])
[ 3.256765] pci_bus 0000:00: root bus resource [bus 00-03]
[ 3.256875] pci 0000:00:01.1: [108e:5000] type 01 class 0x060400
[ 3.257431] pci_bus 0000:01: extended config space not accessible
[ 3.258733] pci 0000:01:01.0: [108e:1000] type 00 class 0x068000
[ 3.259220] pci 0000:01:01.1: [108e:1001] type 00 class 0x020000
[ 3.259274] pci 0000:01:01.1: reg 0x10: [mem 0x1ffe0000000-0x1ffe000701f]
[ 3.259687] pci 0000:01:02.0: [1000:000f] type 00 class 0x010000
[ 3.259738] pci 0000:01:02.0: reg 0x10: [io 0x1fe02c00000-0x1fe02c000ff]
[ 3.259778] pci 0000:01:02.0: reg 0x14: [mem 0x1ffe0008000-0x1ffe00080ff]
[ 3.259817] pci 0000:01:02.0: reg 0x18: [mem 0x1ffe000a000-0x1ffe000afff]
[ 3.260277] pci 0000:01:03.1: [108e:1001] type 00 class 0x020000
[ 3.260328] pci 0000:01:03.1: reg 0x10: [mem 0x1ffe0010000-0x1ffe001701f]
[ 3.260732] pci 0000:00:01.0: [108e:5000] type 01 class 0x060400
[ 3.261269] pci_bus 0000:02: extended config space not accessible
[ 3.261757] pci 0000:02:01.0: [1011:0022] type 01 class 0x060400
[ 3.262381] pci_bus 0000:03: extended config space not accessible
[ 3.262745] pci 0000:02:01.0: PCI bridge to [bus 03]
[ 3.262795] pci 0000:02:01.0: bridge window [io 0x1fe02001000-0x1fe02001fff]
[ 3.262910] pci 0000:03:0e.0: [1095:0646] type 00 class 0x01018f
[ 3.262955] pci 0000:03:0e.0: reg 0x10: [io 0x1fe02001000-0x1fe02001007]
[ 3.262994] pci 0000:03:0e.0: reg 0x14: [io 0x1fe02001008-0x1fe0200100b]
[ 3.263053] pci 0000:03:0e.0: reg 0x18: [io 0x1fe02001010-0x1fe02001017]
[ 3.263095] pci 0000:03:0e.0: reg 0x1c: [io 0x1fe02001018-0x1fe0200101b]
[ 3.263133] pci 0000:03:0e.0: reg 0x20: [io 0x1fe02001020-0x1fe0200102f]
[ 3.274131] HugeTLB registered 64.0 KiB page size, pre-allocated 0 pages
[ 3.274188] HugeTLB registered 8.00 MiB page size, pre-allocated 0 pages
[ 3.274224] HugeTLB registered 256 MiB page size, pre-allocated 0 pages
[ 3.274261] HugeTLB registered 2.00 GiB page size, pre-allocated 0 pages
[ 8.656595] iommu: Default domain type: Translated
[ 8.657029] vgaarb: loaded
[ 8.659927] NetLabel: Initializing
[ 8.659967] NetLabel: domain hash size = 128
[ 8.659993] NetLabel: protocols = UNLABELED CIPSOv4 CALIPSO
[ 8.660225] NetLabel: unlabeled traffic allowed by default
[ 8.660926] /pci@1f,0/pci@1,1/ebus@1/eeprom@14,0: Mostek regs at 0x1fff1000000
[ 8.661876] AUXIO: Found device at
/pci@1f,0/pci@1,1/ebus@1/auxio@14,726000
[ 8.662458] clocksource: Switched to clocksource tick
[ 8.832572] VFS: Disk quotas dquot_6.6.0
[ 8.832961] VFS: Dquot-cache hash table entries: 1024 (order 0, 8192
bytes)
[ 8.835397] AppArmor: AppArmor Filesystem Enabled
[ 8.855565] NET: Registered protocol family 2
[ 8.857724] tcp_listen_portaddr_hash hash table entries: 512 (order:
0, 8192 bytes, linear)
[ 8.858120] TCP established hash table entries: 8192 (order: 3, 65536
bytes, linear)
[ 8.858854] TCP bind hash table entries: 8192 (order: 3, 65536 bytes, linear)
[ 8.859289] TCP: Hash tables configured (established 8192 bind 8192)
[ 8.859871] UDP hash table entries: 512 (order: 1, 16384 bytes, linear)
[ 8.860089] UDP-Lite hash table entries: 512 (order: 1, 16384 bytes,
linear)
[ 8.860909] NET: Registered protocol family 1
[ 8.860988] NET: Registered protocol family 44
[ 8.861206] PCI: CLS 64 bytes, default 64
[ 8.861898] Trying to unpack rootfs image as initramfs...
[ 12.212477] Freeing initrd memory: 22096K
[ 12.213039] power: Control reg at 1fff1724000
[ 12.216912] Initialise system trusted keyrings
[ 12.217110] Key type blacklist registered
[ 12.217806] workingset: timestamp_bits=46 max_order=17 bucket_order=0
[ 12.237080] zbud: loaded
[ 12.239572] integrity: Platform Keyring initialized
[ 12.239627] Key type asymmetric registered
[ 12.239663] Asymmetric key parser 'x509' registered
[ 12.239818] Block layer SCSI generic (bsg) driver version 0.4 loaded
(major 252)
[ 12.240307] io scheduler mq-deadline registered
[ 12.244281] f006b9fc: ttyS0 at MMIO 0x1fff13803f8 (irq = 5, base_baud
= 115387) is a 16550A
[ 12.244343] Console: ttyS0 (SU)
[ 22.827690] printk: console [ttyS0] enabled
[ 22.883564] f006d240: ttyS1 at MMIO 0x1fff13602f8 (irq = 6, base_baud
= 115387) is a 16550A
[ 22.996286] mousedev: PS/2 mouse device common for all mice
[ 23.074818] rtc-m48t59 rtc-m48t59.0: IRQ index 0 not found
[ 23.147559] rtc-m48t59 rtc-m48t59.0: registered as rtc0
[ 23.216279] rtc-m48t59 rtc-m48t59.0: hctosys: unable to read the
hardware clock
[ 23.312888] ledtrig-cpu: registered to indicate activity on CPUs
[ 23.393792] NET: Registered protocol family 10
[ 23.582874] Segment Routing with IPv6
[ 23.631448] mip6: Mobile IPv6
[ 23.670441] NET: Registered protocol family 17
[ 23.729161] mpls_gso: MPLS GSO support
[ 23.778922] registered taskstats version 1
[ 23.832799] Loading compiled-in X.509 certificates
[ 25.123021] Loaded X.509 cert 'Debian Secure Boot CA: 6ccece7e4c6c0d1f6149f3dd27dfcc5cbb419ea1'
[ 25.237711] Loaded X.509 cert 'Debian Secure Boot Signer 2020:
00b55eb3b9'
[ 25.328557] zswap: loaded using pool lzo/zbud
[ 25.386968] Key type ._fscrypt registered
[ 25.439661] Key type .fscrypt registered
[ 25.491185] Key type fscrypt-provisioning registered
[ 25.556954] AppArmor: AppArmor sha1 policy hashing enabled
[ 25.638272] This architecture does not have kernel memory protection.
[ 25.722972] Run /init as init process
Loading, please wait...
Starting version 247.2-4
[ 26.860422] sunhme: module verification failed: signature and/or
required key missing - tainting kernel
[ 27.060189] SCSI subsystem initialized
[ 27.336195] sym53c8xx 0000:01:02.0: enabling device (0146 -> 0147)
[ 27.464883] scsi host0: pata_cmd64x
[ 27.514792] hme 0000:01:01.1: enabling device (0000 -> 0002)
[ 27.593947] scsi host1: pata_cmd64x
[ 27.641107] sym0: <875> rev 0x3 at pci 0000:01:02.0 irq 13
[ 27.714970] ata1: PATA max MWDMA2 cmd 0x1fe02001000 ctl 0x1fe02001008
bmdma 0x1fe02001020 irq 15
[ 27.830604] ata2: PATA max MWDMA2 cmd 0x1fe02001010 ctl 0x1fe02001018
bmdma 0x1fe02001028 irq 15
[ 27.946826] sunhme.c:v3.10 August 26, 2008 David S. Miller (davem@davemloft.net)
[ 28.133905] sym0: No NVRAM, ID 7, Fast-20, SE, parity checking
[ 28.226805] eth0: HAPPY MEAL (PCI/CheerIO) 10/100BaseT Ethernet
[ 28.226820] 08:00:20:c2:46:48
[ 28.351955] sym0: SCSI BUS has been reset.
[ 28.405818] scsi host2: sym-2.2.3
[ 28.489885] hme 0000:01:03.1: enabling device (0000 -> 0002)
[ 28.604654] eth1: HAPPY MEAL (PCI/CheerIO) 10/100BaseT Ethernet
[ 28.604668] 08:00:20:c2:46:48
[ 28.754364] hme 0000:01:03.1 enp1s3f1: renamed from eth1
[ 28.826843] hme 0000:01:01.1 enp1s1f1: renamed from eth0
[ 31.470920] scsi 2:0:0:0: Direct-Access SEAGATE ST318404LSUN18G
5221 PQ: 0 ANSI: 3
[ 31.577549] scsi target2:0:0: tagged command queuing enabled, command
queue depth 16.
[ 31.680720] scsi target2:0:0: Beginning Domain Validation
[ 31.759929] scsi target2:0:0: FAST-20 WIDE SCSI 40.0 MB/s ST (50 ns,
offset 16)
[ 31.859847] scsi target2:0:0: Domain Validation skipping write tests
[ 31.943434] scsi target2:0:0: Ending Domain Validation
[ 32.014018] scsi 2:0:1:0: Direct-Access SEAGATE ST318404LSUN18G
8590 PQ: 0 ANSI: 3
[ 32.120590] scsi target2:0:1: tagged command queuing enabled, command
queue depth 16.
[ 32.223760] scsi target2:0:1: Beginning Domain Validation
[ 32.302948] scsi target2:0:1: FAST-20 WIDE SCSI 40.0 MB/s ST (50 ns,
offset 16)
[ 32.402922] scsi target2:0:1: Domain Validation skipping write tests
[ 32.486509] scsi target2:0:1: Ending Domain Validation
[ 36.476422] sd 2:0:0:0: Power-on or device reset occurred
[ 36.548364] sd 2:0:1:0: Power-on or device reset occurred
[ 36.620250] sd 2:0:0:0: [sda] 35378533 512-byte logical blocks: (18.1 GB/16.9 GiB)
[ 36.720040] sd 2:0:1:0: [sdb] 35378533 512-byte logical blocks: (18.1 GB/16.9 GiB)
[ 36.825662] sd 2:0:1:0: [sdb] Write Protect is off
[ 36.888823] sd 2:0:0:0: [sda] Write Protect is off
[ 36.953967] sd 2:0:1:0: [sdb] Write cache: disabled, read cache:
enabled, supports DPO and FUA
[ 37.067387] sd 2:0:0:0: [sda] Write cache: disabled, read cache:
enabled, supports DPO and FUA
[ 37.198270] sdb: sdb1 sdb2 sdb3 sdb4 sdb5 sdb6 sdb7 sdb8
[ 37.269752] sda: sda1 sda2 sda3 sda4
[ 37.331118] sd 2:0:0:0: [sda] Attached SCSI disk
[ 37.402899] sd 2:0:1:0: [sdb] Attached SCSI disk
[ 37.753276] random: fast init done
Begin: Loading essential drivers ... done.
Begin: Running /scripts/init-premount ... done.
Begin: Mounting root file system ... Begin: Running /scripts/local-top
... done.
Begin: Running /scripts/local-premount ... done.
Begin: Will now check root file system ... fsck from util-linux 2.36.1 [/sbin/fsck.ext4 (1) -- /dev/sda2] fsck.ext4 -a -C0 /dev/sda2
/dev/sda2: recovering journal
[ 88.816750] random: crng init done
/dev/sda2: Clearing orphaned inode 654139 (uid=0, gid=0, mode=0100644, size=36624)
/dev/sda2: Clearing orphaned inode 658478 (uid=0, gid=0, mode=0100644, size=153440)

[continued in next message]

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

From Frank Scheiner@21:1/5 to Dennis Clarke on Sat Mar 13 22:20:02 2021

Hi Dennis,

On 13.03.21 20:21, Dennis Clarke wrote:

On 3/13/21 5:29 PM, Mike Tremaine wrote:

On Mar 12, 2021, at 5:56 AM,
Dennis Clarke <dclarke@blastwave.org> wrote:

[...]
I did sent a BRK to the serial port and that drops us into the firmware
"ok" prompt. There is a failed fan but in fact the fan is entirely not there. At all. I removed it because it had failed five or six years ago
and getting another one is just annoying. Also it is not really needed.

Is the heatsink on the board cooled by a chassis then?

We can see that there is 1G of ECC memory and the memory passes all the
basic tests.

Now I setup a few of the firmware variables and reset the unit :

ok printenv
Variable Name Value Default Value

[...]
local-mac-address? false false
[...]

ceres# ip link show
1: lo: <LOOPBACK,UP,LOWER_UP> mtu 65536 qdisc noqueue state UNKNOWN mode DEFAULT group default qlen 1000
link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00
2: enp1s1f1: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc pfifo_fast state UNKNOWN mode DEFAULT group default qlen 1000
link/ether 08:00:20:c2:46:48 brd ff:ff:ff:ff:ff:ff
3: enp1s3f1: <BROADCAST,MULTICAST> mtu 1500 qdisc noop state DOWN mode DEFAULT group default qlen 1000
link/ether 08:00:20:c2:46:48 brd ff:ff:ff:ff:ff:ff
ceres#

However there must be a bug somewhere because the physical MAC address
is the same on both interfaces.

This is due to `local-mac-address?` set to `false` in OBP. See e.g. [1]
for details.

[1]: https://docs.oracle.com/cd/E36784_01/html/E37475/eyprp.html

Cheers,
Frank

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

From Mike Tremaine@21:1/5 to All on Sun Mar 14 18:20:02 2021

Let’s assume it’s not hardware, Dennis has posted the tests and states the machine ran Sol10 fine. My only ideas are

1) Try using apt to update some individual packages to see if that even works. Try dash and bash and whatever but avoid Systemd and any related libraries.

2a) If those succeed trying update systemd and see if causes the crash.

or

2b) Trying re-exec systemd I think “kill 1” does that these days.

If you can isolate that it is systemd related the question why that, is it something in the Dbus or some other subsystem.???

In a month or so I’ll be finally going to storage and I’d happy to grab my Netra t105 and play along at that point, it would interesting to know if this issue is specific to the Netra series.

-Mike

On Mar 13, 2021, at 12:58 PM, Frank Scheiner <frank.scheiner@web.de> wrote:

Hi Dennis,

On 13.03.21 20:21, Dennis Clarke wrote:

On 3/13/21 5:29 PM, Mike Tremaine wrote:

On Mar 12, 2021, at 5:56 AM,
Dennis Clarke <dclarke@blastwave.org> wrote:

[...]
I did sent a BRK to the serial port and that drops us into the firmware
"ok" prompt. There is a failed fan but in fact the fan is entirely not
there. At all. I removed it because it had failed five or six years ago
and getting another one is just annoying. Also it is not really needed.

Is the heatsink on the board cooled by a chassis then?

We can see that there is 1G of ECC memory and the memory passes all the
basic tests.

Now I setup a few of the firmware variables and reset the unit :

ok printenv
Variable Name Value Default Value

[...]
local-mac-address? false false
[...]

ceres# ip link show
1: lo: <LOOPBACK,UP,LOWER_UP> mtu 65536 qdisc noqueue state UNKNOWN mode
DEFAULT group default qlen 1000
link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00
2: enp1s1f1: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc pfifo_fast
state UNKNOWN mode DEFAULT group default qlen 1000
link/ether 08:00:20:c2:46:48 brd ff:ff:ff:ff:ff:ff
3: enp1s3f1: <BROADCAST,MULTICAST> mtu 1500 qdisc noop state DOWN mode
DEFAULT group default qlen 1000
link/ether 08:00:20:c2:46:48 brd ff:ff:ff:ff:ff:ff
ceres#

However there must be a bug somewhere because the physical MAC address
is the same on both interfaces.

This is due to `local-mac-address?` set to `false` in OBP. See e.g. [1]
for details.

[1]: https://docs.oracle.com/cd/E36784_01/html/E37475/eyprp.html

Cheers,
Frank

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

From John Paul Adrian Glaubitz@21:1/5 to Mike Tremaine on Sun Mar 14 18:40:01 2021

On 3/14/21 5:55 PM, Mike Tremaine wrote:

Let’s assume it’s not hardware, Dennis has posted the tests and states the machine ran Sol10 fine.

The fact that Solaris runs fine can be an indicator the hardware is okay, but it's not a proper verification that it's actually the case.

For example, if one of the memory modules is bad, it could happen that the error
shows on Linux but not on Solaris because both allocate different memory regions
right after the machine has started.

So, if, for example, you want to verify that the memory is okay, you should run a memtest program.

Adrian

--
.''`. John Paul Adrian Glaubitz
: :' : Debian Developer - glaubitz@debian.org
`. `' Freie Universitaet Berlin - glaubitz@physik.fu-berlin.de
`- GPG: 62FF 8A75 84E0 2956 9546 0006 7426 3B37 F5B5 F913

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

From John Paul Adrian Glaubitz@21:1/5 to Frank Scheiner on Sun Mar 14 19:10:01 2021

On 3/14/21 6:48 PM, Frank Scheiner wrote:

So, if, for example, you want to verify that the memory is okay, you should run
a memtest program.

...the built-in (memory) diagnostics of Sun machines are pretty
thorough. This is not a PC. :-)

I doubt that the hardware runs a thorough memory test by default that
can be compared to a full memtest86 test run.

Either way, if the kernel breaks for someone, they will have to bisect the issue. I don't have any means in bisecting a problem if I cannot reproduce
it in the first place.

Adrian

--
.''`. John Paul Adrian Glaubitz
: :' : Debian Developer - glaubitz@debian.org
`. `' Freie Universitaet Berlin - glaubitz@physik.fu-berlin.de
`- GPG: 62FF 8A75 84E0 2956 9546 0006 7426 3B37 F5B5 F913

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

From Dennis Clarke@21:1/5 to John Paul Adrian Glaubitz on Mon Mar 15 00:10:01 2021

On 3/14/21 5:52 PM, John Paul Adrian Glaubitz wrote:

On 3/14/21 6:48 PM, Frank Scheiner wrote:

So, if, for example, you want to verify that the memory is okay, you should run
a memtest program.

...the built-in (memory) diagnostics of Sun machines are pretty
thorough. This is not a PC. :-)

I doubt that the hardware runs a thorough memory test by default that
can be compared to a full memtest86 test run.

The probability that there is a memory hardware fault after the ECC
memory tests done during POST would be very very low. So close to zero
that I can not even begin to guess how a memory fault would slip past
those ECC diagnostics. Those run for quite a while and I have never
seen evidence that there was a problem.

See : https://lists.debian.org/debian-sparc/2021/03/msg00026.html

Regardless we are just going in circles.

I don't know if this is a kernel problem or what. I only know that
something goes terribly wrong and it may be a systemd related problem.

I think Frank Scheiner made some suggestions and I will go and give a
try at isolating the issue.

Either way, if the kernel breaks for someone, they will have to bisect the issue. I don't have any means in bisecting a problem if I cannot reproduce
it in the first place.

I agree completely.

Dennis

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

From Anatoly Pugachev@21:1/5 to dclarke@blastwave.org on Mon Mar 15 10:50:01 2021

On Fri, Mar 12, 2021 at 5:27 PM Dennis Clarke <dclarke@blastwave.org> wrote:

I have seen this for a few months now. The old old netra machine will
run just fine endlessly but if I attempt to perform a package update
then I am always assured to see :

ceres# apt-get update
Get:1 http://deb.debian.org/debian-ports sid InRelease [55.3 kB]
Get:2 http://deb.debian.org/debian-ports sid/main sparc64 Packages [21.6 MB] Get:3 http://deb.debian.org/debian-ports sid/main all Packages [8,682
kB]
Fetched 30.3 MB in 1min 24s (361 kB/s)

Reading package lists... Done
ceres#

Then try "upgrade" and the machine drops off the network :

Setting up systemd (247.3-1) ...
Timeout, server 172.16.35.61 not responding.

Dennis,

did you tried to test machine with stress-ng ? There's a lot of tests
in it, it could trigger your issue and probably would be easier to
hunt down the issue.

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

Who's Online
Recent Visitors
- Michal Wronka
  Wed Apr 24 14:13:57 2024
  from Wroclaw, Poland via SSH
- Michal Wronka
  Wed Apr 24 14:02:51 2024
  from Wroclaw, Poland via SSH
- Guest
  Wed Apr 24 01:40:10 2024
  from A via Telnet
- Bob Worm
  Thu Apr 25 11:52:12 2024
  from Wales, Uk via Telnet

System Info

Sysop:	Keyop
Location:	Huddersfield, West Yorkshire, UK
Users:	296
Nodes:	16 (3 / 13)
Uptime:	49:23:29
Calls:	6,649
Calls today:	1
Files:	12,200
Messages:	5,330,097

watchdog: BUG: soft lockup - CPU#0 stuck for 23s! [systemd:1]

Who's Online

Recent Visitors

System Info