On 19/03/24 at 15:43, Jesper Dybdal wrote:Good idea! That should of course be done first. It's running now.
My plan is to boot a rescue disk and mount that partition read-only.
Then:
* If the file looks ok after reboot, then I'll strongly suspect the
RAM - and run memtest.
* Otherwise, I'll have to run fsck and see what happens.
kernel version:
root@nuser:~# uname -a
Linux nuser 5.10.0-28-amd64 #1 SMP Debian 5.10.209-2 (2024-01-31)
x86_64 GNU/Linux
The partition in question is a RAID 1 controlled by md.
Another check you can perform it is on the RAID array, by default it
runs on the first Sunday of each month at 00:57. You should have this
file /etc/cron.d/mdadm that takes care to run this check monthly.
Before you reboot, does it look OK /proc/mdstat ?Yes, it seems ok.
On 19/03/24 at 15:43, Jesper Dybdal wrote:
My plan is to boot a rescue disk and mount that partition read-only.
Then:
* If the file looks ok after reboot, then I'll strongly suspect the
RAM - and run memtest.
* Otherwise, I'll have to run fsck and see what happens.
kernel version:
root@nuser:~# uname -a
Linux nuser 5.10.0-28-amd64 #1 SMP Debian 5.10.209-2 (2024-01-31)
x86_64 GNU/Linux
The partition in question is a RAID 1 controlled by md.
Another check you can perform it is on the RAID array, by default it
runs on the first Sunday of each month at 00:57. You should have this
file /etc/cron.d/mdadm that takes care to run this check monthly.
Before you reboot, does it look OK /proc/mdstat ?
On Wed, Mar 20, 2024, 11:28 AM Jesper Dybdal
<jd-debian-user@dybdal.dk> wrote:
I have now done the following:
* Checked the RAID array - no problems found.
* Run fsck. It found three cases of the block count being
incorrect. I
don't know which the other two affected files are.
* Run one pass of memtest86+. Nothing found.
So it seems not to be a problem with the disks.
A bug in ext4? Well, ext4 has always done its job for me wihtout
problems.
A RAM error that memtest86+ did not find? Possible. Once upon a
time,
when you bought an ordinary pc, its RAM had ECC as a matter of
course;
unfortunately, that is not the case nowadays.
I think I'll let memtest86+ run overnight one of the coming nights.
Unless it is simply a RAM error, then it is a bit scary...
I have seen that a couple times, unlikely but possible. Maybe review
your RAM configuration too, ensure that the sticks are on the same
supported refresh rate and distributed across the slots in an approved
way.
Regards,
Jesper
On 2024-03-19 21:47, Franco Martelli wrote:
> On 19/03/24 at 15:43, Jesper Dybdal wrote:
>
>>
>> My plan is to boot a rescue disk and mount that partition
read-only.
>> Then:
>> * If the file looks ok after reboot, then I'll strongly suspect
the
>> RAM - and run memtest.
>> * Otherwise, I'll have to run fsck and see what happens.
>>
>> kernel version:
>> root@nuser:~# uname -a
>> Linux nuser 5.10.0-28-amd64 #1 SMP Debian 5.10.209-2 (2024-01-31)
>> x86_64 GNU/Linux
>>
>> The partition in question is a RAID 1 controlled by md.
>
> Another check you can perform it is on the RAID array, by
default it
> runs on the first Sunday of each month at 00:57. You should have
this
> file /etc/cron.d/mdadm that takes care to run this check monthly.
>
> Before you reboot, does it look OK /proc/mdstat ?
>
--
Jesper Dybdal
https://www.dybdal.dk
On Wed, Mar 20, 2024, 11:28 AM Jesper Dybdal
<jd-debian-user@dybdal.dk> wrote:
I think I'll let memtest86+ run overnight one of the coming nights.
Unless it is simply a RAM error, then it is a bit scary...
I have seen that a couple times, unlikely but possible. Maybe review
your RAM configuration too, ensure that the sticks are on the same
supported refresh rate and distributed across the slots in an approved
way.
[Sorry - I accidentally sent this too quickly in an incomplete state.
Second try here:]
On Wed, Mar 20, 2024, 11:28 AM Jesper Dybdal
<jd-debian-user@dybdal.dk> wrote:
I think I'll let memtest86+ run overnight one of the coming nights.
Unless it is simply a RAM error, then it is a bit scary...
I've now let memtest86+ run for 9 hours, during which it did 14 passes
of all its tests. It found nothing wrong.
On 2024-03-20 22:58, Nicholas Geovanis wrote:
I have seen that a couple times, unlikely but possible. Maybe review
your RAM configuration too, ensure that the sticks are on the same supported refresh rate and distributed across the slots in an approved
way.
There is only one RAM stick (of 16 GB), so there should be no problems
of that kind.
I'm afraid I won't find an explanation of that file system corruption :-(
Thanks to Franco and Nicholas for your responses,
Jesper
Am Donnerstag, 28. März 2024, 14:49:37 CET schrieb Jesper Dybdal:
Hello,
memtest86+ is for testing RAM, but do you not want to test ext4 filesystem?
If so, I suggest to boot a live system like Knoppix or similar, then run your test by using
e2fsck -y /dev/sda1
or wherever your filesystem resides.
Please pay attention: If you have encrypted filesystems, then first open the encryption, do NOT mount the filesystem and then check it, for example:
cryptsetup luksOpen /dev/sda1 data1
then enter the password and now you can run
e2fsck -y /dev/mapper/data1
Note: the word "data1" is only an example, you can name it, whatever you want like "space", "soap", "bullet", "henry" or whatever.
Hope this helps.
Best
Hans
[Sorry - I accidentally sent this too quickly in an incomplete state.
Second try here:]
On Wed, Mar 20, 2024, 11:28 AM Jesper DybdalI've now let memtest86+ run for 9 hours, during which it did 14 passes
<jd-debian-user@dybdal.dk> wrote:
I think I'll let memtest86+ run overnight one of the coming nights. >>>
Unless it is simply a RAM error, then it is a bit scary...
of all its tests. It found nothing wrong.
On 2024-03-20 22:58, Nicholas Geovanis wrote:
I have seen that a couple times, unlikely but possible. Maybe reviewThere is only one RAM stick (of 16 GB), so there should be no problems
your RAM configuration too, ensure that the sticks are on the same
supported refresh rate and distributed across the slots in an approved
way.
of that kind.
I'm afraid I won't find an explanation of that file system corruption :-(
Thanks to Franco and Nicholas for your responses,
Jesper
Sorry - I should have left more of the previous mails quoted. I have previously tested the RAID1 consistency (ok), fixed the file system
(found 3 files with incorrect block count), and now also tested the
RAM.And since it seems unlikely that it is a bug in ext4 (in Debian Bullseye), I don't quite understand how such an inconsistency can occur. Thanks for your response, Jesper
If so, I suggest to boot a live system like Knoppix or similar, then run your test by using
e2fsck -y /dev/sda1
or wherever your filesystem resides.
Please pay attention: If you have encrypted filesystems, then first open the encryption, do NOT mount the filesystem and then check it, for
example:
cryptsetup luksOpen /dev/sda1 data1
then enter the password and now you can run
e2fsck -y /dev/mapper/data1
Note: the word "data1" is only an example, you can name it, whatever you want like "space", "soap", "bullet", "henry" or whatever.
Hope this helps.
Best
Hans
[Sorry - I accidentally sent this too quickly in an incomplete state.
Second try here:]
On Wed, Mar 20, 2024, 11:28 AM Jesper Dybdal
<jd-debian-user@dybdal.dk> wrote:
I think I'll let memtest86+ run overnight one of the coming nights. >>>
Unless it is simply a RAM error, then it is a bit scary...
I've now let memtest86+ run for 9 hours, during which it did 14 passes
of all its tests. It found nothing wrong.
On 2024-03-20 22:58, Nicholas Geovanis wrote:
I have seen that a couple times, unlikely but possible. Maybe review
your RAM configuration too, ensure that the sticks are on the same
supported refresh rate and distributed across the slots in an approved >>> way.
There is only one RAM stick (of 16 GB), so there should be no problems
of that kind.
I'm afraid I won't find an explanation of that file system corruption :-( >>
Thanks to Franco and Nicholas for your responses,
Jesper
Sysop: | Keyop |
---|---|
Location: | Huddersfield, West Yorkshire, UK |
Users: | 297 |
Nodes: | 16 (2 / 14) |
Uptime: | 32:22:39 |
Calls: | 6,669 |
Calls today: | 1 |
Files: | 12,216 |
Messages: | 5,338,220 |