• Bug#1064475: lists.debian.org: missing recent posts in search indices.

    From James Addison@21:1/5 to All on Thu Feb 22 22:10:01 2024
    Package: lists.debian.org
    Severity: important

    Dear Maintainer,

    Some recent discussion from the Debian mailing lists appears to be missing
    from the mailing list search engine[1] indices.

    For example:

    * Searching for 'Debian'[2] in the debian-devel mailing list, results sorted
    by date, currently retrieves a most-recent result dated February of Y2023.

    * A wider search for 'Debian'[3] without specifying an individual mailing
    list currently retrieves results up to December of Y2023.

    Could you inspect the indexing hosts/processes to check whether posts are being indexed as expected?

    Thank you,
    James

    [1] - https://lists.debian.org/search.html

    [2] - https://lists.debian.org/cgi-bin/search?P=debian&B=Gdebian-devel&SORT=0

    [3] - https://lists.debian.org/cgi-bin/search?P=debian&SORT=0

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From James Addison@21:1/5 to All on Sat Mar 9 19:40:01 2024
    Package: lists.debian.org
    Followup-For: Bug #1064475
    X-Debbugs-Cc: cord@debian.org

    Hi Cord,

    Running a search for 'python removal' on the 'testing-changes' mailing list, ordered by most-recent-first, currently lacks any results from this year.

    https://lists.debian.org/cgi-bin/search?P=python+removal&DEFAULTOP=and&B=Gdebian-testing-changes&SORT=0&HITSPERPAGE=100&xP=python%09removal&xFILTERS=Gdebian-testing-changes%7E.%7E%7E0

    The most recent item that appears is:

    "Testing removal summary 2023-02-04 (Saturday) Debian testing watch"

    And an example item that I would expect to see included is:

    https://lists.debian.org/debian-testing-changes/2024/02/msg00054.html

    Regards,
    James

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Olly Betts@21:1/5 to James Addison on Sun Mar 10 22:40:02 2024
    Thanks for reporting this.

    On Sat, Mar 09, 2024 at 06:29:22PM +0000, James Addison wrote:
    Running a search for 'python removal' on the 'testing-changes' mailing list, ordered by most-recent-first, currently lacks any results from this year.

    https://lists.debian.org/cgi-bin/search?P=python+removal&DEFAULTOP=and&B=Gdebian-testing-changes&SORT=0&HITSPERPAGE=100&xP=python%09removal&xFILTERS=Gdebian-testing-changes%7E.%7E%7E0

    There are currently 7 shards in the lists database, but only the first 6
    were listed to be searched. It looks like indexing is working fine,
    except it started a new shard and failed to update the list to search.

    I've manually fixed this and now the search above gives me a top result
    of:

    Testing removal summary 2024-03-10 (Sunday)

    I really need to resolve why this didn't happen automatically or else
    this will go wrong again in the future.

    Cheers,
    Olly

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From James Addison@21:1/5 to Olly Betts on Mon Mar 11 11:30:01 2024
    Hi Olly,

    On Sun, 10 Mar 2024 at 20:42, Olly Betts <olly@survex.com> wrote:

    [ ... snip ... ]
    There are currently 7 shards in the lists database, but only the first 6
    were listed to be searched. It looks like indexing is working fine,
    except it started a new shard and failed to update the list to search.

    I've manually fixed this and now the search above gives me a top result
    of:

    Testing removal summary 2024-03-10 (Sunday)

    Thank you very much! The search results now look much better to me too.

    Regards,
    James

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)