Forum: >>> Magnum BBS <<<

[UDD] Is there any information about failed autopkgtest in UDD?

From Andreas Tille@21:1/5 to Antonio Terceiro on Thu Apr 9 17:00:02 2020

Hi Antonio,

greetings to Curitiba. ;-)
Hope you are fine!

On Thu, Apr 09, 2020 at 09:37:41AM -0300, Antonio Terceiro wrote:

On Thu, Apr 09, 2020 at 10:44:18AM +0200, Paul Gevers wrote:

Antonio, what do you think? If we expose the blacklist, I can also have britney consume it.

I think it could be exposed, yes. I'm currrently working on optimizing
the generation of all of that data, so I added to my TODO list and will
sneak that in.

Do you expect new important fields in the json file or do you intend to
change its structure in principle? It would be stupid for me to write a gatherer for UDD and once finished I learn that basic things might have changed.

Kind regards

Andreas.

--
http://fam-tille.de

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

From Antonio Terceiro@21:1/5 to Andreas Tille on Thu Apr 9 17:20:01 2020

On Thu, Apr 09, 2020 at 04:55:30PM +0200, Andreas Tille wrote:

Hi Antonio,

greetings to Curitiba. ;-)
Hope you are fine!

On Thu, Apr 09, 2020 at 09:37:41AM -0300, Antonio Terceiro wrote:

On Thu, Apr 09, 2020 at 10:44:18AM +0200, Paul Gevers wrote:

Antonio, what do you think? If we expose the blacklist, I can also have britney consume it.

I think it could be exposed, yes. I'm currrently working on optimizing
the generation of all of that data, so I added to my TODO list and will sneak that in.

Do you expect new important fields in the json file or do you intend to change its structure in principle?

No, the format will not change.

FWIW I concur with what Paul said regarding which fields from
packages.json you should not consider.

-----BEGIN PGP SIGNATURE-----

iQIzBAABCAAdFiEEst7mYDbECCn80PEM/A2xu81GC94FAl6PPKAACgkQ/A2xu81G C96zQQ/+OCChaYu3WiRskwaWLWgVFiyk3NgNZp6wYNp/6yigXGddaD/vrQbbyopp uwZKSFL+3ZPxnJP7D8kIFBwwhywZKxWgYrkDN46LFBUabbM0LVBh0ENrGjy7ZIE4 3ixFxStfoUeujdvxLLG42MAPGGAUFgxSBdoJYFBOppzuxTUQhEFcuGyCFOcMmDGI GWXIigFVcPrkcVoELm5tImMZJD7kfkcR34f7F8L2aJiOxs8jd3dM9KaBNpxGnhv+ TUr56ZLksy6Hdkn0Kr7x1oiveYLjMFCf/WXumFfluTjM0LDHZ4TjcoDr/fIITD2i FSMgdxe/gL9aVsOhpqBtIoZ1DFFdGtqo4mxN6XBhYMawUtZ4Wo/vuAHjvfDerq1R 3Oecdo268p95gEKddVoVOc7GFNnpnNiP2l7SrqGtc9Q7OO1MEv578rzLL3dhgN0E z0l9g561ji3vUN9j8/vvOI1B9T+JGa9/bPvHoEuH5QxXLrzs6cnqGiVEil4XChNU NxhH3/lTmdGvW6JsBintRDjXlKC6KZFkGNFtlbsXI8PgUtot7LKCY82ZjuS9i9il GKo+uCD9l4D5m/xCll0aYP+NyG1TbHgkEISx9lbhPYui1tIxxv+DYuwtEYxpeyOY QsZEF7hEzOZGvZTL3Sc1T2X/D2gBT77PiVEN4420i1JE+P9Q03o=
=7LT3
-----END PGP SIGNATURE-----

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

From Andreas Tille@21:1/5 to Paul Gevers on Thu Apr 9 23:00:01 2020

Hi Paul,

On Thu, Apr 09, 2020 at 10:44:18AM +0200, Paul Gevers wrote:

May be you consider some fields as really restricted to some
special applications and nobody would ever consider querying
UDD for it?

Yes, I wouldn't add blame (I don't think we add those anymore, it's
legacy, I only found two occurrences on amd64), created_at and
updated_at. I also don't think that worker and message are very useful,
but one never knows. duration_seconds and duration_human feel double and
the former is leaner for in a database.

So I probably go with the following keys (in Python syntax):

valid_keys = ( 'run_id',
# 'created_at', # Paul Gevers: should be ignored
# 'updated_at', # Paul Gevers: should be ignored
'suite',
'arch',
'package', # ----> should be renamed to 'source'
'version',
'trigger', # usually package.*version
'status',
'requestor', # 'britney', 'debci' or e-mail
'pin_packages', # []
# 'worker', # Paul Gevers: should be ignored (is 'null' anyway)
'date',
'duration_seconds',
'last_pass_date',
'last_pass_version',
'message', # see below
'previous_status',
# 'duration_human', # Paul Gevers: duration_seconds and duration_human feel double and the former is leaner for in a database
# 'blame', # Paul Gevers: should be ignored
)

# message can be
# $ grep '"message"' packages*.json | sed 's/^.*\.json: *//' | sort | uniq
# "message": "All tests passed" -> "status": "pass"
# "message": "Could not run tests due to a temporary testbed failure" -> "status": "tmpfail"
# "message": "elbrus" -> "status": "tmpfail"
# "message": "Erroneous package" -> "status": "fail"
# "message": null -> "status": "fail"
# "message": "No tests in this package or all skipped" -> "status": "neutral"
# "message": "Tests failed", -> "status": "fail"
# "message": "Tests failed, and at least one test skipped" -> "status": "fail"
# "message": "Tests passed, but at least one test skipped" -> "status": "pass"
# "message": "Unexpected autopkgtest exit code 20" -> "status": "tmpfail"

I agree that leaving out worker which is really always null makes sense
but I tend to leave message since leaving this out looks like loosing information. I tried to find a relation to status but it seems the same
status can result in different messages. I think just a field in addition
will not blow up UDD way more than it recently is - may be I consider
a normalised form, but usually UDD is not very normalised at all.

I wonder what might be the meaning of pin_packages which is always
equal [].

Kind regards

Andreas.

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

From Paul Gevers@21:1/5 to Andreas Tille on Fri Apr 10 21:10:02 2020

This is an OpenPGP/MIME signed message (RFC 4880 and 3156) --ExrphwlnfOPqm0Hm0SPsnvDjbEV531VMg
Content-Type: text/plain; charset=utf-8
Content-Language: en-US
Content-Transfer-Encoding: quoted-printable

Hi Andreas,

On 09-04-2020 22:53, Andreas Tille wrote:

valid_keys = ( 'run_id',
# 'created_at', # Paul Gevers: should be ignored
# 'updated_at', # Paul Gevers: should be ignored
'suite',
'arch',
'package', # ----> should be renamed to 'source'
'version',
'trigger', # usually package.*version

I expected you to mostly see "" or "migration-reference/0" here, with
some hand crafted text from random DD's.

'status',
'requestor', # 'britney', 'debci' or e-mail

Debian login to be precise, not e-mail.

'pin_packages', # []

Since a couple of months this json and other pages only show "pure"
suite runs, to pin_packages is always empty. pin_packages contains which packages are taken from another suite than the base suite.

# 'worker', # Paul Gevers: should be ignored (is 'null' anyway)

Oh, bug somewhere I guess.

'date',
'duration_seconds',
'last_pass_date',
'last_pass_version',
'message', # see below
'previous_status',
# 'duration_human', # Paul Gevers: duration_seconds and duration_human feel double and the former is leaner for in a database
# 'blame', # Paul Gevers: should be ignored
)

# message can be
# $ grep '"message"' packages*.json | sed 's/^.*\.json: *//' | sort | uniq # "message": "All tests passed" -> "status": "pass"
# "message": "Could not run tests due to a temporary testbed failure" -> "status": "tmpfail"
# "message": "elbrus" -> "status": "tmpfail"
# "message": "Erroneous package" -> "status": "fail"
# "message": null -> "status": "fail"
# "message": "No tests in this package or all skipped" -> "status": "neutral"
# "message": "Tests failed", -> "status": "fail"
# "message": "Tests failed, and at least one test skipped" -> "status": "fail"
# "message": "Tests passed, but at least one test skipped" -> "status": "pass"
# "message": "Unexpected autopkgtest exit code 20" -> "status": "tmpfail"

I agree that leaving out worker which is really always null makes sense
but I tend to leave message since leaving this out looks like loosing information. I tried to find a relation to status but it seems the same status can result in different messages. I think just a field in addition will not blow up UDD way more than it recently is - may be I consider
a normalised form, but usually UDD is not very normalised at all.

In general this is the final output from autopkgtest. But, as you see my
name there, I had to clean up several times and to be able to find those
back, I added my nick to the message. The list thus may change over time.

Paul

--ExrphwlnfOPqm0Hm0SPsnvDjbEV531VMg--

-----BEGIN PGP SIGNATURE-----

iQEzBAEBCAAdFiEEWLZtSHNr6TsFLeZynFyZ6wW9dQoFAl6Qw3wACgkQnFyZ6wW9 dQqtbAf7ByZCprwWtpt/Vg5jAMSXwjaGnqVEhYsc/yVEapxBlNFXBrMy4Cma68SE 11AuzJPIpcvj3eW4g9ZEL+Bd75QM1/3ugw062bqQNxSPSJQS9aLzUbDXeN6W0K+l nm4pvweeoJUBy90JHAts68MKpFbv+8US1ziSm+HaV6k/okkf1dgbV7z5TCdn1fqN hCmk7ho0Yke2cWvsOhdfJEOOl7idQl3zy1MCXoYlB9zG8nCHkjClE9axJknIKCfr O+/wRrNkiXwtQv9m4gaCwJwmhQXRKnRc9wC0jbKJSI0onkIKw6cGq2Cng7fCdEej c4nSDDex4HBsxkT/sx39iPAntyOWkQ==
=2qng
-----END PGP SIGNATURE-----

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

From Andreas Tille@21:1/5 to Paul Gevers on Sat Apr 11 07:20:02 2020

Hi Paul,

thanks for the clarification. This commit

https://salsa.debian.org/qa/udd/-/commit/6a874a89365671dd37a14a9bca25290dc55a1fc9

imports the current data. I will tests this a bit more and than activate
in a cron job as importer.

Thanks a lot for your contribution

Andreas.

On Fri, Apr 10, 2020 at 09:05:31PM +0200, Paul Gevers wrote:

Hi Andreas,

On 09-04-2020 22:53, Andreas Tille wrote:

valid_keys = ( 'run_id',
# 'created_at', # Paul Gevers: should be ignored
# 'updated_at', # Paul Gevers: should be ignored
'suite',
'arch',
'package', # ----> should be renamed to 'source'
'version',
'trigger', # usually package.*version

I expected you to mostly see "" or "migration-reference/0" here, with
some hand crafted text from random DD's.

'status',
'requestor', # 'britney', 'debci' or e-mail

Debian login to be precise, not e-mail.

'pin_packages', # []

Since a couple of months this json and other pages only show "pure"
suite runs, to pin_packages is always empty. pin_packages contains which packages are taken from another suite than the base suite.

# 'worker', # Paul Gevers: should be ignored (is 'null' anyway)

Oh, bug somewhere I guess.

'date',
'duration_seconds',
'last_pass_date',
'last_pass_version',
'message', # see below
'previous_status',
# 'duration_human', # Paul Gevers: duration_seconds and duration_human feel double and the former is leaner for in a database
# 'blame', # Paul Gevers: should be ignored
)

# message can be
# $ grep '"message"' packages*.json | sed 's/^.*\.json: *//' | sort | uniq
# "message": "All tests passed" -> "status": "pass"
# "message": "Could not run tests due to a temporary testbed failure" -> "status": "tmpfail"
# "message": "elbrus" -> "status": "tmpfail"
# "message": "Erroneous package" -> "status": "fail"
# "message": null -> "status": "fail"
# "message": "No tests in this package or all skipped" -> "status": "neutral"
# "message": "Tests failed", -> "status": "fail"
# "message": "Tests failed, and at least one test skipped" -> "status": "fail"
# "message": "Tests passed, but at least one test skipped" -> "status": "pass"
# "message": "Unexpected autopkgtest exit code 20" -> "status": "tmpfail"

I agree that leaving out worker which is really always null makes sense
but I tend to leave message since leaving this out looks like loosing information. I tried to find a relation to status but it seems the same status can result in different messages. I think just a field in addition will not blow up UDD way more than it recently is - may be I consider
a normalised form, but usually UDD is not very normalised at all.

In general this is the final output from autopkgtest. But, as you see my
name there, I had to clean up several times and to be able to find those back, I added my nick to the message. The list thus may change over time.

Paul

--
http://fam-tille.de

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

From Lucas Nussbaum@21:1/5 to Andreas Tille on Mon Apr 13 22:50:02 2020

Hi Andreas,

I'm sorry if I haven't paid enough attention. But what is the difference
with the 'ci' importer?

https://salsa.debian.org/qa/udd/-/blob/master/rimporters/ci.rb

Thanks

Lucas

On 11/04/20 at 07:12 +0200, Andreas Tille wrote:

Hi Paul,

thanks for the clarification. This commit

https://salsa.debian.org/qa/udd/-/commit/6a874a89365671dd37a14a9bca25290dc55a1fc9

imports the current data. I will tests this a bit more and than activate
in a cron job as importer.

Thanks a lot for your contribution

Andreas.

On Fri, Apr 10, 2020 at 09:05:31PM +0200, Paul Gevers wrote:

Hi Andreas,

On 09-04-2020 22:53, Andreas Tille wrote:

valid_keys = ( 'run_id',
# 'created_at', # Paul Gevers: should be ignored
# 'updated_at', # Paul Gevers: should be ignored
'suite',
'arch',
'package', # ----> should be renamed to 'source'
'version',
'trigger', # usually package.*version

I expected you to mostly see "" or "migration-reference/0" here, with
some hand crafted text from random DD's.

'status',
'requestor', # 'britney', 'debci' or e-mail

Debian login to be precise, not e-mail.

'pin_packages', # []

Since a couple of months this json and other pages only show "pure"
suite runs, to pin_packages is always empty. pin_packages contains which packages are taken from another suite than the base suite.

# 'worker', # Paul Gevers: should be ignored (is 'null' anyway)

Oh, bug somewhere I guess.

'date',
'duration_seconds',
'last_pass_date',
'last_pass_version',
'message', # see below
'previous_status',
# 'duration_human', # Paul Gevers: duration_seconds and duration_human feel double and the former is leaner for in a database
# 'blame', # Paul Gevers: should be ignored
)

# message can be
# $ grep '"message"' packages*.json | sed 's/^.*\.json: *//' | sort | uniq
# "message": "All tests passed" -> "status": "pass"
# "message": "Could not run tests due to a temporary testbed failure" -> "status": "tmpfail"
# "message": "elbrus" -> "status": "tmpfail"
# "message": "Erroneous package" -> "status": "fail"
# "message": null -> "status": "fail"
# "message": "No tests in this package or all skipped" -> "status": "neutral"
# "message": "Tests failed", -> "status": "fail"
# "message": "Tests failed, and at least one test skipped" -> "status": "fail"
# "message": "Tests passed, but at least one test skipped" -> "status": "pass"
# "message": "Unexpected autopkgtest exit code 20" -> "status": "tmpfail"

I agree that leaving out worker which is really always null makes sense but I tend to leave message since leaving this out looks like loosing information. I tried to find a relation to status but it seems the same status can result in different messages. I think just a field in addition
will not blow up UDD way more than it recently is - may be I consider
a normalised form, but usually UDD is not very normalised at all.

In general this is the final output from autopkgtest. But, as you see my name there, I had to clean up several times and to be able to find those back, I added my nick to the message. The list thus may change over time.

Paul

--
http://fam-tille.de

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

From Andreas Tille@21:1/5 to Lucas Nussbaum on Tue Apr 14 06:20:01 2020

Hi Lucas,

On Mon, Apr 13, 2020 at 10:37:57PM +0200, Lucas Nussbaum wrote:

I'm sorry if I haven't paid enough attention. But what is the difference
with the 'ci' importer?

https://salsa.debian.org/qa/udd/-/blob/master/rimporters/ci.rb

On 11/04/20 at 07:12 +0200, Andreas Tille wrote:

Hi Paul,

thanks for the clarification. This commit

https://salsa.debian.org/qa/udd/-/commit/6a874a89365671dd37a14a9bca25290dc55a1fc9

imports the current data. I will tests this a bit more and than activate in a cron job as importer.

Thanks a lot for your contribution

Andreas.

On Fri, Apr 10, 2020 at 09:05:31PM +0200, Paul Gevers wrote:

Hi Andreas,

On 09-04-2020 22:53, Andreas Tille wrote:

valid_keys = ( 'run_id',
# 'created_at', # Paul Gevers: should be ignored
# 'updated_at', # Paul Gevers: should be ignored
'suite',
'arch',
'package', # ----> should be renamed to 'source'
'version',
'trigger', # usually package.*version

I expected you to mostly see "" or "migration-reference/0" here, with some hand crafted text from random DD's.

'status',
'requestor', # 'britney', 'debci' or e-mail

Debian login to be precise, not e-mail.

'pin_packages', # []

Since a couple of months this json and other pages only show "pure"
suite runs, to pin_packages is always empty. pin_packages contains which packages are taken from another suite than the base suite.

# 'worker', # Paul Gevers: should be ignored (is 'null' anyway)

Oh, bug somewhere I guess.

'date',
'duration_seconds',
'last_pass_date',
'last_pass_version',
'message', # see below
'previous_status',
# 'duration_human', # Paul Gevers: duration_seconds and duration_human feel double and the former is leaner for in a database
# 'blame', # Paul Gevers: should be ignored
)

# message can be
# $ grep '"message"' packages*.json | sed 's/^.*\.json: *//' | sort | uniq
# "message": "All tests passed" -> "status": "pass"
# "message": "Could not run tests due to a temporary testbed failure" -> "status": "tmpfail"
# "message": "elbrus" -> "status": "tmpfail"
# "message": "Erroneous package" -> "status": "fail"
# "message": null -> "status": "fail"
# "message": "No tests in this package or all skipped" -> "status": "neutral"
# "message": "Tests failed", -> "status": "fail"
# "message": "Tests failed, and at least one test skipped" -> "status": "fail"
# "message": "Tests passed, but at least one test skipped" -> "status": "pass"
# "message": "Unexpected autopkgtest exit code 20" -> "status": "tmpfail"

I agree that leaving out worker which is really always null makes sense but I tend to leave message since leaving this out looks like loosing information. I tried to find a relation to status but it seems the same
status can result in different messages. I think just a field in addition
will not blow up UDD way more than it recently is - may be I consider
a normalised form, but usually UDD is not very normalised at all.

In general this is the final output from autopkgtest. But, as you see my name there, I had to clean up several times and to be able to find those back, I added my nick to the message. The list thus may change over time.

Paul

--
http://fam-tille.de

--
http://fam-tille.de

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

From Andreas Tille@21:1/5 to All on Wed Apr 15 22:50:01 2020

Hi,

I think I've found some inconsistency of the autopkgtest data: packages_unstable_amd64.json contains:

{
"run_id": 4963349,
"created_at": "2020-04-13 12:01:58 UTC",
"updated_at": "2020-04-14 20:25:31 UTC",
"suite": "unstable",
"arch": "amd64",
"package": "dnapi",
"version": "1.1-1",
"trigger": null,
"status": "neutral",
"requestor": "debci",
"pin_packages": [

],
"worker": null,
"date": "2020-04-14 19:48:49 UTC",
"duration_seconds": 32,
"last_pass_date": null,
"last_pass_version": "",
"message": "No tests in this package or all skipped",
"previous_status": "neutral",
"duration_human": "32s"
}

But there is a test available:

udd=# SELECT source, release, testsuite FROM sources WHERE source = 'dnapi' ;
source | release | testsuite
--------+---------+------------------------
dnapi | sid | autopkgtest-pkg-python

and it seems to be run:

https://ci.debian.net/data/autopkgtest/unstable/amd64/d/dnapi/4963349/log.gz

says:

autopkgtest [19:48:46]: test autodep8-python3: set -e ; for py in $(py3versions -r 2>/dev/null) ; do cd "$AUTOPKGTEST_TMP" ; echo "Testing with $py:" ; $py -c "import dnapilib; print(dnapilib)" ; done
autopkgtest [19:48:46]: test autodep8-python3: [-----------------------
Testing with python3.8:
<module 'dnapilib' from '/usr/lib/python3/dist-packages/dnapilib/__init__.py'> autopkgtest [19:48:47]: test autodep8-python3: -----------------------] autopkgtest [19:48:47]: test autodep8-python3: - - - - - - - - - - results - - - - - - - - - -
autodep8-python3 PASS (superficial)
autopkgtest [19:48:47]: @@@@@@@@@@@@@@@@@@@@ summary
autodep8-python3 PASS (superficial)

So shouldn't this be rather "pass" than "neutral"?

Kind regards

Andreas.

--
http://fam-tille.de

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

From Andreas Tille@21:1/5 to Paul Gevers on Thu Apr 16 10:10:01 2020

On Thu, Apr 16, 2020 at 09:32:43AM +0200, Paul Gevers wrote:

Hi Andreas,

On 15-04-2020 22:45, Andreas Tille wrote:

autodep8-python3 PASS (superficial)

superficial is translated to neutral. As is FAIL (flaky).

Hmmmm, what exactly means "superficial". Are all those

Testsuite: autopkgtest-pkg-*

superficial? Do they qualify for early testing migration or not?
Wouldn't it be more informative to have a fourth category

pass
superficial
neutral
fail

My intention was to have a list with packages of our team where either a
test is missing or failing. My idea was that some autopkgtest-pkg-* is
"test is not missing". What is your opinion as debian-ci team about my
idea?

Kind regards

Andreas.

--
http://fam-tille.de

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

From Paul Gevers@21:1/5 to Andreas Tille on Thu Apr 16 09:40:02 2020

This is an OpenPGP/MIME signed message (RFC 4880 and 3156) --ueFC6bGlsXVJLcLQnWRz5ekmGD9mT94Go
Content-Type: text/plain; charset=utf-8
Content-Language: en-US
Content-Transfer-Encoding: quoted-printable

Hi Andreas,

On 15-04-2020 22:45, Andreas Tille wrote:

autodep8-python3 PASS (superficial)

superficial is translated to neutral. As is FAIL (flaky).

Paul

--ueFC6bGlsXVJLcLQnWRz5ekmGD9mT94Go--

-----BEGIN PGP SIGNATURE-----

iQEzBAEBCAAdFiEEWLZtSHNr6TsFLeZynFyZ6wW9dQoFAl6YCiQACgkQnFyZ6wW9 dQrCMAgApr7FkgoAW//iAoHaDiw5EcVIesutJGmVden/M5dD3uVwFB2DwZuq6IT7 SnIPiexY6+Nb/r8ZmHn1WzTxoGRQ550qZNX7emrOHTISP9y9iO43eX/WzRaOgBf/ VWgQQHnMuyyg9cWpta7YX0Pqk0YvguxvBNZZLRgw40d0pdRGEcwwdDMu3LJfeCjC bVFs1Jj4eLE7KZdxJA5sHq7bknrzhscrvnn42AItLHTI/7AslR6tEXfN7HbLVP5v pMORTqDBtV/Sd+QdPjzyr5+8J+PqamAXNnUOefqwN+zAfcKtAt+bRfRBgFqRNwnp jggq7U85RKyaBbUe9+ayUomqarIHKw==
=Hi/f
-----END PGP SIGNATURE-----

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

From Simon McVittie@21:1/5 to Andreas Tille on Thu Apr 16 11:00:01 2020

On Thu, 16 Apr 2020 at 10:09:36 +0200, Andreas Tille wrote:

Hmmmm, what exactly means "superficial".

Originally proposed in: https://salsa.debian.org/ci-team/autopkgtest/-/merge_requests/19

Implemented in:
https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=904979

The design was originally called "trivial" but was renamed to
"superficial" during implementation.

Are all those

Testsuite: autopkgtest-pkg-*

superficial?

Most of them yes, because if they fail, that's Very Bad, but if they
succeed, they don't give us a whole lot of confidence that the package
works as intended.

For example, if you have a package python3-dbus, the most that can be
done with knowledge of Python but no knowledge of dbus is to "import dbus"
and see what happens. If that fails, obviously the dbus module is
unusable (or missing a dependency or something); but if that succeeds,
that fact doesn't tell us whether it can actually do D-Bus successfully,
which is its real purpose. If I replaced all the code in python3-dbus
with print("hello"), obviously it wouldn't be implementing its intended
API any more, but it would still pass the "import" test. That's why the "import" test is considered to be superficial.

If a family of packages have a convention for how to discover and
run a real test suite with non-trivial coverage (for example GNOME-style installed-tests in /usr/share/installed-tests/**/*.test), then an
autopkgtest-* helper that detected and used *that* convention would usually *not* be superficial.

Do they qualify for early testing migration or not?

If *all* the tests are superficial, passing them doesn't speed up testing migration.

Packages get faster testing migration if they have at least one test that results in a pass status (passing and not superficial), *and* all of their tests result in either a pass or neutral status.

Normal tests: pass -> pass, fail -> fail

"superficial" tests: pass -> neutral, fail -> fail
(the test passing doesn't really give us much confidence that it works)

"flaky" tests: pass -> pass, fail -> neutral
(the fail doesn't give us much confidence that it *doesn't* work, because
maybe it just failed randomly)

"flaky, superficial" tests: pass -> neutral, fail -> neutral
(no machine-readable effect, should normally only be used when monitoring whether a new test is reliable on Debian infrastructure or not)

"skippable" tests: pass -> as above, exit 77 -> neutral, fail -> as above

Tests that can't be run due to restrictions: neutral

smcv

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

From Paul Gevers@21:1/5 to Andreas Tille on Thu Apr 16 10:40:03 2020

This is an OpenPGP/MIME signed message (RFC 4880 and 3156) --0cHrOphQ0nY3JKjWElOyG6Ev2YAKmrvrk
Content-Type: text/plain; charset=utf-8
Content-Language: en-US
Content-Transfer-Encoding: quoted-printable

Hi Andreas,

On 16-04-2020 10:09, Andreas Tille wrote:

superficial is translated to neutral. As is FAIL (flaky).

Hmmmm, what exactly means "superficial".

Please read the documentation: https://salsa.debian.org/ci-team/autopkgtest/raw/master/doc/README.package-tests.rst

Are all those
Testsuite: autopkgtest-pkg-*

superficial?

No. Only those that only have superficial tests. E.g. ruby runs the full upstream test-suite automatically.

Do they qualify for early testing migration or not?

Superficial tests *are* neutral, so no.

Wouldn't it be more informative to have a fourth category

pass
superficial
neutral
fail

No, because there are only three states. I.e. superficial and flaky and
skipped tests all end up meaning the same.

My intention was to have a list with packages of our team where either a
test is missing or failing. My idea was that some autopkgtest-pkg-* is
"test is not missing". What is your opinion as debian-ci team about my
idea?

It seems you want to be processing the message field then, but honestly
if there is an entry, "test is not missing". If there is no entry, "test
is missing". Failing is "fail". flaky tests are also "test is not
missing", skipped tests are also "test is not missing". Why would you
need all those states? You have the "message" field in your UDD schema
to check why you get the neutral state.

Paul

--0cHrOphQ0nY3JKjWElOyG6Ev2YAKmrvrk--

-----BEGIN PGP SIGNATURE-----

iQEzBAEBCAAdFiEEWLZtSHNr6TsFLeZynFyZ6wW9dQoFAl6YF5kACgkQnFyZ6wW9 dQprkwgAvrkmHvdSEmKjk4C7NGL9+uVU55jP/8bvLK69pPNPU1mfqV9Wd9Pa1jS2 tvPmSrKVlk5GSkLUINzLYhVudaf7CbunAJ8oG+oodzv5f+DOApzcMdCE/FUL9SWJ nDBNoYjBs2X2ht8zBpbafwVmzZLDpEIiO7XzW+iHl1rEX51VvyfRlmO6zmms+Ce0 caHkQ0E2otDz8EmY1LiON55i/948RJ5CjDt5uikbvp+HSGmr5npNT/ev1oSnABPR R23Vctc/xTuWFfkE3BKuHYxV/B6/B5GpoygLwSrAujqyJnMMKMdJ9a0zoLzFN8/f aozRAsbMvkC+KqJ9lwOHWwzkD7qJZg==
=Q/P7
-----END PGP SIGNATURE-----

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

From Andreas Tille@21:1/5 to Paul Gevers on Thu Apr 16 11:10:02 2020

Hi Paul,

On Thu, Apr 16, 2020 at 10:30:17AM +0200, Paul Gevers wrote:

Hmmmm, what exactly means "superficial".

Please read the documentation: https://salsa.debian.org/ci-team/autopkgtest/raw/master/doc/README.package-tests.rst

Thanks for this pointer.

Are all those
Testsuite: autopkgtest-pkg-*

superficial?

No. Only those that only have superficial tests. E.g. ruby runs the full upstream test-suite automatically.

Ahhh!

Do they qualify for early testing migration or not?

Superficial tests *are* neutral, so no.

Wouldn't it be more informative to have a fourth category

pass
superficial
neutral
fail

No, because there are only three states. I.e. superficial and flaky and skipped tests all end up meaning the same.

My intention was to have a list with packages of our team where either a test is missing or failing. My idea was that some autopkgtest-pkg-* is "test is not missing". What is your opinion as debian-ci team about my idea?

It seems you want to be processing the message field then, but honestly
if there is an entry, "test is not missing". If there is no entry, "test
is missing". Failing is "fail". flaky tests are also "test is not
missing", skipped tests are also "test is not missing". Why would you
need all those states? You have the "message" field in your UDD schema
to check why you get the neutral state.

I definitely want to expose the full message (that's why I insisted to
put it into the table). My intention is to expose only those packages
where some work needs to be done. Otherwise the list will be to long.
I will think about the plan after receiving your information.

Kind regards

Andreas.

--
http://fam-tille.de

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

From gregor herrmann@21:1/5 to Simon McVittie on Thu Apr 16 15:20:02 2020

On Thu, 16 Apr 2020 09:56:15 +0100, Simon McVittie wrote:

Are all those

Testsuite: autopkgtest-pkg-*

superficial?

Most of them yes, because if they fail, that's Very Bad, but if they
succeed, they don't give us a whole lot of confidence that the package
works as intended.

For example, if you have a package python3-dbus, the most that can be
done with knowledge of Python but no knowledge of dbus is to "import dbus" and see what happens. If that fails, obviously the dbus module is
unusable (or missing a dependency or something); but if that succeeds,
that fact doesn't tell us whether it can actually do D-Bus successfully, which is its real purpose. If I replaced all the code in python3-dbus
with print("hello"), obviously it wouldn't be implementing its intended
API any more, but it would still pass the "import" test. That's why the "import" test is considered to be superficial.

If a family of packages have a convention for how to discover and
run a real test suite with non-trivial coverage (for example GNOME-style installed-tests in /usr/share/installed-tests/**/*.test), then an autopkgtest-* helper that detected and used *that* convention would usually *not* be superficial.

Very similar in the perl case, and autodep8 shows the restrictions:

% autodep8
Test-Command: /usr/share/pkg-perl-autopkgtest/runner build-deps
Depends: @, @builddeps@, pkg-perl-autopkgtest
Restrictions: skippable
Features: test-name=autodep8-perl-build-deps

Test-Command: /usr/share/pkg-perl-autopkgtest/runner runtime-deps
Depends: @, pkg-perl-autopkgtest
Restrictions: skippable, superficial
Features: test-name=autodep8-perl

Test-Command: /usr/share/pkg-perl-autopkgtest/runner runtime-deps-and-recommends
Depends: @, pkg-perl-autopkgtest
Restrictions: needs-recommends, skippable, superficial
Features: test-name=autodep8-perl-recommends

The first one runs "smoke" which executes the upstream test suite, so
not superficial; the second and third ones run use.t and syntax.t
which try to load the module and do a syntax check, respectively, and
they are supercifial, for the reasons Simon explained for the python
case above.

Cheers,
gregor

--
.''`. https://info.comodo.priv.at -- Debian Developer https://www.debian.org
: :' : OpenPGP fingerprint D1E1 316E 93A7 60A8 104D 85FA BB3A 6801 8649 AA06
`. `' Member VIBE!AT & SPI Inc. -- Supporter Free Software Foundation Europe
`- NP: Peter Ratzenbeck: Bonny Bessie Logan

-----BEGIN PGP SIGNATURE-----

iQKTBAEBCgB9FiEE0eExbpOnYKgQTYX6uzpoAYZJqgYFAl6YWzhfFIAAAAAALgAo aXNzdWVyLWZwckBub3RhdGlvbnMub3BlbnBncC5maWZ0aGhvcnNlbWFuLm5ldEQx RTEzMTZFOTNBNzYwQTgxMDREODVGQUJCM0E2ODAxODY0OUFBMDYACgkQuzpoAYZJ qgae1A/8D30yIT3WaKm3n/vS8QULJFvgdfQTD5sNw8tzPMHXpHLy6YmfvvNLHi7p W2j5s1Sit6P9GNJzOp7Qml7Q1hBGqkUpDwQdGWAG/NpccM4nZOV5stEsz6Jic+kK MJ/xWZuBm6xPwSM38irfoJqoST0oHN4NKLTMAaJ4sX85zmC0po75Wc4ZQT6N9uTv yHAMVZfDN3W2V2G3MzlfU9yz7Iq8buguSgdf9PI7/Au3aHcij3dswT410kpn4fUO 8LHbAfgTQJRygT4+sc6bMz/ejfgqzkIqWSVq0Q40TL9Pkv8+TOINV7l6zzDxeZeF Xl7+83aw0YXDVJQt3hmDtxsKvSu8RZRMsZpGZ4YbBEury75CeHuwPhnFOxhQvzvt dXHKjsW2AqrAO17XTWrmcbgiijuja2AXhxCEvszzEMv3moDxLQjFfC/8uXc4lwwA iTMo5Yrhb0JCz1nD/Dox24izAME9Wp+N/gtcr9jwJFQIl/CjRv9BQCej35cWC7ex Sg9JXRkfamATX0Cb+ltrlIbG9GqjyfG+x2kuJFuyLDJ44FZ0qR+lzAZWM9cKmEXj Pt848yYK9cqmiBIKMISNm3lV7OT8yvfhn0xxveElcwmRoVOP7qTRIBP/uqF9+vCf 0BXLC5zaoS2gYebMMmlrjj0Q4X/8LiJv2ANi6WpG38eK/NlrWzA=
=lJZf
-----END PGP SIGNATURE-----

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

From Andreas Tille@21:1/5 to Andreas Tille on Tue Apr 28 09:30:02 2020

Hi again Lucas,

may be my mail slipped through but basically the difference between the two importers is that you are just parsing

https://ci.debian.net/data/status/unstable/amd64/packages.json

while the Python script is parsing

https://ci.debian.net/data/status/*/*/packages.json

You even have line 5

# FIXME there might be more suites at some point

so you was aware of that issue. Would you mind solving that FIXME? Sorry,
I do not speak Ruby.

Kind regards

Andreas.

On Tue, Apr 14, 2020 at 06:12:39AM +0200, Andreas Tille wrote:

Hi Lucas,

On Mon, Apr 13, 2020 at 10:37:57PM +0200, Lucas Nussbaum wrote:

I'm sorry if I haven't paid enough attention. But what is the difference with the 'ci' importer?

https://salsa.debian.org/qa/udd/-/blob/master/rimporters/ci.rb

I think the problem is that UDD is not documented and I simply did not
know about the ci table. :-(

However, when looking at it there is a difference:

udd=# select status, arch, count(*) from ci group by status, arch order by status, arch;
status | arch | count
---------+-------+-------
fail | amd64 | 925
neutral | amd64 | 1593
pass | amd64 | 10805
tmpfail | amd64 | 14
(4 Zeilen)

udd=# select status, architecture, count(*) from autopkgtest group by status, architecture order by status, architecture;
status | architecture | count
---------+--------------+-------
fail | amd64 | 1561
fail | arm64 | 1471
fail | ppc64el | 711
neutral | amd64 | 2879
neutral | arm64 | 1322
neutral | ppc64el | 298
pass | amd64 | 21373
pass | arm64 | 11114
pass | ppc64el | 2458
tmpfail | amd64 | 11
tmpfail | arm64 | 85
tmpfail | ppc64el | 7
(12 Zeilen)

I guess we should merge both and make sure that all data is imported.
I would never have written a new importer if I would have been aware
of the existing one - but I do not speak Ruby to fix the existing one.

Kind regards
Andreas.

On 11/04/20 at 07:12 +0200, Andreas Tille wrote:

Hi Paul,

thanks for the clarification. This commit

https://salsa.debian.org/qa/udd/-/commit/6a874a89365671dd37a14a9bca25290dc55a1fc9

imports the current data. I will tests this a bit more and than activate in a cron job as importer.

Thanks a lot for your contribution

Andreas.

On Fri, Apr 10, 2020 at 09:05:31PM +0200, Paul Gevers wrote:

Hi Andreas,

On 09-04-2020 22:53, Andreas Tille wrote:

valid_keys = ( 'run_id',
# 'created_at', # Paul Gevers: should be ignored
# 'updated_at', # Paul Gevers: should be ignored
'suite',
'arch',
'package', # ----> should be renamed to 'source'
'version',
'trigger', # usually package.*version

I expected you to mostly see "" or "migration-reference/0" here, with some hand crafted text from random DD's.

'status',
'requestor', # 'britney', 'debci' or e-mail

Debian login to be precise, not e-mail.

'pin_packages', # []

Since a couple of months this json and other pages only show "pure" suite runs, to pin_packages is always empty. pin_packages contains which
packages are taken from another suite than the base suite.

# 'worker', # Paul Gevers: should be ignored (is 'null' anyway)

Oh, bug somewhere I guess.

'date',
'duration_seconds',
'last_pass_date',
'last_pass_version',
'message', # see below
'previous_status',
# 'duration_human', # Paul Gevers: duration_seconds and duration_human feel double and the former is leaner for in a database
# 'blame', # Paul Gevers: should be ignored
)

# message can be
# $ grep '"message"' packages*.json | sed 's/^.*\.json: *//' | sort | uniq
# "message": "All tests passed" -> "status": "pass"
# "message": "Could not run tests due to a temporary testbed failure" -> "status": "tmpfail"
# "message": "elbrus" -> "status": "tmpfail"
# "message": "Erroneous package" -> "status": "fail"
# "message": null -> "status": "fail"
# "message": "No tests in this package or all skipped" -> "status": "neutral"
# "message": "Tests failed", -> "status": "fail"
# "message": "Tests failed, and at least one test skipped" -> "status": "fail"
# "message": "Tests passed, but at least one test skipped" -> "status": "pass"
# "message": "Unexpected autopkgtest exit code 20" -> "status": "tmpfail"

I agree that leaving out worker which is really always null makes sense
but I tend to leave message since leaving this out looks like loosing information. I tried to find a relation to status but it seems the same
status can result in different messages. I think just a field in addition
will not blow up UDD way more than it recently is - may be I consider a normalised form, but usually UDD is not very normalised at all.

In general this is the final output from autopkgtest. But, as you see my
name there, I had to clean up several times and to be able to find those
back, I added my nick to the message. The list thus may change over time.

Paul

--
http://fam-tille.de

--
http://fam-tille.de

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

From Lucas Nussbaum@21:1/5 to Andreas Tille on Fri May 1 23:10:02 2020

Hi,

I fixed it and added the same suites as you were, reverted your changes
and dropped the autopkgtest table. So everything should be fine.

Sorry for not following closely enough when you started to work on that.

Lucas

On 28/04/20 at 09:23 +0200, Andreas Tille wrote:

Hi again Lucas,

may be my mail slipped through but basically the difference between the two importers is that you are just parsing

https://ci.debian.net/data/status/unstable/amd64/packages.json

while the Python script is parsing

https://ci.debian.net/data/status/*/*/packages.json

You even have line 5

# FIXME there might be more suites at some point

so you was aware of that issue. Would you mind solving that FIXME? Sorry,
I do not speak Ruby.

Kind regards

Andreas.

On Tue, Apr 14, 2020 at 06:12:39AM +0200, Andreas Tille wrote:

Hi Lucas,

On Mon, Apr 13, 2020 at 10:37:57PM +0200, Lucas Nussbaum wrote:

I'm sorry if I haven't paid enough attention. But what is the difference with the 'ci' importer?

https://salsa.debian.org/qa/udd/-/blob/master/rimporters/ci.rb

I think the problem is that UDD is not documented and I simply did not
know about the ci table. :-(

However, when looking at it there is a difference:

udd=# select status, arch, count(*) from ci group by status, arch order by status, arch;
status | arch | count
---------+-------+-------
fail | amd64 | 925
neutral | amd64 | 1593
pass | amd64 | 10805
tmpfail | amd64 | 14
(4 Zeilen)

udd=# select status, architecture, count(*) from autopkgtest group by status, architecture order by status, architecture;
status | architecture | count
---------+--------------+-------
fail | amd64 | 1561
fail | arm64 | 1471
fail | ppc64el | 711
neutral | amd64 | 2879
neutral | arm64 | 1322
neutral | ppc64el | 298
pass | amd64 | 21373
pass | arm64 | 11114
pass | ppc64el | 2458
tmpfail | amd64 | 11
tmpfail | arm64 | 85
tmpfail | ppc64el | 7
(12 Zeilen)

I guess we should merge both and make sure that all data is imported.
I would never have written a new importer if I would have been aware
of the existing one - but I do not speak Ruby to fix the existing one.

Kind regards
Andreas.

On 11/04/20 at 07:12 +0200, Andreas Tille wrote:

Hi Paul,

thanks for the clarification. This commit

https://salsa.debian.org/qa/udd/-/commit/6a874a89365671dd37a14a9bca25290dc55a1fc9

imports the current data. I will tests this a bit more and than activate
in a cron job as importer.

Thanks a lot for your contribution

Andreas.

On Fri, Apr 10, 2020 at 09:05:31PM +0200, Paul Gevers wrote:

Hi Andreas,

On 09-04-2020 22:53, Andreas Tille wrote:

valid_keys = ( 'run_id',
# 'created_at', # Paul Gevers: should be ignored
# 'updated_at', # Paul Gevers: should be ignored
'suite',
'arch',
'package', # ----> should be renamed to 'source'
'version',
'trigger', # usually package.*version

I expected you to mostly see "" or "migration-reference/0" here, with some hand crafted text from random DD's.

'status',
'requestor', # 'britney', 'debci' or e-mail

Debian login to be precise, not e-mail.

'pin_packages', # []

Since a couple of months this json and other pages only show "pure" suite runs, to pin_packages is always empty. pin_packages contains which
packages are taken from another suite than the base suite.

# 'worker', # Paul Gevers: should be ignored (is 'null' anyway)

Oh, bug somewhere I guess.

'date',
'duration_seconds',
'last_pass_date',
'last_pass_version',
'message', # see below
'previous_status',
# 'duration_human', # Paul Gevers: duration_seconds and duration_human feel double and the former is leaner for in a database
# 'blame', # Paul Gevers: should be ignored
)

# message can be
# $ grep '"message"' packages*.json | sed 's/^.*\.json: *//' | sort | uniq
# "message": "All tests passed" -> "status": "pass"
# "message": "Could not run tests due to a temporary testbed failure" -> "status": "tmpfail"
# "message": "elbrus" -> "status": "tmpfail"
# "message": "Erroneous package" -> "status": "fail"
# "message": null -> "status": "fail"
# "message": "No tests in this package or all skipped" -> "status": "neutral"
# "message": "Tests failed", -> "status": "fail"
# "message": "Tests failed, and at least one test skipped" -> "status": "fail"
# "message": "Tests passed, but at least one test skipped" -> "status": "pass"
# "message": "Unexpected autopkgtest exit code 20" -> "status": "tmpfail"

I agree that leaving out worker which is really always null makes sense
but I tend to leave message since leaving this out looks like loosing
information. I tried to find a relation to status but it seems the same
status can result in different messages. I think just a field in addition
will not blow up UDD way more than it recently is - may be I consider
a normalised form, but usually UDD is not very normalised at all.

In general this is the final output from autopkgtest. But, as you see my
name there, I had to clean up several times and to be able to find those
back, I added my nick to the message. The list thus may change over time.

Paul

--
http://fam-tille.de

--
http://fam-tille.de

--
http://fam-tille.de

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

From Andreas Tille@21:1/5 to Lucas Nussbaum on Sat May 2 09:40:02 2020

Hi Lucas,

On Fri, May 01, 2020 at 10:57:39PM +0200, Lucas Nussbaum wrote:

I fixed it and added the same suites as you were, reverted your changes
and dropped the autopkgtest table. So everything should be fine.

Thanks for this.

Sorry for not following closely enough when you started to work on that.

No problem - I've learned something anyway. ;-)

BTW, if I just want to update my local mirror, how can I do this:

$ /srv/udd.debian.org/udd/rudd ci
Traceback (most recent call last):
/srv/udd.debian.org/udd/rudd:62:in `<main>': No option specified (RuntimeError)

Kind regards

Andreas.

--
http://fam-tille.de

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

From Andreas Tille@21:1/5 to Andreas Tille on Sat May 2 09:50:01 2020

Hi Lucas,

please forget my question - I simply missed the '--run' option.

Sorry for the noise, Andreas.

On Sat, May 02, 2020 at 09:37:43AM +0200, Andreas Tille wrote:

Hi Lucas,

On Fri, May 01, 2020 at 10:57:39PM +0200, Lucas Nussbaum wrote:

I fixed it and added the same suites as you were, reverted your changes
and dropped the autopkgtest table. So everything should be fine.

Thanks for this.

Sorry for not following closely enough when you started to work on that.

No problem - I've learned something anyway. ;-)

BTW, if I just want to update my local mirror, how can I do this:

$ /srv/udd.debian.org/udd/rudd ci
Traceback (most recent call last):
/srv/udd.debian.org/udd/rudd:62:in `<main>': No option specified (RuntimeError)

Kind regards

Andreas.

--
http://fam-tille.de

--
http://fam-tille.de

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

From Lucas Nussbaum@21:1/5 to Andreas Tille on Sat May 2 09:50:01 2020

On 02/05/20 at 09:37 +0200, Andreas Tille wrote:

Hi Lucas,

On Fri, May 01, 2020 at 10:57:39PM +0200, Lucas Nussbaum wrote:

I fixed it and added the same suites as you were, reverted your changes
and dropped the autopkgtest table. So everything should be fine.

Thanks for this.

Sorry for not following closely enough when you started to work on that.

No problem - I've learned something anyway. ;-)

BTW, if I just want to update my local mirror, how can I do this:

$ /srv/udd.debian.org/udd/rudd ci
Traceback (most recent call last):
/srv/udd.debian.org/udd/rudd:62:in `<main>': No option specified (RuntimeError)

rudd --built-in ci
or
rudd --run ci

Lucas

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

Sysop:	Keyop
Location:	Huddersfield, West Yorkshire, UK
Users:	300
Nodes:	16 (2 / 14)
Uptime:	57:00:11
Calls:	6,712
Files:	12,243
Messages:	5,355,478

[UDD] Is there any information about failed autopkgtest in UDD?

Who's Online

System Info