proposal: testing: store test artifacts #71287

neild · 2025-01-15T22:05:02Z

This is an offshoot of #43936.

Some tests produce output files which the user may want to examine. For example, a test might produce files which are compared to some reference. If the comparison fails, the user will want to examine the generated files. Or a test might produce a packet capture or similar trace which can be used to debug failures.

We call these output files "test artifacts".

This is a proposal to add support for storing test artifacts to the testing package.

We add a new method to testing.TB:

package testing

// OutputDir returns a directory for the test to store output files in.
// When the -outputdir flag is provided, this directory will be located
// under that directory. Otherwise, OutputDir returns a temporary directory
// which is removed after the test completes.
//
// Each test or subtest has a unique artifact directory.
// Repeated calls to OutputDir in the same test or subtest return the same directory.
// Subtest outputs are not located under the parent test's output directory.
func (t *T) OutputDir() string

The -outputdir flag already exists, and is currently used to specify a location to put output files from profiling. We're adding an additional meaning to it here: It's now where all your saved test outputs go.

When -outputdir is specified, the first call to OutputDir in a test or subtest will emit a line to the test output consisting of "=== ARTIFACTS ", the test name, and the test artifact directory, separated by spaces:

=== ARTIFACTS TestName/subtest_name /path/to/root/artifact/dir/TestName__subtest_name

When -json is specified, this will appear as an entry with an Action of "artifacts", the usual Time, Package, and Test keys, and a "Path" key containing the artifact directory:

{"Time":"2025-01-15T13:39:27.75235-08:00","Action":"artifacts","Package":"path","Test":"TestName","Path":"/path/to/root/artifact/dir/TestName"}
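A tool consuming the -json stream could pick these events out roughly as follows. This is a sketch only: the artifactEvent type and parseArtifactEvent function are hypothetical names, and the "Path" key is the one proposed here, not something today's JSON output contains.

```go
package main

import (
	"encoding/json"
	"fmt"
	"time"
)

// artifactEvent mirrors the keys shown in the proposal's example event.
// Path is the proposed new key; other keys in the stream are ignored.
type artifactEvent struct {
	Time    time.Time
	Action  string
	Package string
	Test    string
	Path    string
}

// parseArtifactEvent decodes one JSON line and reports whether it is an
// "artifacts" event.
func parseArtifactEvent(line []byte) (artifactEvent, bool, error) {
	var ev artifactEvent
	if err := json.Unmarshal(line, &ev); err != nil {
		return artifactEvent{}, false, err
	}
	return ev, ev.Action == "artifacts", nil
}

func main() {
	line := []byte(`{"Time":"2025-01-15T13:39:27.75235-08:00","Action":"artifacts","Package":"path","Test":"TestName","Path":"/path/to/root/artifact/dir/TestName"}`)
	ev, ok, err := parseArtifactEvent(line)
	if err != nil {
		fmt.Println("decode error:", err)
		return
	}
	if ok {
		fmt.Printf("%s: artifacts in %s\n", ev.Test, ev.Path)
	}
}
```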

That's the proposal.

A few points on the design:

I'm reusing the existing -outputdir flag, on the theory that output files from profiling are just another test artifact. If we don't like that reuse, then we could add a new -artifactdir flag and rename TB.OutputDir to TB.ArtifactDir for consistency.
The test output uses the word "ARTIFACTS" because the JSON already has "output" events.
TB.OutputDir returns a directory, same as TB.TempDir. This seems simpler than asking the testing package to copy files around.
TB.OutputDir returns a directory even if we aren't saving artifacts so test behavior doesn't change depending on the presence or absence of the -outputdir flag.

In simple interactive use, users can pass -outputdir to store test artifacts when debugging a failing test.

Test frameworks that collect artifacts can arrange to pass -outputdir to the tests they run and collect any artifacts after the fact.

As a concrete use case, within Google our testing infrastructure sets an environment variable to the location of a directory. Tests can write files into this directory, and those files will be stored and associated with the test run. If we implement this proposal, we can arrange for the test infrastructure to also pass this directory as an -outputdir flag, and any test using TB.OutputDir will automatically use the right location.

mknyszek · 2025-01-15T22:34:16Z

Speaking from the perspective of Go's CI infrastructure, it would be helpful to have some standard convention to indicate output artifacts at the standard library level, so I'm in support of this proposal (or something like it).

We share the tooling for processing Go tests with several other teams, and without a standard for this sort of thing, we would be stuck having to define our own amongst ourselves. That may turn out to be OK in practice, but having a single way to do it would save us all the time of trying to define our own convention, since it wouldn't even be a question. As far as test metadata goes, I think this would be one of the most valuable forms, for the Go team anyway, and IMO more valuable than Attr on its own due to the well-defined convention. I can think of several use-cases for exactly this:

Execution trace tests and pprof tests would emit artifacts when they encounter broken traces produced by the runtime, for debugging.
Crashes in CI would result in core dumps that can be easily downloaded and dissected. (So many more issues would be possible to debug!)
Integration tests for binary format serialization/deserialization could dump intermediate artifacts when things go wrong.

I don't have many thoughts on the specifics of how this should look in the output or in the testing package, but from my perspective this doesn't seem like much of an overreach into framework territory. It's a fairly common and simple piece of functionality, and again, essentially just defining a convention.

seankhliao · 2025-01-15T23:10:39Z

How would this interact with fuzzing which currently stores new inputs that cause crashes in testdata/fuzz/ ?

neild · 2025-01-15T23:28:09Z

No changes to fuzzing, unless I'm missing something. Is there some interaction that should happen there?

seankhliao · 2025-01-15T23:42:46Z

I was thinking they're similar in that they generate test artifacts, and if you're running in some remote CI system, you may want a (relatively) easy way to extract the new additions from the other fuzz inputs, such as redirecting them to a specified directory as in this proposal.

neild · 2025-01-16T00:03:29Z

I don't think we can have -outputdir stop us from writing fuzz crashes to testdata/fuzz; anyone using fuzzing now is presumably depending on the current behavior and won't want it changed by an orthogonal feature.

Perhaps we could write fuzz crashes to both testdata/fuzz and -outputdir. But I think simpler would be to have a separate -fuzzoutputdir flag to set where new fuzz crashes get written. If you want everything in the same place, set both -fuzzoutputdir and -outputdir.

I think a -fuzzoutputdir flag is probably a separate proposal, though.

earthboundkid · 2025-01-16T00:03:35Z

Instead of returning a string, could it return an os.Root?

neild · 2025-01-16T00:07:00Z

> Instead of returning a string, could it return an os.Root?

It could, but that'd be a gratuitous divergence from TB.TempDir. If you want a Root, it's easy enough to open the directory as one. And sometimes you do want just the directory name (when passing it as a flag to some other process, say).

willfaught · 2025-01-16T19:42:32Z

If I run such tests for an unfamiliar package with just go test, will my system accrue these output artifacts permanently without my knowing? How do I clean these, or prevent them entirely?

Where will the paths likely be? Under /tmp, under the package, under GOPATH?

Do tests get unique paths over multiple test runs, or does one test run overwrite the artifacts of another for the same test? If overwrite, are all previous artifacts deleted first?

> I was thinking they're similar in that they generate test artifacts, and if you're running in some remote CI system, you may want a (relatively) easy way to extract the new additions from the other fuzz inputs, such as redirecting them to a specified directory as in this proposal.

Perhaps fuzz artifacts should be thought of as generated inputs, not outputs, so they shouldn't be included in this.

neild · 2025-01-16T19:58:58Z

> If I run such tests for an unfamiliar package with just go test, will my system accrue these output artifacts permanently without my knowing? How do I clean these, or prevent them entirely?

If you don't pass the -outputdir flag to "go test", artifacts get written to a temporary directory (exactly the same as if they were written to a t.TempDir) and are deleted after the test finishes.

If you do pass -outputdir, artifacts get written to that directory and you can decide what to do with them from there. Persistent artifacts are only produced when you ask for them, in the location you specify.

> Do tests get unique paths over multiple test runs, or does one test run overwrite the artifacts of another for the same test? If overwrite, are all previous artifacts deleted first?

That is an excellent question. I'm going to have to think about it.

gopherbot added the Proposal label Jan 15, 2025

gopherbot added this to the Proposal milestone Jan 15, 2025

neild mentioned this issue Jan 15, 2025 in "proposal: testing: structured output for test attributes" #43936 (open)

gabyhelp added the LibraryProposal label ("Issues describing a requested change to the Go standard library or x/ libraries, but not to a tool") Jan 15, 2025

ianlancetaylor added this to Proposals Jan 16, 2025

ianlancetaylor moved this to Incoming in Proposals Jan 16, 2025
