[PATCH] Add a TEST-OP-TEST-FAILURE condition for test libraries to sub-class

Tue Oct 8 04:17:21 UTC 2019

Vladmir, the library content is not just the signal trick (or special vars).

It defines the unified common test result representation, provides a bridge
between the protocol test libraries "signal" their result and the protocol
the user running tests consumes the results. The library can also
provide means to create the common test results as a normal return value;
or maybe deliver it through a stack-based mechanism, but explicitly,
without relying on test frameworks to do it automatically (in some cases
I anticipate explicit creation of the result will be more convenient)

And yes, the most important, adding such functionality into the public ASDF API
requires too much upfront thinking. After separate library experiments with
the approach and stabilizes, ASDF can incorporate, or officially refer it.

As Robert mentions in the pull request, having this functionality in new
versions of ASDF would require the consumer to check feature flags
in their code. In the case of a separate library, the consumer can just load
this library, and it will work even with old versions of ASDF.

04.10.2019, 06:39, "Vladimir Sedach" <vas at oneofus.la>:
> Robert Goldman <rpgoldman at sift.info> writes:
>
>>>  - success should also be signaled, so we can distinguish a version
>>>  where
>>>    this new protocol is not implemented from the version where tests
>>>  pass
>>
>>  This requires a protocol where ASDF can "know" when the test op is
>>  done, so that it can distinguish "has succeeded" from "has not failed
>>  yet." It's possible that this could be managed by the test op on the
>>  system as a whole, but the design must consider the possibility that
>>  the test-op may be implemented so that:
>>
>>  1. Multiple calls are made into the test library (possibly even
>>  multiple test libraries are used)
>>  2. TEST-OP may involve invoking test operation on systems depended on
>>  (e.g., where a system has subsystems).
>
> There would be three situations here:
>
> 1. OPERATE TEST-OP returns with no relevant conditions signaled. You
> can infer that the condition protocol is not implemented.
> 2. OPERATE TEST-OP returns and one or more test failure conditions
> are signaled.
> 3. OPERATE TEST-OP returns and only test success conditions are
> signaled.
>
>>>  - the minimal requirement is a success / failure designator, the
>>>  failed
>>>    test names can be optional
>>
>>  - Additional requirement: the condition should support accessing both
>>  a long and a short form report. In degenerate implementations, these
>>  can, of course, be identical.
>
> What would the long and short form reports look like?
>
>>>    some corner cases. Still, even if we expect test results being
>>>  signalled
>>>    multiple times during a test-op, it would be good to provide a
>>>  wrapper
>>>    which aggregates them into a single return value.
>>>
>>>          (common-test-results:collect (asdf:test-system "my-system"))
>>
>>  This would require that the test library provide an extensible
>>  protocol for fusing together multiple test results.
>
> It is simpler than that: take all of the conditions, add the number
> of tests executed, and append all of the test failure lists. No need
> for library-specific code.
>
>>  And note that the above suggestion will not work, because ASDF does
>>  not ever *return* a value from operating. This has to do with the
>>  way ASDF creates a plan and then executes it. The plan doesn't
>>  support a notion of "return value," so the only way to get
>>  information out of ASDF is through conditions.
>
> What COMMON-TEST-RESULTS:COLLECT would do is handle and coalesce
> multiple conditions and re-signal a single condition.
>
>>  I agree -- I think `TRIVIAL-TEST-INTERFACE` might be a better first
>>  step. I suppose the alternative rationale is that a test interface
>>  that was *not* intended for incorporation into ASDF would be able to
>>  just *return* things, instead of *signaling* them.
>
> The point is, systems already define TEST-OP. I am trying to use
> that. The code that uses TEST-OP can do whatever it needs to, but the
> communication between that code and the test libraries has to be done
> by stack-based mechanisms like conditions or special variables.
>
>>  That is true, but it's also true that it would require special
>>  condition-handling to fit test results into continuous integration
>>  -- programmers would no longer be able to just use `quit-on-error`,
>>  which is a very handy way to turn a lisp test into something that
>>  works in Jenkins or, for that matter, any infrastructure based on
>>  shell scripting.
>
> Right now errors are not signaled on test failures in most
> definitions of TEST-OP I looked at, so this is not something that is
> currently going on. Neither is this something that would stop working
> for anyone that has the signal-error-on-failure flags set for their
> test library, or is throwing errors explicitly.
>
>>  I'd rather have to write code to handle errors when I *don't* want
>>  them, than have test failure not be an error.
>>
>>  If I'm running interactively, it's not a bother to deal with this as
>>  an error condition -- I can easily get out of the debugger. But
>>  writing a lot of code to catch `TEST-FAILURE` conditions and
>>  translate them into exit with non-zero status would be a pain.
>
> Test libraries already have flags whether to signal errors on test
> failures or not. Having the condition be a sub-class of error would
> not only be annoying in the REPL, it would break whatever test
> automation code uses these flags, and it would change the behavior of
> TEST-OP, most of whose callers do not expect it to signal errors on
> test failures right now. This is a lot of breakage of thousands of
> existing systems, just to avoid doing the following in a few test
> automation scripts:
>
> (handler-case
>     (asdf:test-system "some-system")
>   (asdf:test-op-test-failures (condition)
>     (princ condition uiop:*stderr*)
>     (uiop:quit 1)))
>
>>>  - slot for the failing asdf system could probably be avoided,
>>>    the list failed test names could be enough, if the names are
>>>  "fully qualified"
>>>    i.e. include package or system name.
>>
>>  I don't think we can make any assumptions about the above -- there's
>>  no rule about how a programmer can assign test names in a library
>>  like FiveAM to packages.
>
> FiveAM test names are symbols, so they already get printed with their
> package name in the implementation I did for FiveAM, without any
> extra work.
>
>>  I would note also that getting a new library into Quicklisp for this
>>  is going to be a lot easier than getting a new ASDF into Quicklisp:
>>  Xach has for years refused to update the ASDF version in Quicklisp,
>>  and I don't see any reason to believe this will change.
>
> As I mentioned before, I would like to avoid creating a whole library
> out of something that is a work-around to OPERATE not returning
> results.
>
> Unfortunately, it seems there are both social and technical problems
> with updating ASDF. In particular I do not see a good mechanism for
> advertising the availability of this condition protocol to test
> libraries (there does not seem to be an established way of
> advertising new ASDF functionality other than the :ASDF3.3 :ASDF3.2
> etc. keywords in *FEATURES*).
>
> As Anton pointed out, this necessitates the libraries signaling a
> condition for test success, which necessitates a function like
> COMMON-TEST-RESULTS:COLLECT. While writing an implementation of that
> function, I realized I would need to add continue restarts (the only
> way to handle a condition without affecting control flow).
>
> Take all of this together, and it becomes apparent that avoiding ASDF
> to provide a more useful TEST-OP is, ironically, the way to go. It is
> simpler to drive communications down the stack by binding special
> variables to act as accumulators, than it is communicating up the
> stack with signals, handlers, and restarts. Putting this into a
> library means TEST-OP would still retain a use as a way to trigger
> test runs without knowing details about either the tests or the test
> library (the first half of this proposal), but it will unfortunately
> mean that ASDF will continue to have no say about what the effects of
> TEST-OP are (the second half of this proposal).
>
> --
> Vladimir Sedach
> Software engineering services in Los Angeles https://oneofus.la