[asdf-devel] Pre ML post review
Faré
fahree at gmail.com
Thu Dec 17 03:18:11 UTC 2009
Dear ASDF developers,
Rommel and I are currently working on asdf bug #485918,
i.e. a plan for user-customizable persistent configuration of ASDF paths
through configuration files (duh).
https://bugs.launchpad.net/asdf/+bug/485918.
What follows is a detailed plan and several questions.
General Idea
============
The general idea is that in configuration files such as
/etc/common-lisp/source-registry.conf
~/.config/common-lisp/source-registry.conf
will be a SEXP in a trival domain-specific language
to specify a search path.
Backwards Compatibility
=======================
Now comes the issue of backwards compatibility with the current
asdf:*central-registry* that currently has to be manually configured
by users or supplemental layer (e.g. common-lisp-controller).
My proposal is that there would be a new (private) variable
common-lisp-configuration::*source-registry*
that would store a parsed version of the configuration
as an undocumented opaque data-structure
(except for the private use of ASDF and XCVB),
only accessible through API functions and said DSL.
One magic directive in that DSL, present by default in the configuration,
:default-registry
would hook into the implementation defaults, which would include
the searching of asdf:*central-registry* until it is eventually removed.
Uses of asdf:*central-registry* would be deprecated,
but still supported for many years to come for backwards compatibility.
The same DSL could be recognized by ASDF, XCVB, and any other future
build system for Common Lisp.
Alternatives I considered and rejected included:
1- Keep asdf:*central-registry* as the master with its current semantics,
and somehow the configuration parser expands the new configuration
language into a expanded series of directories of subdirectories to
lookup, pre-recursing through specified hierarchies. This is kludgy,
and leaves little space of future cleanups and extensions.
2- Keep asdf:*central-registry* remains the master but extend its semantics
in completely new ways, so that new kinds of entries may be implemented
as a recursive search, etc. This seems somewhat backwards.
3- Completely remove asdf:*central-registry* and break backwards compatibility.
Hopefully this will happen in a few years after everyone migrate to
a better ASDF and/or to XCVB, but it would be very bad to do it now.
4- Replace asdf:*central-registry* by a symbol-macro with appropriate magic
when you dereference it or setf it. Only the new variable with new
semantics is handled by the new search procedure.
Configuration DSL
=================
Here is the grammar of the SEXP DSL I am considering for configuration:
;; A configuration is single SEXP starting with keyword :source-registry
;; followed by a list of directives.
CONFIGURATION := (:source-registry DIRECTIVE ...)
;; A directive is one of the following:
DIRECTIVE :=
;; add a single directory to be scanned (no recursion)
(:directory DIRECTORY-PATHNAME-DESIGNATOR) |
;; add a directory hierarchy, recursing but excluding specified patterns
(:tree DIRECTORY-PATHNAME-DESIGNATOR &key exclude) |
;; override the default defaults for exclusion patterns
(:exclude-subdirectories PATTERN ...) |
;; splice the parsed contents of another config file
(:include-configuration REGULAR-FILE-PATHNAME-DESIGNATOR) |
;; Your configuration expression MUST have contain exactly one of these:
(:inherit-configuration) | ; splices contents of inherited configuration
(:ignore-inherited-configuration) ; drop contents of inherited configuration
;; This directive specifies that some default must be spliced.
(:default-registry)
PATTERN := a string without wildcards, that will be matched exactly
against the name of a subdirectory.
Configuration Files
===================
Following a suggestion by Stelian, the configuration should be read in
this order, each configuration containing an expression that may extend
or override the previous configuration:
file /etc/common-lisp/source-registry.conf
file ~/.config/common-lisp/source-registry.conf
environment variable CL_SOURCE_REGISTRY
some implementation- or application- specific command-line argument.
Equivalently (since the DSL doesn't allow for uncontrolled side-effects),
the outermost configuration only would be parsed,
and recursing to previous configurations would only happen
if and when specified.
This also leaves the question of whether environment variable and
command-line arguments should take the same SEXP syntax (to be READ),
or whether they should have some more shell-friendly textual
representation of the same information. Allowing for a different
representation is probably more work for marginally more feature,
and I don't recommend it. So, in the end, the same SEXP syntax
for shell variables and command-line arguments.
Configuration API
=================
This API is exported from package COMMON-LISP-CONFIGURATION.
(initialize-source-registry-configuration)
will read the configuration and initialize all internal variables,
and return the new configuration.
(clear-source-registry-configuration)
undoes any initialization. You might want to call that before you
dump an image that would be resumed with a different configuration,
and return an empty configuration.
Also will have a hook that allows clients will use to clear any cache
that depends on this configuration.
(ensure-source-registry-configuration)
checks an initial variable to see whether the state is initialized
or cleared. In the former case, return current configuration;
in the latter, initialize.
ASDF will call this function at the start of (asdf:find-system).
(process-source-registry-configuration X &optional inherit)
If X is a CONS, parse it as a SEXP in the configuration DSL,
and extend or override inheritted configuration.
If X is a STRING, first parse it into a SEXP with READ
(Alternate proposal: parse some shell-friendly text representation).
The inheritted configuration is provided in optional argument inherit,
itself a function that returns the previous configuration, with NIL
designating the default of #'ensure-source-registry-configuration.
Internally, initialize-source-registry-configuration can use this
with a series of functions for inheritted configuration.
Search Algorithm
================
* When a system is searched for, entries are processed in order.
* If a given entry has exactly one match, the search stops successfully.
If a given entry contains no match, it is skipped.
If a given entry contains multiple matches, an error is thrown.
* This later case does not change the semantics of ASDF in the case
where no recursion takes place, and ensures no undetected insanity
happens in the case where recursion is specified. XCVB has tested
this model, with success I believe.
* When an entry is first processed, the implementation may cache
the contents of the directory (i.e. all files that may match anything.)
until the cache is explicitly flushed (see below).
Cache flushing
==============
* The cache is flushed when function
(asdf:clear-system-search-cache)
is called.
* When the configuration is (re)loaded, the cache is flushed.
Questioned Niceties
===================
I've been suggested the below features, but have rejected them,
for the sake of keeping ASDF no more complex than strictly necessary.
* More syntactic sugar: synonyms for the configuration directives, such as
(:add-directory X) for (:directory X), or (:add-directory-hierarchy X)
or (:add-directory X :recurse t) for (:tree X).
* The possibility to register individual files instead of directories.
* Integrate Xach Beane's tilde expander into the parser,
or something similar that is shell-friendly or shell-compatible.
I'd rather keep ASDF minimal. But maybe this precisely keeps it
minimal by removing the need for evaluated entries that ASDF has?
i.e. uses of USER-HOMEDIR-PATHNAME and $SBCL_HOME
Hopefully, these are already superseded by the :default-registry
* We may allow a shell-friendly colon-separated text-based syntax
for environment variables and command-line arguments.
If we only accept directories, not files, then all provided strings should
be considered with implicit / at the end before being parsed by
parse-namestring or such. An explicit // (as in TEXINPUTS) or /**
(as in zsh or some (non-standard) CL pathname extensions, but
requiring painful quotes from the shell) could specify recursion.
Additional Notes
================
With this specification, we can share configuration between ASDF and XCVB,
and maintain reasonable backwards compatibility in ASDF.
[ François-René ÐVB Rideau | Reflection&Cybernethics | http://fare.tunes.org ]
Just because your semi-free country government is evil doesn't mean "native"
governments have a right to exist and enslave "their" people. — Faré
More information about the asdf-devel
mailing list