mirror of
git://projects.qi-hardware.com/wernermisc.git
synced 2024-11-23 18:10:19 +02:00
326cd33967
- TODO: various updates - pkg.h (enum relop): marked rel_lt as deprecated - prereq.c (resolve): instrumentation for chasing a dependencies bug
182 lines
6.2 KiB
Plaintext
182 lines
6.2 KiB
Plaintext
Open policy decisions
|
|
=====================
|
|
|
|
http://www.debian.org/doc/debian-policy/ch-controlfields.html#s5.6.10
|
|
|
|
- what to do about cyclic dependencies ?
|
|
|
|
Update: addressed in
|
|
http://www.debian.org/doc/debian-policy/ch-relationships.html#s-binarydeps
|
|
|
|
A cyclic dependency can be bad new or something perfectly normal,
|
|
depending on how we define the semantics of package A depending on
|
|
package B, and what policy we adopt with respect to the existence of
|
|
cyclic dependencies:
|
|
|
|
1) "B must be installed before A"
|
|
|
|
In this case, a cyclic dependency means that the package in
|
|
question cannot be installed using the respective sequence of
|
|
installations.
|
|
|
|
However, this does not mean that no other sequence can exist in which
|
|
the package could be installed.
|
|
|
|
Example:
|
|
|
|
A depends on B. There are two versions of B: B_0 depends on nothing
|
|
else while B_1 depends on A.
|
|
|
|
If we try to resolve A's dependency with B_1, we enter a circular
|
|
dependency and fail. If we use B_0 instead, there is no problem.
|
|
|
|
This means that there are (at least) the following three possible
|
|
policies:
|
|
|
|
1A) Cyclic dependencies are tolerated and just mean that the package
|
|
in question may not be installable (for whatever reason).
|
|
|
|
1B) A cyclic dependency is always considered an error.
|
|
|
|
1C) Cyclic dependencies are tolerated as long as there is a way around
|
|
them, as in the example above.
|
|
|
|
2) "B must be installed with A"
|
|
|
|
In this case, the cyclic dependency would not be a problem as long as
|
|
all the packages in the cycle are installed together.
|
|
|
|
Should an installation get interrupted and cause only part of the
|
|
packages to get installed, the system would then be in an anomalous
|
|
configuration.
|
|
|
|
If cyclic dependencies are to be interpreted this way, they are not a
|
|
problem per se. Policy may still discourage their use, though.
|
|
|
|
- what to do if we need something that's "provided" ?
|
|
|
|
Update: "Provides" is described here, but without answering the above
|
|
question:
|
|
http://www.debian.org/doc/debian-policy/ch-relationships.html#s-virtual
|
|
|
|
When determining prerequisites, we may encounter a dependency on an item
|
|
that only appears in the Provides: field of a package but is not an
|
|
installable package itself.
|
|
|
|
Should we
|
|
|
|
1) consider installing the package that provides the requested item, or
|
|
|
|
2) ignore the package, leaving it to the user to choose what to do.
|
|
|
|
3) if there's only one choice do 1) else do 2).
|
|
|
|
?
|
|
|
|
Policy 1 would make sense if this is merely an alias or if a package
|
|
enumerates its constituents, which at some point in time - in the past
|
|
or in the future - are separate packages.
|
|
|
|
Example:
|
|
|
|
- package "dwarf-pluto" could provide "planet-pluto", for packages that
|
|
haven't been updated yet,
|
|
|
|
- "binutils" could provide "as", "ld", etc., to allow packages that only
|
|
need specific parts to depend on them (with the option of breaking
|
|
binutils into its constituents in the future),
|
|
|
|
- similarly, if "as", "ld", etc., where individual packages in the past
|
|
but are now combined into "binutils", "binutils" could still provide
|
|
its constituents for compatibility with packages whose dependencies
|
|
have not been updated yet.
|
|
|
|
Policy 2 would seem more appropriate in the common case of multiple
|
|
choices.
|
|
|
|
|
|
Example:
|
|
|
|
- packages "emacs" and "vim" could both provide "editor", leaving the
|
|
choice to the user.
|
|
|
|
- similarly, message packages "foo-en", "foo-zh", etc., could both
|
|
provide "foo-messages".
|
|
|
|
In the above example, "Provides" could also be use to prioritize choices,
|
|
e.g., if "foo-en" provides "lang-en" and "foo-zh" provides "lang-zh",
|
|
future installations could prefer prerequisites that introduce fewer new
|
|
items. So a package "bar-en" providing "bar-messages" and "lang-en" would
|
|
be chosen over "bar-zh" providing "bar-messages" and "lang-zh" if we have
|
|
already installed "foo-en" but not "foo-zh" (or vice versa).
|
|
|
|
|
|
Still left to do
|
|
================
|
|
|
|
- make comp_versions work according to
|
|
http://www.debian.org/doc/debian-policy/ch-controlfields.html#s-f-Version
|
|
|
|
- consider reducing the size of the lists of conflicts, e.g., by making
|
|
them unique via a red-black tree
|
|
|
|
- handle Provides:
|
|
|
|
Update: Provides data is now parsed and properly integrated in the
|
|
package database, but not yet used to resolve prerequisites.
|
|
|
|
- sort prerequisites such that they can be installed in the specified order
|
|
|
|
- consider Architecture:
|
|
|
|
Update: we parse and record it now but don't use it yet.
|
|
|
|
- what to do with explicit and implicit replacement ?
|
|
|
|
- if we can't resolve the prerequisites, give at least a hint of what one
|
|
can do to improve the situation
|
|
|
|
- check database for internal consistency
|
|
|
|
Update: added detection of cyclic dependencies (in progress)
|
|
|
|
Update: added test for QPKG_ADDING cleanup bug
|
|
|
|
- implement keyword search
|
|
|
|
- consider also supporting the similar but not identical (parent ?) format
|
|
of /var/lib/dpkg/status and /var/lib/apt/lists/*Packages
|
|
|
|
Update: added as much as my Ubuntu system can reach before hitting |
|
|
|
|
|
|
Done
|
|
====
|
|
|
|
- optimize the search trees. Right now, we have 81812 calls to make_id
|
|
for 14601 packages, resulting in 7420560 calls to comp_id.
|
|
|
|
There can be at most 2 new identifiers per package (package name and
|
|
version), so a perfectly balanced tree should have a depth of no more
|
|
than 14. If we assume that each call to make_id searches to the bottom,
|
|
we'd get 1145368 calls to comp_id, about 15% of the current number.
|
|
|
|
So the tree is clearly degenerated.
|
|
|
|
Update: after switching to red-black trees, we get only 1497604 calls
|
|
to comp_id. This is 130% of the "good case" estimate above. Insertion
|
|
of a new node is currently done with two lookups, so we'll get rid of
|
|
some more lookups after further optimization.
|
|
|
|
Update: after merging the two lookups per new node into one, we're at
|
|
1172642 calls to comp_id, or 102% of the predicted "good case".
|
|
|
|
- if there are multiple choices, try to prefer more recent versions
|
|
|
|
- check whether introducing a new package would cause a conflict
|
|
|
|
Update: conflicts among the packages considered for installation are now
|
|
checked.
|
|
|
|
- compile the list of conflicts of installed packages
|