2010-11-23 01:27:49 +02:00
|
|
|
Open policy decisions
|
|
|
|
=====================
|
|
|
|
|
2011-07-23 17:45:24 +03:00
|
|
|
http://www.debian.org/doc/debian-policy/ch-controlfields.html#s5.6.10
|
|
|
|
|
2010-11-23 01:27:49 +02:00
|
|
|
- what to do about cyclic dependencies ?
|
|
|
|
|
2011-07-23 17:45:24 +03:00
|
|
|
Update: addressed in
|
|
|
|
http://www.debian.org/doc/debian-policy/ch-relationships.html#s-binarydeps
|
|
|
|
|
2010-11-23 01:27:49 +02:00
|
|
|
A cyclic dependency can be bad new or something perfectly normal,
|
|
|
|
depending on how we define the semantics of package A depending on
|
|
|
|
package B, and what policy we adopt with respect to the existence of
|
|
|
|
cyclic dependencies:
|
|
|
|
|
|
|
|
1) "B must be installed before A"
|
|
|
|
|
|
|
|
In this case, a cyclic dependency means that the package in
|
|
|
|
question cannot be installed using the respective sequence of
|
|
|
|
installations.
|
|
|
|
|
|
|
|
However, this does not mean that no other sequence can exist in which
|
|
|
|
the package could be installed.
|
|
|
|
|
|
|
|
Example:
|
|
|
|
|
|
|
|
A depends on B. There are two versions of B: B_0 depends on nothing
|
|
|
|
else while B_1 depends on A.
|
|
|
|
|
|
|
|
If we try to resolve A's dependency with B_1, we enter a circular
|
|
|
|
dependency and fail. If we use B_0 instead, there is no problem.
|
|
|
|
|
|
|
|
This means that there are (at least) the following three possible
|
|
|
|
policies:
|
|
|
|
|
|
|
|
1A) Cyclic dependencies are tolerated and just mean that the package
|
|
|
|
in question may not be installable (for whatever reason).
|
|
|
|
|
|
|
|
1B) A cyclic dependency is always considered an error.
|
|
|
|
|
|
|
|
1C) Cyclic dependencies are tolerated as long as there is a way around
|
|
|
|
them, as in the example above.
|
|
|
|
|
|
|
|
2) "B must be installed with A"
|
|
|
|
|
|
|
|
In this case, the cyclic dependency would not be a problem as long as
|
|
|
|
all the packages in the cycle are installed together.
|
|
|
|
|
|
|
|
Should an installation get interrupted and cause only part of the
|
|
|
|
packages to get installed, the system would then be in an anomalous
|
|
|
|
configuration.
|
|
|
|
|
|
|
|
If cyclic dependencies are to be interpreted this way, they are not a
|
|
|
|
problem per se. Policy may still discourage their use, though.
|
|
|
|
|
|
|
|
- what to do if we need something that's "provided" ?
|
|
|
|
|
2011-07-23 17:45:24 +03:00
|
|
|
Update: "Provides" is described here, but without answering the above
|
|
|
|
question:
|
|
|
|
http://www.debian.org/doc/debian-policy/ch-relationships.html#s-virtual
|
|
|
|
|
2010-11-23 01:27:49 +02:00
|
|
|
When determining prerequisites, we may encounter a dependency on an item
|
2011-07-23 17:45:24 +03:00
|
|
|
that only appears in the Provides: field of a package but is not an
|
2010-11-23 01:27:49 +02:00
|
|
|
installable package itself.
|
|
|
|
|
|
|
|
Should we
|
|
|
|
|
|
|
|
1) consider installing the package that provides the requested item, or
|
|
|
|
|
|
|
|
2) ignore the package, leaving it to the user to choose what to do.
|
|
|
|
|
|
|
|
3) if there's only one choice do 1) else do 2).
|
|
|
|
|
|
|
|
?
|
|
|
|
|
|
|
|
Policy 1 would make sense if this is merely an alias or if a package
|
|
|
|
enumerates its constituents, which at some point in time - in the past
|
|
|
|
or in the future - are separate packages.
|
|
|
|
|
|
|
|
Example:
|
|
|
|
|
|
|
|
- package "dwarf-pluto" could provide "planet-pluto", for packages that
|
|
|
|
haven't been updated yet,
|
|
|
|
|
|
|
|
- "binutils" could provide "as", "ld", etc., to allow packages that only
|
|
|
|
need specific parts to depend on them (with the option of breaking
|
|
|
|
binutils into its constituents in the future),
|
|
|
|
|
|
|
|
- similarly, if "as", "ld", etc., where individual packages in the past
|
|
|
|
but are now combined into "binutils", "binutils" could still provide
|
|
|
|
its constituents for compatibility with packages whose dependencies
|
|
|
|
have not been updated yet.
|
|
|
|
|
|
|
|
Policy 2 would seem more appropriate in the common case of multiple
|
|
|
|
choices.
|
|
|
|
|
|
|
|
|
|
|
|
Example:
|
|
|
|
|
|
|
|
- packages "emacs" and "vim" could both provide "editor", leaving the
|
|
|
|
choice to the user.
|
|
|
|
|
|
|
|
- similarly, message packages "foo-en", "foo-zh", etc., could both
|
|
|
|
provide "foo-messages".
|
|
|
|
|
|
|
|
In the above example, "Provides" could also be use to prioritize choices,
|
|
|
|
e.g., if "foo-en" provides "lang-en" and "foo-zh" provides "lang-zh",
|
|
|
|
future installations could prefer prerequisites that introduce fewer new
|
|
|
|
items. So a package "bar-en" providing "bar-messages" and "lang-en" would
|
|
|
|
be chosen over "bar-zh" providing "bar-messages" and "lang-zh" if we have
|
|
|
|
already installed "foo-en" but not "foo-zh" (or vice versa).
|
|
|
|
|
|
|
|
|
2010-11-20 12:18:44 +02:00
|
|
|
Still left to do
|
|
|
|
================
|
2010-11-20 03:20:15 +02:00
|
|
|
|
2011-07-23 17:45:24 +03:00
|
|
|
- make comp_versions work according to
|
|
|
|
http://www.debian.org/doc/debian-policy/ch-controlfields.html#s-f-Version
|
|
|
|
|
2010-11-22 04:51:35 +02:00
|
|
|
- consider reducing the size of the lists of conflicts, e.g., by making
|
|
|
|
them unique via a red-black tree
|
2010-11-20 00:16:00 +02:00
|
|
|
|
|
|
|
- handle Provides:
|
|
|
|
|
2010-11-23 00:30:09 +02:00
|
|
|
Update: Provides data is now parsed and properly integrated in the
|
|
|
|
package database, but not yet used to resolve prerequisites.
|
|
|
|
|
2010-11-20 00:16:00 +02:00
|
|
|
- sort prerequisites such that they can be installed in the specified order
|
|
|
|
|
|
|
|
- consider Architecture:
|
|
|
|
|
qpkg: added detection of cyclic dependencies
We define a cyclic dependency as the possibility (!) of the prerequisites
of a package X including a package that depends on X, and issue an error
if encountering such a situation.
Note that, if the dependencies of X can be resolved in a manner that does
not include the cyclic dependency, qpkg will still fail if it encounters
the cycle. Also note that qpkg (at least so far) does no perform an
exhaustive search to ferret out cyclic dependencies.
Furthermore, we don't consider that a cyclic dependency may not necessarily
imply a real life problem. E.g., if a package A contains elements X and
Y, with X needing package B, and the content of package B has a run-time
dependency on Y, the cyclic dependency between A and B would not exist
when considering its constituents. Since we don't have this information, we
just err on the side of caution.
- qpkg.h (enum flags): divide flags into categories (parse-time and
run-time) and add flag QPKG_ADDING to mark packets whose dependencies we
are processing
- prereq.c (resolve, prereq): track which packages we're tentatively
considering for installation and detect cyclic dependencies
- test/cyclic: regression test for detection of cyclic dependencies
- TODO: updated with recent changes
2010-11-22 01:31:20 +02:00
|
|
|
Update: we parse and record it now but don't use it yet.
|
|
|
|
|
2010-11-20 00:16:00 +02:00
|
|
|
- what to do with explicit and implicit replacement ?
|
|
|
|
|
|
|
|
- if we can't resolve the prerequisites, give at least a hint of what one
|
|
|
|
can do to improve the situation
|
|
|
|
|
|
|
|
- check database for internal consistency
|
|
|
|
|
qpkg: added detection of cyclic dependencies
We define a cyclic dependency as the possibility (!) of the prerequisites
of a package X including a package that depends on X, and issue an error
if encountering such a situation.
Note that, if the dependencies of X can be resolved in a manner that does
not include the cyclic dependency, qpkg will still fail if it encounters
the cycle. Also note that qpkg (at least so far) does no perform an
exhaustive search to ferret out cyclic dependencies.
Furthermore, we don't consider that a cyclic dependency may not necessarily
imply a real life problem. E.g., if a package A contains elements X and
Y, with X needing package B, and the content of package B has a run-time
dependency on Y, the cyclic dependency between A and B would not exist
when considering its constituents. Since we don't have this information, we
just err on the side of caution.
- qpkg.h (enum flags): divide flags into categories (parse-time and
run-time) and add flag QPKG_ADDING to mark packets whose dependencies we
are processing
- prereq.c (resolve, prereq): track which packages we're tentatively
considering for installation and detect cyclic dependencies
- test/cyclic: regression test for detection of cyclic dependencies
- TODO: updated with recent changes
2010-11-22 01:31:20 +02:00
|
|
|
Update: added detection of cyclic dependencies (in progress)
|
|
|
|
|
2010-11-22 14:28:34 +02:00
|
|
|
Update: added test for QPKG_ADDING cleanup bug
|
|
|
|
|
2010-11-20 00:16:00 +02:00
|
|
|
- implement keyword search
|
2010-11-20 12:18:44 +02:00
|
|
|
|
|
|
|
- consider also supporting the similar but not identical (parent ?) format
|
|
|
|
of /var/lib/dpkg/status and /var/lib/apt/lists/*Packages
|
|
|
|
|
qpkg: added some field names and relational operators from Ubuntu
- gobble.c (gobble_buf): recognize (and ignore) the following fields:
Bugs, Installed-Size, MD5sum, Origin, Original-Maintainer, SHA1, SHA256,
Task.
- pkg.h (enum relop): added comments showing the respective symbol
- pkg.h (enum relop): added rel_gg (>>), rel_le (<=), and rel_ll (<<)
- gobble.c (gobble_buf): recognize >>, <=, and <<
- TODO: record progress in support beyond Jlime/OE
2010-11-23 03:48:28 +02:00
|
|
|
Update: added as much as my Ubuntu system can reach before hitting |
|
|
|
|
|
2010-11-20 12:18:44 +02:00
|
|
|
|
|
|
|
Done
|
|
|
|
====
|
|
|
|
|
|
|
|
- optimize the search trees. Right now, we have 81812 calls to make_id
|
|
|
|
for 14601 packages, resulting in 7420560 calls to comp_id.
|
|
|
|
|
|
|
|
There can be at most 2 new identifiers per package (package name and
|
|
|
|
version), so a perfectly balanced tree should have a depth of no more
|
|
|
|
than 14. If we assume that each call to make_id searches to the bottom,
|
|
|
|
we'd get 1145368 calls to comp_id, about 15% of the current number.
|
|
|
|
|
|
|
|
So the tree is clearly degenerated.
|
|
|
|
|
|
|
|
Update: after switching to red-black trees, we get only 1497604 calls
|
|
|
|
to comp_id. This is 130% of the "good case" estimate above. Insertion
|
|
|
|
of a new node is currently done with two lookups, so we'll get rid of
|
|
|
|
some more lookups after further optimization.
|
|
|
|
|
|
|
|
Update: after merging the two lookups per new node into one, we're at
|
|
|
|
1172642 calls to comp_id, or 102% of the predicted "good case".
|
2010-11-22 04:02:10 +02:00
|
|
|
|
|
|
|
- if there are multiple choices, try to prefer more recent versions
|
2010-11-22 04:51:35 +02:00
|
|
|
|
|
|
|
- check whether introducing a new package would cause a conflict
|
|
|
|
|
|
|
|
Update: conflicts among the packages considered for installation are now
|
|
|
|
checked.
|
|
|
|
|
|
|
|
- compile the list of conflicts of installed packages
|