224 lines
11 KiB
ReStructuredText
224 lines
11 KiB
ReStructuredText
|
.. _development_early_stage:
|
||
|
|
||
|
Early-stage planning
|
||
|
====================
|
||
|
|
||
|
When contemplating a Linux kernel development project, it can be tempting
|
||
|
to jump right in and start coding. As with any significant project,
|
||
|
though, much of the groundwork for success is best laid before the first
|
||
|
line of code is written. Some time spent in early planning and
|
||
|
communication can save far more time later on.
|
||
|
|
||
|
|
||
|
Specifying the problem
|
||
|
----------------------
|
||
|
|
||
|
Like any engineering project, a successful kernel enhancement starts with a
|
||
|
clear description of the problem to be solved. In some cases, this step is
|
||
|
easy: when a driver is needed for a specific piece of hardware, for
|
||
|
example. In others, though, it is tempting to confuse the real problem
|
||
|
with the proposed solution, and that can lead to difficulties.
|
||
|
|
||
|
Consider an example: some years ago, developers working with Linux audio
|
||
|
sought a way to run applications without dropouts or other artifacts caused
|
||
|
by excessive latency in the system. The solution they arrived at was a
|
||
|
kernel module intended to hook into the Linux Security Module (LSM)
|
||
|
framework; this module could be configured to give specific applications
|
||
|
access to the realtime scheduler. This module was implemented and sent to
|
||
|
the linux-kernel mailing list, where it immediately ran into problems.
|
||
|
|
||
|
To the audio developers, this security module was sufficient to solve their
|
||
|
immediate problem. To the wider kernel community, though, it was seen as a
|
||
|
misuse of the LSM framework (which is not intended to confer privileges
|
||
|
onto processes which they would not otherwise have) and a risk to system
|
||
|
stability. Their preferred solutions involved realtime scheduling access
|
||
|
via the rlimit mechanism for the short term, and ongoing latency reduction
|
||
|
work in the long term.
|
||
|
|
||
|
The audio community, however, could not see past the particular solution
|
||
|
they had implemented; they were unwilling to accept alternatives. The
|
||
|
resulting disagreement left those developers feeling disillusioned with the
|
||
|
entire kernel development process; one of them went back to an audio list
|
||
|
and posted this:
|
||
|
|
||
|
There are a number of very good Linux kernel developers, but they
|
||
|
tend to get outshouted by a large crowd of arrogant fools. Trying
|
||
|
to communicate user requirements to these people is a waste of
|
||
|
time. They are much too "intelligent" to listen to lesser mortals.
|
||
|
|
||
|
(https://lwn.net/Articles/131776/).
|
||
|
|
||
|
The reality of the situation was different; the kernel developers were far
|
||
|
more concerned about system stability, long-term maintenance, and finding
|
||
|
the right solution to the problem than they were with a specific module.
|
||
|
The moral of the story is to focus on the problem - not a specific solution
|
||
|
- and to discuss it with the development community before investing in the
|
||
|
creation of a body of code.
|
||
|
|
||
|
So, when contemplating a kernel development project, one should obtain
|
||
|
answers to a short set of questions:
|
||
|
|
||
|
- What, exactly, is the problem which needs to be solved?
|
||
|
|
||
|
- Who are the users affected by this problem? Which use cases should the
|
||
|
solution address?
|
||
|
|
||
|
- How does the kernel fall short in addressing that problem now?
|
||
|
|
||
|
Only then does it make sense to start considering possible solutions.
|
||
|
|
||
|
|
||
|
Early discussion
|
||
|
----------------
|
||
|
|
||
|
When planning a kernel development project, it makes great sense to hold
|
||
|
discussions with the community before launching into implementation. Early
|
||
|
communication can save time and trouble in a number of ways:
|
||
|
|
||
|
- It may well be that the problem is addressed by the kernel in ways which
|
||
|
you have not understood. The Linux kernel is large and has a number of
|
||
|
features and capabilities which are not immediately obvious. Not all
|
||
|
kernel capabilities are documented as well as one might like, and it is
|
||
|
easy to miss things. Your author has seen the posting of a complete
|
||
|
driver which duplicated an existing driver that the new author had been
|
||
|
unaware of. Code which reinvents existing wheels is not only wasteful;
|
||
|
it will also not be accepted into the mainline kernel.
|
||
|
|
||
|
- There may be elements of the proposed solution which will not be
|
||
|
acceptable for mainline merging. It is better to find out about
|
||
|
problems like this before writing the code.
|
||
|
|
||
|
- It's entirely possible that other developers have thought about the
|
||
|
problem; they may have ideas for a better solution, and may be willing
|
||
|
to help in the creation of that solution.
|
||
|
|
||
|
Years of experience with the kernel development community have taught a
|
||
|
clear lesson: kernel code which is designed and developed behind closed
|
||
|
doors invariably has problems which are only revealed when the code is
|
||
|
released into the community. Sometimes these problems are severe,
|
||
|
requiring months or years of effort before the code can be brought up to
|
||
|
the kernel community's standards. Some examples include:
|
||
|
|
||
|
- The Devicescape network stack was designed and implemented for
|
||
|
single-processor systems. It could not be merged into the mainline
|
||
|
until it was made suitable for multiprocessor systems. Retrofitting
|
||
|
locking and such into code is a difficult task; as a result, the merging
|
||
|
of this code (now called mac80211) was delayed for over a year.
|
||
|
|
||
|
- The Reiser4 filesystem included a number of capabilities which, in the
|
||
|
core kernel developers' opinion, should have been implemented in the
|
||
|
virtual filesystem layer instead. It also included features which could
|
||
|
not easily be implemented without exposing the system to user-caused
|
||
|
deadlocks. The late revelation of these problems - and refusal to
|
||
|
address some of them - has caused Reiser4 to stay out of the mainline
|
||
|
kernel.
|
||
|
|
||
|
- The AppArmor security module made use of internal virtual filesystem
|
||
|
data structures in ways which were considered to be unsafe and
|
||
|
unreliable. This concern (among others) kept AppArmor out of the
|
||
|
mainline for years.
|
||
|
|
||
|
In each of these cases, a great deal of pain and extra work could have been
|
||
|
avoided with some early discussion with the kernel developers.
|
||
|
|
||
|
|
||
|
Who do you talk to?
|
||
|
-------------------
|
||
|
|
||
|
When developers decide to take their plans public, the next question will
|
||
|
be: where do we start? The answer is to find the right mailing list(s) and
|
||
|
the right maintainer. For mailing lists, the best approach is to look in
|
||
|
the MAINTAINERS file for a relevant place to post. If there is a suitable
|
||
|
subsystem list, posting there is often preferable to posting on
|
||
|
linux-kernel; you are more likely to reach developers with expertise in the
|
||
|
relevant subsystem and the environment may be more supportive.
|
||
|
|
||
|
Finding maintainers can be a bit harder. Again, the MAINTAINERS file is
|
||
|
the place to start. That file tends to not always be up to date, though,
|
||
|
and not all subsystems are represented there. The person listed in the
|
||
|
MAINTAINERS file may, in fact, not be the person who is actually acting in
|
||
|
that role currently. So, when there is doubt about who to contact, a
|
||
|
useful trick is to use git (and "git log" in particular) to see who is
|
||
|
currently active within the subsystem of interest. Look at who is writing
|
||
|
patches, and who, if anybody, is attaching Signed-off-by lines to those
|
||
|
patches. Those are the people who will be best placed to help with a new
|
||
|
development project.
|
||
|
|
||
|
The task of finding the right maintainer is sometimes challenging enough
|
||
|
that the kernel developers have added a script to ease the process:
|
||
|
|
||
|
::
|
||
|
|
||
|
.../scripts/get_maintainer.pl
|
||
|
|
||
|
This script will return the current maintainer(s) for a given file or
|
||
|
directory when given the "-f" option. If passed a patch on the
|
||
|
command line, it will list the maintainers who should probably receive
|
||
|
copies of the patch. This is the preferred way (unlike "-f" option) to get the
|
||
|
list of people to Cc for your patches. There are a number of options
|
||
|
regulating how hard get_maintainer.pl will search for maintainers; please be
|
||
|
careful about using the more aggressive options as you may end up including
|
||
|
developers who have no real interest in the code you are modifying.
|
||
|
|
||
|
If all else fails, talking to Andrew Morton can be an effective way to
|
||
|
track down a maintainer for a specific piece of code.
|
||
|
|
||
|
|
||
|
When to post?
|
||
|
-------------
|
||
|
|
||
|
If possible, posting your plans during the early stages can only be
|
||
|
helpful. Describe the problem being solved and any plans that have been
|
||
|
made on how the implementation will be done. Any information you can
|
||
|
provide can help the development community provide useful input on the
|
||
|
project.
|
||
|
|
||
|
One discouraging thing which can happen at this stage is not a hostile
|
||
|
reaction, but, instead, little or no reaction at all. The sad truth of the
|
||
|
matter is (1) kernel developers tend to be busy, (2) there is no shortage
|
||
|
of people with grand plans and little code (or even prospect of code) to
|
||
|
back them up, and (3) nobody is obligated to review or comment on ideas
|
||
|
posted by others. Beyond that, high-level designs often hide problems
|
||
|
which are only revealed when somebody actually tries to implement those
|
||
|
designs; for that reason, kernel developers would rather see the code.
|
||
|
|
||
|
If a request-for-comments posting yields little in the way of comments, do
|
||
|
not assume that it means there is no interest in the project.
|
||
|
Unfortunately, you also cannot assume that there are no problems with your
|
||
|
idea. The best thing to do in this situation is to proceed, keeping the
|
||
|
community informed as you go.
|
||
|
|
||
|
|
||
|
Getting official buy-in
|
||
|
-----------------------
|
||
|
|
||
|
If your work is being done in a corporate environment - as most Linux
|
||
|
kernel work is - you must, obviously, have permission from suitably
|
||
|
empowered managers before you can post your company's plans or code to a
|
||
|
public mailing list. The posting of code which has not been cleared for
|
||
|
release under a GPL-compatible license can be especially problematic; the
|
||
|
sooner that a company's management and legal staff can agree on the posting
|
||
|
of a kernel development project, the better off everybody involved will be.
|
||
|
|
||
|
Some readers may be thinking at this point that their kernel work is
|
||
|
intended to support a product which does not yet have an officially
|
||
|
acknowledged existence. Revealing their employer's plans on a public
|
||
|
mailing list may not be a viable option. In cases like this, it is worth
|
||
|
considering whether the secrecy is really necessary; there is often no real
|
||
|
need to keep development plans behind closed doors.
|
||
|
|
||
|
That said, there are also cases where a company legitimately cannot
|
||
|
disclose its plans early in the development process. Companies with
|
||
|
experienced kernel developers may choose to proceed in an open-loop manner
|
||
|
on the assumption that they will be able to avoid serious integration
|
||
|
problems later. For companies without that sort of in-house expertise, the
|
||
|
best option is often to hire an outside developer to review the plans under
|
||
|
a non-disclosure agreement. The Linux Foundation operates an NDA program
|
||
|
designed to help with this sort of situation; more information can be found
|
||
|
at:
|
||
|
|
||
|
https://www.linuxfoundation.org/nda/
|
||
|
|
||
|
This kind of review is often enough to avoid serious problems later on
|
||
|
without requiring public disclosure of the project.
|