-
Notifications
You must be signed in to change notification settings - Fork 864
devel CreateFramework
This wiki page goes through the basics of creating a new Open MPI Modular Component Architecture (MCA) framework. It is assumed that the reader already has at least some familiarity with the internals of Open MPI.
NOTE: This page describes the most recent version of Open MPI's "autogen" process. For older versions, refer to the archived versions of this wiki page:
- Note that OMPI's top-level "autogen.pl" is pivotal to the discovery, configuration, and building of components in the Open MPI source code tree. Be sure to see the autogen.pl wiki page for details about the role of autogen.pl.
- How to create a new MCA component.
As you'll see below, a framework is solely contained within a single directory tree. As such, removing a framework (''and all of its components'') is accomplished via "rm -rf" of the framework directory tree, and then re-running autogen.pl and configure.
This assumes that you have also removed all references to the framework from the rest of the OMPI code base, of course!
There are a few different kinds of frameworks that can be created; this document walks through creating the simplest (and most common): a framework that builds all of its components statically and/or dynamically and chooses which one (or more) to load/use at run-time.
- Pick a framework name. It must obey the following guidelines:
- Be valid as a C variable name (no spaces/punctuation other than "_", start with a letter, etc.)
- Be all lower case
- Be unique across all other MCA frameworks (even those frameworks in different Open MPI architecture levels -- e.g., you can't have a "foo" framework in both the OPAL and ORTE layers)
- Is at least somewhat descriptive (be friendly to your fellow developers!)
- Create a directory with the framework name in /mca/. For the purposes of this document, we'll assume that your framework name is "foo" in the "ompi" project.
- In the ompi/mca/foo directory, create a file named "foo.h".
- This file defines the public interface for all components and modules the foo framework. Specifically, all components should be able to #include "ompi/mca/foo/foo.h" and have all the types declared for all component and module interfaces.
- Use standard protect-against-multiple-include stuff, such as
#ifndef OMPI_MCA_FOO_H
#define OMPI_MCA_FOO_H
/* ...body of the file... */
#endif
- In the ompi/mca/foo directory, create a file named Makefile.am. It is usually easiest to copy a Makefile.am from another framework directory and then change every instance of "that_framework_name" in the Makefile.am to "foo". For the purposes of this document, let's assume that you copied source: trunk/opal/mca/paffinity/Makefile.am and changed all instances of "paffinity" to "foo".
A framework may optionally have a text file named configure.m4 in its top-level directory (e.g., "ompi/mca/foo/configure.m4"). Currently, only two actions are possible in this file:
- Set the configure mode for all components in the framework
- Define some shell code that will be executed in OMPI's top-level configure before evaluating each component in the framework to see if they want to be built.
One or both of the actions can be specified.
The default configure mode for frameworks is to evaluate each component that is found and query to see if it wants to build (no ordering guarantees are provided; Open MPI will evaluate each component in some random order). If it does, the component's directory is added to the list of directories to traverse when building Open MPI. If the component does not want to be built, it is ignored/skipped in the build process.
This default behavior can be changed to one of three other modes if desired, but only if all components in the framework use the configure.m4 method for configuring (see devel-CreateComponent for details):
-
STOP_AT_FIRST
: When evaluating components to see if they want to build, the first component that returns "yes, build me!" will cause OMPI's top-level configure to ignore all the rest of the components in that framework. As such, a framework that specifiesSTOP_AT_FIRST
will have either 0 or 1 components to build. -
STOP_AT_FIRST_PRIORITY
: When evaluating components to see if they want to build, all components with the same priority as the first component to successfully configure will be built. All components of lower priority than the first to succeed will not be allowed to succeed. This is useful for frameworks like ess, where a specific component succeeding indicates that we're in a special environment (like a lightweight OS) and will never need other ess components with lower priority. -
PRIORITY
: Components for the framework will be configured in priority order, with no special selection logic. Users may add their own logic by threading through specific environment variables from configure macro to configure macro. -
DEFAULT
: No special handling of components or order is specified. Being the default, this is rarely used explicitly.
When STOP_AT_FIRST_PRIORITY
, STOP_AT_FIRST
, or PRIORITY
are used, all components must use the configure.m4 method of configuring themselves, and must set an m4 macro named MCA_<project>_<framework>_<component>_PRIORITY
to be an integer value in the range of [0, 100]. Higher priority is better for precedence rules discussed above.
Changing the mode is accomplished by setting the m4 macro named
MCA_<framework>_COMPILE_MODE
. For example:
m4_define(MCA_ompi_foo_CONFIGURE_MODE, STOP_AT_FIRST)
If the framework's configure.m4 file defines an m4 macro named
MCA_<project>_<framework>_CONFIG
, this macro is executed instead of
the OMPI configure main "engine" for evaluating all the components in
the framework.
The MCA_<project>_<framework>_CONFIG
macro is passed two
parameters:
- $1 is the name of the project
- $2 is the name of the framework
The MCA_<project>__<framework>_CONFIG
macro can generally
contain any valid Autoconf / Automake Bourne shell code and macros,
but it must invoke MCA_CONFIGURE_FRAMEWORK($1, $2,
allow_succeed), where "allow_succeed" must be either 0 or 1:
- 0: evaluate all the components, but don't allow any of them to be compiled. This is useful when a framework wants to disable itself, but still needs to go through the motion of calling MCA_CONFIGURE_FRAMEWORK.
- 1: evaluate all the components normally (i.e., some may succeed, some may fail).
In the ompi/mca/foo directory, create a directory named "base". This directory is not a component, but rather all the "glue" code for the framework itself. Note that this code will be compiled into the main library itself (e.g., libopen-pal.so, libopen-rte.so, or libmpi.so). As such, the base directory must create a Libtool convenience library named "libmca_.la" (libmca_foo.la, in this example) that will be included in the upper-level project library.
- It is customary to have a "base.h" file in
ompi/mca/foo/base that contains the "public" functions,
types, etc., for that framework (i.e., "public" meaning "things
that code outside of this framework is allowed to invoke and
use").
- Note that some newer frameworks have started using "public.h" instead of "base.h".
- You need a Makefile.am in the ompi/mca/foo/base directory as well. Again, it may be easiest to copy this file from another framework (e.g., source: trunk/opal/mca/paffinity/base/Makefile.am) and replace all instances of that framework's name with "foo" (e.g., replace "paffinity" with "foo") as a starting template.
- NOTE: The base/Makefile.am file is not a standalone Makefile.am file; it is included by ompi/mca/foo/Makefile.am. Although most frameworks have historically named their base file "Makefile.am", the more "modern" naming methodology is to name the file "Makefile.include". You may need to adjust ompi/mca/foo/Makefile.am to include "Makefile.include" instead of "Makefile.am".
- Edit the Makefile.include to list the source files that you put in the base directory. All .c source files must follow the prefix rule (i.e,. be named "foo_base_.c") so that there will not be .o filename collisions within libraries. Also, since Makefile.include is included, you need to list all the source files relative to the framework top directory. For example, you need to list "foo/base.h" and "foo/foo_base_open.c" (vs. "base.h" and "foo_base_open.c").
- Frameworks need, at a minimum, "open" and "close" functions.
- The framework's "open" function finds and opens components of that framework type; see source: trunk/opal/mca/paffinity/base/paffinity_base_open.c for an example.
- The framework's "close" function closes all components that were previously opened in that framework; see source: trunk/opal/mca/paffinity/base/paffinity_base_close.c for an example.
- Frameworks usually need some type of "select" function as well-- choosing which (if any) of the components that were successfully opened will be used at run-time. Sometimes a framework will only allow one component to be used during a run (e.g., OPAL's paffinity framework); sometimes a framework will allow multiple components to be used during a run (e.g., OMPI's BTL framework). It is up to the framework to decide what its selection policies are. As an example, several frameworks that choose only one component at run-time use "priority"-based selection policy; each component that is able to be successfully opened and accessed returns a numeric priority from 0 to 100. The component with the highest priority "wins" and is used in the job. The others are all closed.
- A framework may choose to put other functionality in the base as well. The general rule of thumb is that if more than one component in that framework will need functionality X, then put a function that performs X in the framework's base.
- Remember that all symbols in the base -- global variables and functions -- must obey the prefix rule. Hence, they all must be prefixed with either "mca_foo_" or "_foo" (where "" is usually "opal", "orte", or "ompi") both are acceptable, although the latter has become more popular recently).
- Some decisions that a framework needs to make:
- What will its component selection policies be? (see above)
- How will the selected components be invoked by the rest of
the code base? The two most common approaches are:
- For frameworks that only select one component at run-time, put all the function pointers to the module in a global struct named "_foo" that contains a function pointer for every module method. The rest of the OMPI code base then invokes the selected module's methods via "_foo.method_name(...)."
- For frameworks that select multiple components to use at runtime, provide public "wrapper" functions in the framework base that dispatch off to the selected modules at runtime, and typically exposes these functions through its "base.h" header file. In this way, the framework hides its components and modules from the rest of the code base (which is good for preserving abstraction barriers).
- Remember that components do not share code with each other. The only way for multiple components to interact is to use common code in their framework's base or elsewhere in the library. Failure to obey this rule will be swiftly and unmercifully punished by the linker.
Once the base is complete, you need to make one or more components. See the devel-CreateComponent wiki page for more details.
- You need to call your framework's open, close, and select functions from the appropriate project startup and shutdown functions (these functions should be located in the framework base/ directory). Depending on which project your framework is in, the appropriate .c files you'll need to edit are listed below. Insert the call to _foo_open() and _foo_close() in the appropriate location in these files. Be sure to obey framework open / close order dependencies (the close sequence should likely be the opposite of the open sequence):
- Finally, you need to tell ompi_info about
your framework. Work is in progress to make this automatic, but
for now, you need to hand-edit the ompi_info source code as
follows (all in the source: trunk/ompi/tools/ompi_info
directory):
- In [source:trunk/ompi/tools/ompi_info/ompi_info.c, add a "opal_pointer_array_add" call for your framework name (look for all the other "opal_pointer_array_add" calls with framework names; it's fairly obvious). Please insert your framework with all the other frameworks in your project.
- Add the appropriate framework header files to source: trunk/ompi/tools/ompi_info/components.c.
- In source: trunk/ompi/tools/ompi_info/components.c ompi_info_open_components(), add call to _foo_open() and save the public list of components that was opened (this list should be maintained by a public symbol in your framework's base). Be sure to obey framework open order dependencies.
- In source: trunk/ompi/tools/ompi_info/components.c ompi_info_close_components(), add call to _foo_close(). Be sure to obey framework close order dependencies (it should typically be the opposite order of open).
OMPI's master branch relies on the PMIx Reference RunTime Environment or "PRRTE" project for its "mpirun" launcher program. The PRRTE command line accepts MCA parameters in two forms:
-
project-specific params. There are three projects (PMIx, PRRTE, and OMPI) active within an OMPI job, and each of them utilize the MCA infrastructure. You can specify that a particular param targets a given project by using the corresponding cmd line option. For example, PMIx params can be indicated with the "--pmixmca foo bar" syntax, while PRRTE params would utilize the "--prtemca foo bar" syntax. Likewise, OMPI params are indicated by the "--omca foo bar" option.
-
generic params. If the generic "--mca foo bar" syntax is used, then PRRTE will attempt to "do the right thing" by looking up the "foo" framework in its project-specific components that parse the cmd line. This is done on a per-framework basis - i.e., PRRTE will look for the "foo" framework in a lookup table in each project's component, and associate the param with the project where that framework entry is found.
It is therefore important that you "register" your framework with PRRTE so that generic param references can be identified and properly supported. This is done by adding the framework name to the schizo/ompi component - you'll find the existing frameworks already in the table, so simply adding an entry for your new framework is all that is required.