| ==================== | 
 | Writing an LLVM Pass | 
 | ==================== | 
 |  | 
 | .. contents:: | 
 |     :local: | 
 |  | 
 | Introduction --- What is a pass? | 
 | ================================ | 
 |  | 
 | The LLVM Pass Framework is an important part of the LLVM system, because LLVM | 
 | passes are where most of the interesting parts of the compiler exist.  Passes | 
 | perform the transformations and optimizations that make up the compiler, they | 
 | build the analysis results that are used by these transformations, and they | 
 | are, above all, a structuring technique for compiler code. | 
 |  | 
 | All LLVM passes are subclasses of the `Pass | 
 | <http://llvm.org/doxygen/classllvm_1_1Pass.html>`_ class, which implement | 
 | functionality by overriding virtual methods inherited from ``Pass``.  Depending | 
 | on how your pass works, you should inherit from the :ref:`ModulePass | 
 | <writing-an-llvm-pass-ModulePass>` , :ref:`CallGraphSCCPass | 
 | <writing-an-llvm-pass-CallGraphSCCPass>`, :ref:`FunctionPass | 
 | <writing-an-llvm-pass-FunctionPass>` , or :ref:`LoopPass | 
 | <writing-an-llvm-pass-LoopPass>`, or :ref:`RegionPass | 
 | <writing-an-llvm-pass-RegionPass>`, or :ref:`BasicBlockPass | 
 | <writing-an-llvm-pass-BasicBlockPass>` classes, which gives the system more | 
 | information about what your pass does, and how it can be combined with other | 
 | passes.  One of the main features of the LLVM Pass Framework is that it | 
 | schedules passes to run in an efficient way based on the constraints that your | 
 | pass meets (which are indicated by which class they derive from). | 
 |  | 
 | We start by showing you how to construct a pass, everything from setting up the | 
 | code, to compiling, loading, and executing it.  After the basics are down, more | 
 | advanced features are discussed. | 
 |  | 
 | Quick Start --- Writing hello world | 
 | =================================== | 
 |  | 
 | Here we describe how to write the "hello world" of passes.  The "Hello" pass is | 
 | designed to simply print out the name of non-external functions that exist in | 
 | the program being compiled.  It does not modify the program at all, it just | 
 | inspects it.  The source code and files for this pass are available in the LLVM | 
 | source tree in the ``lib/Transforms/Hello`` directory. | 
 |  | 
 | .. _writing-an-llvm-pass-makefile: | 
 |  | 
 | Setting up the build environment | 
 | -------------------------------- | 
 |  | 
 | First, configure and build LLVM.  Next, you need to create a new directory | 
 | somewhere in the LLVM source base.  For this example, we'll assume that you | 
 | made ``lib/Transforms/Hello``.  Finally, you must set up a build script | 
 | that will compile the source code for the new pass.  To do this, | 
 | copy the following into ``CMakeLists.txt``: | 
 |  | 
 | .. code-block:: cmake | 
 |  | 
 |   add_llvm_loadable_module( LLVMHello | 
 |     Hello.cpp | 
 |    | 
 |     PLUGIN_TOOL | 
 |     opt | 
 |     ) | 
 |  | 
 | and the following line into ``lib/Transforms/CMakeLists.txt``: | 
 |  | 
 | .. code-block:: cmake | 
 |  | 
 |   add_subdirectory(Hello) | 
 |  | 
 | (Note that there is already a directory named ``Hello`` with a sample "Hello" | 
 | pass; you may play with it -- in which case you don't need to modify any | 
 | ``CMakeLists.txt`` files -- or, if you want to create everything from scratch, | 
 | use another name.) | 
 |  | 
 | This build script specifies that ``Hello.cpp`` file in the current directory | 
 | is to be compiled and linked into a shared object ``$(LEVEL)/lib/LLVMHello.so`` that | 
 | can be dynamically loaded by the :program:`opt` tool via its :option:`-load` | 
 | option. If your operating system uses a suffix other than ``.so`` (such as | 
 | Windows or Mac OS X), the appropriate extension will be used. | 
 |  | 
 | Now that we have the build scripts set up, we just need to write the code for | 
 | the pass itself. | 
 |  | 
 | .. _writing-an-llvm-pass-basiccode: | 
 |  | 
 | Basic code required | 
 | ------------------- | 
 |  | 
 | Now that we have a way to compile our new pass, we just have to write it. | 
 | Start out with: | 
 |  | 
 | .. code-block:: c++ | 
 |  | 
 |   #include "llvm/Pass.h" | 
 |   #include "llvm/IR/Function.h" | 
 |   #include "llvm/Support/raw_ostream.h" | 
 |  | 
 | Which are needed because we are writing a `Pass | 
 | <http://llvm.org/doxygen/classllvm_1_1Pass.html>`_, we are operating on | 
 | `Function <http://llvm.org/doxygen/classllvm_1_1Function.html>`_\ s, and we will | 
 | be doing some printing. | 
 |  | 
 | Next we have: | 
 |  | 
 | .. code-block:: c++ | 
 |  | 
 |   using namespace llvm; | 
 |  | 
 | ... which is required because the functions from the include files live in the | 
 | llvm namespace. | 
 |  | 
 | Next we have: | 
 |  | 
 | .. code-block:: c++ | 
 |  | 
 |   namespace { | 
 |  | 
 | ... which starts out an anonymous namespace.  Anonymous namespaces are to C++ | 
 | what the "``static``" keyword is to C (at global scope).  It makes the things | 
 | declared inside of the anonymous namespace visible only to the current file. | 
 | If you're not familiar with them, consult a decent C++ book for more | 
 | information. | 
 |  | 
 | Next, we declare our pass itself: | 
 |  | 
 | .. code-block:: c++ | 
 |  | 
 |   struct Hello : public FunctionPass { | 
 |  | 
 | This declares a "``Hello``" class that is a subclass of :ref:`FunctionPass | 
 | <writing-an-llvm-pass-FunctionPass>`.  The different builtin pass subclasses | 
 | are described in detail :ref:`later <writing-an-llvm-pass-pass-classes>`, but | 
 | for now, know that ``FunctionPass`` operates on a function at a time. | 
 |  | 
 | .. code-block:: c++ | 
 |  | 
 |     static char ID; | 
 |     Hello() : FunctionPass(ID) {} | 
 |  | 
 | This declares pass identifier used by LLVM to identify pass.  This allows LLVM | 
 | to avoid using expensive C++ runtime information. | 
 |  | 
 | .. code-block:: c++ | 
 |  | 
 |     bool runOnFunction(Function &F) override { | 
 |       errs() << "Hello: "; | 
 |       errs().write_escaped(F.getName()) << '\n'; | 
 |       return false; | 
 |     } | 
 |   }; // end of struct Hello | 
 |   }  // end of anonymous namespace | 
 |  | 
 | We declare a :ref:`runOnFunction <writing-an-llvm-pass-runOnFunction>` method, | 
 | which overrides an abstract virtual method inherited from :ref:`FunctionPass | 
 | <writing-an-llvm-pass-FunctionPass>`.  This is where we are supposed to do our | 
 | thing, so we just print out our message with the name of each function. | 
 |  | 
 | .. code-block:: c++ | 
 |  | 
 |   char Hello::ID = 0; | 
 |  | 
 | We initialize pass ID here.  LLVM uses ID's address to identify a pass, so | 
 | initialization value is not important. | 
 |  | 
 | .. code-block:: c++ | 
 |  | 
 |   static RegisterPass<Hello> X("hello", "Hello World Pass", | 
 |                                false /* Only looks at CFG */, | 
 |                                false /* Analysis Pass */); | 
 |  | 
 | Lastly, we :ref:`register our class <writing-an-llvm-pass-registration>` | 
 | ``Hello``, giving it a command line argument "``hello``", and a name "Hello | 
 | World Pass".  The last two arguments describe its behavior: if a pass walks CFG | 
 | without modifying it then the third argument is set to ``true``; if a pass is | 
 | an analysis pass, for example dominator tree pass, then ``true`` is supplied as | 
 | the fourth argument. | 
 |  | 
 | As a whole, the ``.cpp`` file looks like: | 
 |  | 
 | .. code-block:: c++ | 
 |  | 
 |   #include "llvm/Pass.h" | 
 |   #include "llvm/IR/Function.h" | 
 |   #include "llvm/Support/raw_ostream.h" | 
 |    | 
 |   using namespace llvm; | 
 |    | 
 |   namespace { | 
 |   struct Hello : public FunctionPass { | 
 |     static char ID; | 
 |     Hello() : FunctionPass(ID) {} | 
 |    | 
 |     bool runOnFunction(Function &F) override { | 
 |       errs() << "Hello: "; | 
 |       errs().write_escaped(F.getName()) << '\n'; | 
 |       return false; | 
 |     } | 
 |   }; // end of struct Hello | 
 |   }  // end of anonymous namespace | 
 |    | 
 |   char Hello::ID = 0; | 
 |   static RegisterPass<Hello> X("hello", "Hello World Pass", | 
 |                                false /* Only looks at CFG */, | 
 |                                false /* Analysis Pass */); | 
 |  | 
 | Now that it's all together, compile the file with a simple "``gmake``" command | 
 | from the top level of your build directory and you should get a new file | 
 | "``lib/LLVMHello.so``".  Note that everything in this file is | 
 | contained in an anonymous namespace --- this reflects the fact that passes | 
 | are self contained units that do not need external interfaces (although they | 
 | can have them) to be useful. | 
 |  | 
 | Running a pass with ``opt`` | 
 | --------------------------- | 
 |  | 
 | Now that you have a brand new shiny shared object file, we can use the | 
 | :program:`opt` command to run an LLVM program through your pass.  Because you | 
 | registered your pass with ``RegisterPass``, you will be able to use the | 
 | :program:`opt` tool to access it, once loaded. | 
 |  | 
 | To test it, follow the example at the end of the :doc:`GettingStarted` to | 
 | compile "Hello World" to LLVM.  We can now run the bitcode file (hello.bc) for | 
 | the program through our transformation like this (or course, any bitcode file | 
 | will work): | 
 |  | 
 | .. code-block:: console | 
 |  | 
 |   $ opt -load lib/LLVMHello.so -hello < hello.bc > /dev/null | 
 |   Hello: __main | 
 |   Hello: puts | 
 |   Hello: main | 
 |  | 
 | The :option:`-load` option specifies that :program:`opt` should load your pass | 
 | as a shared object, which makes "``-hello``" a valid command line argument | 
 | (which is one reason you need to :ref:`register your pass | 
 | <writing-an-llvm-pass-registration>`).  Because the Hello pass does not modify | 
 | the program in any interesting way, we just throw away the result of | 
 | :program:`opt` (sending it to ``/dev/null``). | 
 |  | 
 | To see what happened to the other string you registered, try running | 
 | :program:`opt` with the :option:`-help` option: | 
 |  | 
 | .. code-block:: console | 
 |  | 
 |   $ opt -load lib/LLVMHello.so -help | 
 |   OVERVIEW: llvm .bc -> .bc modular optimizer and analysis printer | 
 |  | 
 |   USAGE: opt [subcommand] [options] <input bitcode file> | 
 |  | 
 |   OPTIONS: | 
 |     Optimizations available: | 
 |   ... | 
 |       -guard-widening           - Widen guards | 
 |       -gvn                      - Global Value Numbering | 
 |       -gvn-hoist                - Early GVN Hoisting of Expressions | 
 |       -hello                    - Hello World Pass | 
 |       -indvars                  - Induction Variable Simplification | 
 |       -inferattrs               - Infer set function attributes | 
 |   ... | 
 |  | 
 | The pass name gets added as the information string for your pass, giving some | 
 | documentation to users of :program:`opt`.  Now that you have a working pass, | 
 | you would go ahead and make it do the cool transformations you want.  Once you | 
 | get it all working and tested, it may become useful to find out how fast your | 
 | pass is.  The :ref:`PassManager <writing-an-llvm-pass-passmanager>` provides a | 
 | nice command line option (:option:`--time-passes`) that allows you to get | 
 | information about the execution time of your pass along with the other passes | 
 | you queue up.  For example: | 
 |  | 
 | .. code-block:: console | 
 |  | 
 |   $ opt -load lib/LLVMHello.so -hello -time-passes < hello.bc > /dev/null | 
 |   Hello: __main | 
 |   Hello: puts | 
 |   Hello: main | 
 |   ===-------------------------------------------------------------------------=== | 
 |                         ... Pass execution timing report ... | 
 |   ===-------------------------------------------------------------------------=== | 
 |     Total Execution Time: 0.0007 seconds (0.0005 wall clock) | 
 |    | 
 |      ---User Time---   --User+System--   ---Wall Time---  --- Name --- | 
 |      0.0004 ( 55.3%)   0.0004 ( 55.3%)   0.0004 ( 75.7%)  Bitcode Writer | 
 |      0.0003 ( 44.7%)   0.0003 ( 44.7%)   0.0001 ( 13.6%)  Hello World Pass | 
 |      0.0000 (  0.0%)   0.0000 (  0.0%)   0.0001 ( 10.7%)  Module Verifier | 
 |      0.0007 (100.0%)   0.0007 (100.0%)   0.0005 (100.0%)  Total | 
 |  | 
 | As you can see, our implementation above is pretty fast.  The additional | 
 | passes listed are automatically inserted by the :program:`opt` tool to verify | 
 | that the LLVM emitted by your pass is still valid and well formed LLVM, which | 
 | hasn't been broken somehow. | 
 |  | 
 | Now that you have seen the basics of the mechanics behind passes, we can talk | 
 | about some more details of how they work and how to use them. | 
 |  | 
 | .. _writing-an-llvm-pass-pass-classes: | 
 |  | 
 | Pass classes and requirements | 
 | ============================= | 
 |  | 
 | One of the first things that you should do when designing a new pass is to | 
 | decide what class you should subclass for your pass.  The :ref:`Hello World | 
 | <writing-an-llvm-pass-basiccode>` example uses the :ref:`FunctionPass | 
 | <writing-an-llvm-pass-FunctionPass>` class for its implementation, but we did | 
 | not discuss why or when this should occur.  Here we talk about the classes | 
 | available, from the most general to the most specific. | 
 |  | 
 | When choosing a superclass for your ``Pass``, you should choose the **most | 
 | specific** class possible, while still being able to meet the requirements | 
 | listed.  This gives the LLVM Pass Infrastructure information necessary to | 
 | optimize how passes are run, so that the resultant compiler isn't unnecessarily | 
 | slow. | 
 |  | 
 | The ``ImmutablePass`` class | 
 | --------------------------- | 
 |  | 
 | The most plain and boring type of pass is the "`ImmutablePass | 
 | <http://llvm.org/doxygen/classllvm_1_1ImmutablePass.html>`_" class.  This pass | 
 | type is used for passes that do not have to be run, do not change state, and | 
 | never need to be updated.  This is not a normal type of transformation or | 
 | analysis, but can provide information about the current compiler configuration. | 
 |  | 
 | Although this pass class is very infrequently used, it is important for | 
 | providing information about the current target machine being compiled for, and | 
 | other static information that can affect the various transformations. | 
 |  | 
 | ``ImmutablePass``\ es never invalidate other transformations, are never | 
 | invalidated, and are never "run". | 
 |  | 
 | .. _writing-an-llvm-pass-ModulePass: | 
 |  | 
 | The ``ModulePass`` class | 
 | ------------------------ | 
 |  | 
 | The `ModulePass <http://llvm.org/doxygen/classllvm_1_1ModulePass.html>`_ class | 
 | is the most general of all superclasses that you can use.  Deriving from | 
 | ``ModulePass`` indicates that your pass uses the entire program as a unit, | 
 | referring to function bodies in no predictable order, or adding and removing | 
 | functions.  Because nothing is known about the behavior of ``ModulePass`` | 
 | subclasses, no optimization can be done for their execution. | 
 |  | 
 | A module pass can use function level passes (e.g. dominators) using the | 
 | ``getAnalysis`` interface ``getAnalysis<DominatorTree>(llvm::Function *)`` to | 
 | provide the function to retrieve analysis result for, if the function pass does | 
 | not require any module or immutable passes.  Note that this can only be done | 
 | for functions for which the analysis ran, e.g. in the case of dominators you | 
 | should only ask for the ``DominatorTree`` for function definitions, not | 
 | declarations. | 
 |  | 
 | To write a correct ``ModulePass`` subclass, derive from ``ModulePass`` and | 
 | overload the ``runOnModule`` method with the following signature: | 
 |  | 
 | The ``runOnModule`` method | 
 | ^^^^^^^^^^^^^^^^^^^^^^^^^^ | 
 |  | 
 | .. code-block:: c++ | 
 |  | 
 |   virtual bool runOnModule(Module &M) = 0; | 
 |  | 
 | The ``runOnModule`` method performs the interesting work of the pass.  It | 
 | should return ``true`` if the module was modified by the transformation and | 
 | ``false`` otherwise. | 
 |  | 
 | .. _writing-an-llvm-pass-CallGraphSCCPass: | 
 |  | 
 | The ``CallGraphSCCPass`` class | 
 | ------------------------------ | 
 |  | 
 | The `CallGraphSCCPass | 
 | <http://llvm.org/doxygen/classllvm_1_1CallGraphSCCPass.html>`_ is used by | 
 | passes that need to traverse the program bottom-up on the call graph (callees | 
 | before callers).  Deriving from ``CallGraphSCCPass`` provides some mechanics | 
 | for building and traversing the ``CallGraph``, but also allows the system to | 
 | optimize execution of ``CallGraphSCCPass``\ es.  If your pass meets the | 
 | requirements outlined below, and doesn't meet the requirements of a | 
 | :ref:`FunctionPass <writing-an-llvm-pass-FunctionPass>` or :ref:`BasicBlockPass | 
 | <writing-an-llvm-pass-BasicBlockPass>`, you should derive from | 
 | ``CallGraphSCCPass``. | 
 |  | 
 | ``TODO``: explain briefly what SCC, Tarjan's algo, and B-U mean. | 
 |  | 
 | To be explicit, CallGraphSCCPass subclasses are: | 
 |  | 
 | #. ... *not allowed* to inspect or modify any ``Function``\ s other than those | 
 |    in the current SCC and the direct callers and direct callees of the SCC. | 
 | #. ... *required* to preserve the current ``CallGraph`` object, updating it to | 
 |    reflect any changes made to the program. | 
 | #. ... *not allowed* to add or remove SCC's from the current Module, though | 
 |    they may change the contents of an SCC. | 
 | #. ... *allowed* to add or remove global variables from the current Module. | 
 | #. ... *allowed* to maintain state across invocations of :ref:`runOnSCC | 
 |    <writing-an-llvm-pass-runOnSCC>` (including global data). | 
 |  | 
 | Implementing a ``CallGraphSCCPass`` is slightly tricky in some cases because it | 
 | has to handle SCCs with more than one node in it.  All of the virtual methods | 
 | described below should return ``true`` if they modified the program, or | 
 | ``false`` if they didn't. | 
 |  | 
 | The ``doInitialization(CallGraph &)`` method | 
 | ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ | 
 |  | 
 | .. code-block:: c++ | 
 |  | 
 |   virtual bool doInitialization(CallGraph &CG); | 
 |  | 
 | The ``doInitialization`` method is allowed to do most of the things that | 
 | ``CallGraphSCCPass``\ es are not allowed to do.  They can add and remove | 
 | functions, get pointers to functions, etc.  The ``doInitialization`` method is | 
 | designed to do simple initialization type of stuff that does not depend on the | 
 | SCCs being processed.  The ``doInitialization`` method call is not scheduled to | 
 | overlap with any other pass executions (thus it should be very fast). | 
 |  | 
 | .. _writing-an-llvm-pass-runOnSCC: | 
 |  | 
 | The ``runOnSCC`` method | 
 | ^^^^^^^^^^^^^^^^^^^^^^^ | 
 |  | 
 | .. code-block:: c++ | 
 |  | 
 |   virtual bool runOnSCC(CallGraphSCC &SCC) = 0; | 
 |  | 
 | The ``runOnSCC`` method performs the interesting work of the pass, and should | 
 | return ``true`` if the module was modified by the transformation, ``false`` | 
 | otherwise. | 
 |  | 
 | The ``doFinalization(CallGraph &)`` method | 
 | ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ | 
 |  | 
 | .. code-block:: c++ | 
 |  | 
 |   virtual bool doFinalization(CallGraph &CG); | 
 |  | 
 | The ``doFinalization`` method is an infrequently used method that is called | 
 | when the pass framework has finished calling :ref:`runOnSCC | 
 | <writing-an-llvm-pass-runOnSCC>` for every SCC in the program being compiled. | 
 |  | 
 | .. _writing-an-llvm-pass-FunctionPass: | 
 |  | 
 | The ``FunctionPass`` class | 
 | -------------------------- | 
 |  | 
 | In contrast to ``ModulePass`` subclasses, `FunctionPass | 
 | <http://llvm.org/doxygen/classllvm_1_1Pass.html>`_ subclasses do have a | 
 | predictable, local behavior that can be expected by the system.  All | 
 | ``FunctionPass`` execute on each function in the program independent of all of | 
 | the other functions in the program.  ``FunctionPass``\ es do not require that | 
 | they are executed in a particular order, and ``FunctionPass``\ es do not modify | 
 | external functions. | 
 |  | 
 | To be explicit, ``FunctionPass`` subclasses are not allowed to: | 
 |  | 
 | #. Inspect or modify a ``Function`` other than the one currently being processed. | 
 | #. Add or remove ``Function``\ s from the current ``Module``. | 
 | #. Add or remove global variables from the current ``Module``. | 
 | #. Maintain state across invocations of :ref:`runOnFunction | 
 |    <writing-an-llvm-pass-runOnFunction>` (including global data). | 
 |  | 
 | Implementing a ``FunctionPass`` is usually straightforward (See the :ref:`Hello | 
 | World <writing-an-llvm-pass-basiccode>` pass for example). | 
 | ``FunctionPass``\ es may overload three virtual methods to do their work.  All | 
 | of these methods should return ``true`` if they modified the program, or | 
 | ``false`` if they didn't. | 
 |  | 
 | .. _writing-an-llvm-pass-doInitialization-mod: | 
 |  | 
 | The ``doInitialization(Module &)`` method | 
 | ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ | 
 |  | 
 | .. code-block:: c++ | 
 |  | 
 |   virtual bool doInitialization(Module &M); | 
 |  | 
 | The ``doInitialization`` method is allowed to do most of the things that | 
 | ``FunctionPass``\ es are not allowed to do.  They can add and remove functions, | 
 | get pointers to functions, etc.  The ``doInitialization`` method is designed to | 
 | do simple initialization type of stuff that does not depend on the functions | 
 | being processed.  The ``doInitialization`` method call is not scheduled to | 
 | overlap with any other pass executions (thus it should be very fast). | 
 |  | 
 | A good example of how this method should be used is the `LowerAllocations | 
 | <http://llvm.org/doxygen/LowerAllocations_8cpp-source.html>`_ pass.  This pass | 
 | converts ``malloc`` and ``free`` instructions into platform dependent | 
 | ``malloc()`` and ``free()`` function calls.  It uses the ``doInitialization`` | 
 | method to get a reference to the ``malloc`` and ``free`` functions that it | 
 | needs, adding prototypes to the module if necessary. | 
 |  | 
 | .. _writing-an-llvm-pass-runOnFunction: | 
 |  | 
 | The ``runOnFunction`` method | 
 | ^^^^^^^^^^^^^^^^^^^^^^^^^^^^ | 
 |  | 
 | .. code-block:: c++ | 
 |  | 
 |   virtual bool runOnFunction(Function &F) = 0; | 
 |  | 
 | The ``runOnFunction`` method must be implemented by your subclass to do the | 
 | transformation or analysis work of your pass.  As usual, a ``true`` value | 
 | should be returned if the function is modified. | 
 |  | 
 | .. _writing-an-llvm-pass-doFinalization-mod: | 
 |  | 
 | The ``doFinalization(Module &)`` method | 
 | ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ | 
 |  | 
 | .. code-block:: c++ | 
 |  | 
 |   virtual bool doFinalization(Module &M); | 
 |  | 
 | The ``doFinalization`` method is an infrequently used method that is called | 
 | when the pass framework has finished calling :ref:`runOnFunction | 
 | <writing-an-llvm-pass-runOnFunction>` for every function in the program being | 
 | compiled. | 
 |  | 
 | .. _writing-an-llvm-pass-LoopPass: | 
 |  | 
 | The ``LoopPass`` class | 
 | ---------------------- | 
 |  | 
 | All ``LoopPass`` execute on each loop in the function independent of all of the | 
 | other loops in the function.  ``LoopPass`` processes loops in loop nest order | 
 | such that outer most loop is processed last. | 
 |  | 
 | ``LoopPass`` subclasses are allowed to update loop nest using ``LPPassManager`` | 
 | interface.  Implementing a loop pass is usually straightforward. | 
 | ``LoopPass``\ es may overload three virtual methods to do their work.  All | 
 | these methods should return ``true`` if they modified the program, or ``false`` | 
 | if they didn't. | 
 |  | 
 | A ``LoopPass`` subclass which is intended to run as part of the main loop pass | 
 | pipeline needs to preserve all of the same *function* analyses that the other | 
 | loop passes in its pipeline require. To make that easier, | 
 | a ``getLoopAnalysisUsage`` function is provided by ``LoopUtils.h``. It can be | 
 | called within the subclass's ``getAnalysisUsage`` override to get consistent | 
 | and correct behavior. Analogously, ``INITIALIZE_PASS_DEPENDENCY(LoopPass)`` | 
 | will initialize this set of function analyses. | 
 |  | 
 | The ``doInitialization(Loop *, LPPassManager &)`` method | 
 | ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ | 
 |  | 
 | .. code-block:: c++ | 
 |  | 
 |   virtual bool doInitialization(Loop *, LPPassManager &LPM); | 
 |  | 
 | The ``doInitialization`` method is designed to do simple initialization type of | 
 | stuff that does not depend on the functions being processed.  The | 
 | ``doInitialization`` method call is not scheduled to overlap with any other | 
 | pass executions (thus it should be very fast).  ``LPPassManager`` interface | 
 | should be used to access ``Function`` or ``Module`` level analysis information. | 
 |  | 
 | .. _writing-an-llvm-pass-runOnLoop: | 
 |  | 
 | The ``runOnLoop`` method | 
 | ^^^^^^^^^^^^^^^^^^^^^^^^ | 
 |  | 
 | .. code-block:: c++ | 
 |  | 
 |   virtual bool runOnLoop(Loop *, LPPassManager &LPM) = 0; | 
 |  | 
 | The ``runOnLoop`` method must be implemented by your subclass to do the | 
 | transformation or analysis work of your pass.  As usual, a ``true`` value | 
 | should be returned if the function is modified.  ``LPPassManager`` interface | 
 | should be used to update loop nest. | 
 |  | 
 | The ``doFinalization()`` method | 
 | ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ | 
 |  | 
 | .. code-block:: c++ | 
 |  | 
 |   virtual bool doFinalization(); | 
 |  | 
 | The ``doFinalization`` method is an infrequently used method that is called | 
 | when the pass framework has finished calling :ref:`runOnLoop | 
 | <writing-an-llvm-pass-runOnLoop>` for every loop in the program being compiled. | 
 |  | 
 | .. _writing-an-llvm-pass-RegionPass: | 
 |  | 
 | The ``RegionPass`` class | 
 | ------------------------ | 
 |  | 
 | ``RegionPass`` is similar to :ref:`LoopPass <writing-an-llvm-pass-LoopPass>`, | 
 | but executes on each single entry single exit region in the function. | 
 | ``RegionPass`` processes regions in nested order such that the outer most | 
 | region is processed last. | 
 |  | 
 | ``RegionPass`` subclasses are allowed to update the region tree by using the | 
 | ``RGPassManager`` interface.  You may overload three virtual methods of | 
 | ``RegionPass`` to implement your own region pass.  All these methods should | 
 | return ``true`` if they modified the program, or ``false`` if they did not. | 
 |  | 
 | The ``doInitialization(Region *, RGPassManager &)`` method | 
 | ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ | 
 |  | 
 | .. code-block:: c++ | 
 |  | 
 |   virtual bool doInitialization(Region *, RGPassManager &RGM); | 
 |  | 
 | The ``doInitialization`` method is designed to do simple initialization type of | 
 | stuff that does not depend on the functions being processed.  The | 
 | ``doInitialization`` method call is not scheduled to overlap with any other | 
 | pass executions (thus it should be very fast).  ``RPPassManager`` interface | 
 | should be used to access ``Function`` or ``Module`` level analysis information. | 
 |  | 
 | .. _writing-an-llvm-pass-runOnRegion: | 
 |  | 
 | The ``runOnRegion`` method | 
 | ^^^^^^^^^^^^^^^^^^^^^^^^^^ | 
 |  | 
 | .. code-block:: c++ | 
 |  | 
 |   virtual bool runOnRegion(Region *, RGPassManager &RGM) = 0; | 
 |  | 
 | The ``runOnRegion`` method must be implemented by your subclass to do the | 
 | transformation or analysis work of your pass.  As usual, a true value should be | 
 | returned if the region is modified.  ``RGPassManager`` interface should be used to | 
 | update region tree. | 
 |  | 
 | The ``doFinalization()`` method | 
 | ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ | 
 |  | 
 | .. code-block:: c++ | 
 |  | 
 |   virtual bool doFinalization(); | 
 |  | 
 | The ``doFinalization`` method is an infrequently used method that is called | 
 | when the pass framework has finished calling :ref:`runOnRegion | 
 | <writing-an-llvm-pass-runOnRegion>` for every region in the program being | 
 | compiled. | 
 |  | 
 | .. _writing-an-llvm-pass-BasicBlockPass: | 
 |  | 
 | The ``BasicBlockPass`` class | 
 | ---------------------------- | 
 |  | 
 | ``BasicBlockPass``\ es are just like :ref:`FunctionPass's | 
 | <writing-an-llvm-pass-FunctionPass>` , except that they must limit their scope | 
 | of inspection and modification to a single basic block at a time.  As such, | 
 | they are **not** allowed to do any of the following: | 
 |  | 
 | #. Modify or inspect any basic blocks outside of the current one. | 
 | #. Maintain state across invocations of :ref:`runOnBasicBlock | 
 |    <writing-an-llvm-pass-runOnBasicBlock>`. | 
 | #. Modify the control flow graph (by altering terminator instructions) | 
 | #. Any of the things forbidden for :ref:`FunctionPasses | 
 |    <writing-an-llvm-pass-FunctionPass>`. | 
 |  | 
 | ``BasicBlockPass``\ es are useful for traditional local and "peephole" | 
 | optimizations.  They may override the same :ref:`doInitialization(Module &) | 
 | <writing-an-llvm-pass-doInitialization-mod>` and :ref:`doFinalization(Module &) | 
 | <writing-an-llvm-pass-doFinalization-mod>` methods that :ref:`FunctionPass's | 
 | <writing-an-llvm-pass-FunctionPass>` have, but also have the following virtual | 
 | methods that may also be implemented: | 
 |  | 
 | The ``doInitialization(Function &)`` method | 
 | ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ | 
 |  | 
 | .. code-block:: c++ | 
 |  | 
 |   virtual bool doInitialization(Function &F); | 
 |  | 
 | The ``doInitialization`` method is allowed to do most of the things that | 
 | ``BasicBlockPass``\ es are not allowed to do, but that ``FunctionPass``\ es | 
 | can.  The ``doInitialization`` method is designed to do simple initialization | 
 | that does not depend on the ``BasicBlock``\ s being processed.  The | 
 | ``doInitialization`` method call is not scheduled to overlap with any other | 
 | pass executions (thus it should be very fast). | 
 |  | 
 | .. _writing-an-llvm-pass-runOnBasicBlock: | 
 |  | 
 | The ``runOnBasicBlock`` method | 
 | ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ | 
 |  | 
 | .. code-block:: c++ | 
 |  | 
 |   virtual bool runOnBasicBlock(BasicBlock &BB) = 0; | 
 |  | 
 | Override this function to do the work of the ``BasicBlockPass``.  This function | 
 | is not allowed to inspect or modify basic blocks other than the parameter, and | 
 | are not allowed to modify the CFG.  A ``true`` value must be returned if the | 
 | basic block is modified. | 
 |  | 
 | The ``doFinalization(Function &)`` method | 
 | ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ | 
 |  | 
 | .. code-block:: c++ | 
 |  | 
 |     virtual bool doFinalization(Function &F); | 
 |  | 
 | The ``doFinalization`` method is an infrequently used method that is called | 
 | when the pass framework has finished calling :ref:`runOnBasicBlock | 
 | <writing-an-llvm-pass-runOnBasicBlock>` for every ``BasicBlock`` in the program | 
 | being compiled.  This can be used to perform per-function finalization. | 
 |  | 
 | The ``MachineFunctionPass`` class | 
 | --------------------------------- | 
 |  | 
 | A ``MachineFunctionPass`` is a part of the LLVM code generator that executes on | 
 | the machine-dependent representation of each LLVM function in the program. | 
 |  | 
 | Code generator passes are registered and initialized specially by | 
 | ``TargetMachine::addPassesToEmitFile`` and similar routines, so they cannot | 
 | generally be run from the :program:`opt` or :program:`bugpoint` commands. | 
 |  | 
 | A ``MachineFunctionPass`` is also a ``FunctionPass``, so all the restrictions | 
 | that apply to a ``FunctionPass`` also apply to it.  ``MachineFunctionPass``\ es | 
 | also have additional restrictions.  In particular, ``MachineFunctionPass``\ es | 
 | are not allowed to do any of the following: | 
 |  | 
 | #. Modify or create any LLVM IR ``Instruction``\ s, ``BasicBlock``\ s, | 
 |    ``Argument``\ s, ``Function``\ s, ``GlobalVariable``\ s, | 
 |    ``GlobalAlias``\ es, or ``Module``\ s. | 
 | #. Modify a ``MachineFunction`` other than the one currently being processed. | 
 | #. Maintain state across invocations of :ref:`runOnMachineFunction | 
 |    <writing-an-llvm-pass-runOnMachineFunction>` (including global data). | 
 |  | 
 | .. _writing-an-llvm-pass-runOnMachineFunction: | 
 |  | 
 | The ``runOnMachineFunction(MachineFunction &MF)`` method | 
 | ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ | 
 |  | 
 | .. code-block:: c++ | 
 |  | 
 |   virtual bool runOnMachineFunction(MachineFunction &MF) = 0; | 
 |  | 
 | ``runOnMachineFunction`` can be considered the main entry point of a | 
 | ``MachineFunctionPass``; that is, you should override this method to do the | 
 | work of your ``MachineFunctionPass``. | 
 |  | 
 | The ``runOnMachineFunction`` method is called on every ``MachineFunction`` in a | 
 | ``Module``, so that the ``MachineFunctionPass`` may perform optimizations on | 
 | the machine-dependent representation of the function.  If you want to get at | 
 | the LLVM ``Function`` for the ``MachineFunction`` you're working on, use | 
 | ``MachineFunction``'s ``getFunction()`` accessor method --- but remember, you | 
 | may not modify the LLVM ``Function`` or its contents from a | 
 | ``MachineFunctionPass``. | 
 |  | 
 | .. _writing-an-llvm-pass-registration: | 
 |  | 
 | Pass registration | 
 | ----------------- | 
 |  | 
 | In the :ref:`Hello World <writing-an-llvm-pass-basiccode>` example pass we | 
 | illustrated how pass registration works, and discussed some of the reasons that | 
 | it is used and what it does.  Here we discuss how and why passes are | 
 | registered. | 
 |  | 
 | As we saw above, passes are registered with the ``RegisterPass`` template.  The | 
 | template parameter is the name of the pass that is to be used on the command | 
 | line to specify that the pass should be added to a program (for example, with | 
 | :program:`opt` or :program:`bugpoint`).  The first argument is the name of the | 
 | pass, which is to be used for the :option:`-help` output of programs, as well | 
 | as for debug output generated by the `--debug-pass` option. | 
 |  | 
 | If you want your pass to be easily dumpable, you should implement the virtual | 
 | print method: | 
 |  | 
 | The ``print`` method | 
 | ^^^^^^^^^^^^^^^^^^^^ | 
 |  | 
 | .. code-block:: c++ | 
 |  | 
 |   virtual void print(llvm::raw_ostream &O, const Module *M) const; | 
 |  | 
 | The ``print`` method must be implemented by "analyses" in order to print a | 
 | human readable version of the analysis results.  This is useful for debugging | 
 | an analysis itself, as well as for other people to figure out how an analysis | 
 | works.  Use the opt ``-analyze`` argument to invoke this method. | 
 |  | 
 | The ``llvm::raw_ostream`` parameter specifies the stream to write the results | 
 | on, and the ``Module`` parameter gives a pointer to the top level module of the | 
 | program that has been analyzed.  Note however that this pointer may be ``NULL`` | 
 | in certain circumstances (such as calling the ``Pass::dump()`` from a | 
 | debugger), so it should only be used to enhance debug output, it should not be | 
 | depended on. | 
 |  | 
 | .. _writing-an-llvm-pass-interaction: | 
 |  | 
 | Specifying interactions between passes | 
 | -------------------------------------- | 
 |  | 
 | One of the main responsibilities of the ``PassManager`` is to make sure that | 
 | passes interact with each other correctly.  Because ``PassManager`` tries to | 
 | :ref:`optimize the execution of passes <writing-an-llvm-pass-passmanager>` it | 
 | must know how the passes interact with each other and what dependencies exist | 
 | between the various passes.  To track this, each pass can declare the set of | 
 | passes that are required to be executed before the current pass, and the passes | 
 | which are invalidated by the current pass. | 
 |  | 
 | Typically this functionality is used to require that analysis results are | 
 | computed before your pass is run.  Running arbitrary transformation passes can | 
 | invalidate the computed analysis results, which is what the invalidation set | 
 | specifies.  If a pass does not implement the :ref:`getAnalysisUsage | 
 | <writing-an-llvm-pass-getAnalysisUsage>` method, it defaults to not having any | 
 | prerequisite passes, and invalidating **all** other passes. | 
 |  | 
 | .. _writing-an-llvm-pass-getAnalysisUsage: | 
 |  | 
 | The ``getAnalysisUsage`` method | 
 | ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ | 
 |  | 
 | .. code-block:: c++ | 
 |  | 
 |   virtual void getAnalysisUsage(AnalysisUsage &Info) const; | 
 |  | 
 | By implementing the ``getAnalysisUsage`` method, the required and invalidated | 
 | sets may be specified for your transformation.  The implementation should fill | 
 | in the `AnalysisUsage | 
 | <http://llvm.org/doxygen/classllvm_1_1AnalysisUsage.html>`_ object with | 
 | information about which passes are required and not invalidated.  To do this, a | 
 | pass may call any of the following methods on the ``AnalysisUsage`` object: | 
 |  | 
 | The ``AnalysisUsage::addRequired<>`` and ``AnalysisUsage::addRequiredTransitive<>`` methods | 
 | ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ | 
 |  | 
 | If your pass requires a previous pass to be executed (an analysis for example), | 
 | it can use one of these methods to arrange for it to be run before your pass. | 
 | LLVM has many different types of analyses and passes that can be required, | 
 | spanning the range from ``DominatorSet`` to ``BreakCriticalEdges``.  Requiring | 
 | ``BreakCriticalEdges``, for example, guarantees that there will be no critical | 
 | edges in the CFG when your pass has been run. | 
 |  | 
 | Some analyses chain to other analyses to do their job.  For example, an | 
 | `AliasAnalysis <AliasAnalysis>` implementation is required to :ref:`chain | 
 | <aliasanalysis-chaining>` to other alias analysis passes.  In cases where | 
 | analyses chain, the ``addRequiredTransitive`` method should be used instead of | 
 | the ``addRequired`` method.  This informs the ``PassManager`` that the | 
 | transitively required pass should be alive as long as the requiring pass is. | 
 |  | 
 | The ``AnalysisUsage::addPreserved<>`` method | 
 | ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ | 
 |  | 
 | One of the jobs of the ``PassManager`` is to optimize how and when analyses are | 
 | run.  In particular, it attempts to avoid recomputing data unless it needs to. | 
 | For this reason, passes are allowed to declare that they preserve (i.e., they | 
 | don't invalidate) an existing analysis if it's available.  For example, a | 
 | simple constant folding pass would not modify the CFG, so it can't possibly | 
 | affect the results of dominator analysis.  By default, all passes are assumed | 
 | to invalidate all others. | 
 |  | 
 | The ``AnalysisUsage`` class provides several methods which are useful in | 
 | certain circumstances that are related to ``addPreserved``.  In particular, the | 
 | ``setPreservesAll`` method can be called to indicate that the pass does not | 
 | modify the LLVM program at all (which is true for analyses), and the | 
 | ``setPreservesCFG`` method can be used by transformations that change | 
 | instructions in the program but do not modify the CFG or terminator | 
 | instructions (note that this property is implicitly set for | 
 | :ref:`BasicBlockPass <writing-an-llvm-pass-BasicBlockPass>`\ es). | 
 |  | 
 | ``addPreserved`` is particularly useful for transformations like | 
 | ``BreakCriticalEdges``.  This pass knows how to update a small set of loop and | 
 | dominator related analyses if they exist, so it can preserve them, despite the | 
 | fact that it hacks on the CFG. | 
 |  | 
 | Example implementations of ``getAnalysisUsage`` | 
 | ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ | 
 |  | 
 | .. code-block:: c++ | 
 |  | 
 |   // This example modifies the program, but does not modify the CFG | 
 |   void LICM::getAnalysisUsage(AnalysisUsage &AU) const { | 
 |     AU.setPreservesCFG(); | 
 |     AU.addRequired<LoopInfoWrapperPass>(); | 
 |   } | 
 |  | 
 | .. _writing-an-llvm-pass-getAnalysis: | 
 |  | 
 | The ``getAnalysis<>`` and ``getAnalysisIfAvailable<>`` methods | 
 | ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ | 
 |  | 
 | The ``Pass::getAnalysis<>`` method is automatically inherited by your class, | 
 | providing you with access to the passes that you declared that you required | 
 | with the :ref:`getAnalysisUsage <writing-an-llvm-pass-getAnalysisUsage>` | 
 | method.  It takes a single template argument that specifies which pass class | 
 | you want, and returns a reference to that pass.  For example: | 
 |  | 
 | .. code-block:: c++ | 
 |  | 
 |   bool LICM::runOnFunction(Function &F) { | 
 |     LoopInfo &LI = getAnalysis<LoopInfoWrapperPass>().getLoopInfo(); | 
 |     //... | 
 |   } | 
 |  | 
 | This method call returns a reference to the pass desired.  You may get a | 
 | runtime assertion failure if you attempt to get an analysis that you did not | 
 | declare as required in your :ref:`getAnalysisUsage | 
 | <writing-an-llvm-pass-getAnalysisUsage>` implementation.  This method can be | 
 | called by your ``run*`` method implementation, or by any other local method | 
 | invoked by your ``run*`` method. | 
 |  | 
 | A module level pass can use function level analysis info using this interface. | 
 | For example: | 
 |  | 
 | .. code-block:: c++ | 
 |  | 
 |   bool ModuleLevelPass::runOnModule(Module &M) { | 
 |     //... | 
 |     DominatorTree &DT = getAnalysis<DominatorTree>(Func); | 
 |     //... | 
 |   } | 
 |  | 
 | In above example, ``runOnFunction`` for ``DominatorTree`` is called by pass | 
 | manager before returning a reference to the desired pass. | 
 |  | 
 | If your pass is capable of updating analyses if they exist (e.g., | 
 | ``BreakCriticalEdges``, as described above), you can use the | 
 | ``getAnalysisIfAvailable`` method, which returns a pointer to the analysis if | 
 | it is active.  For example: | 
 |  | 
 | .. code-block:: c++ | 
 |  | 
 |   if (DominatorSet *DS = getAnalysisIfAvailable<DominatorSet>()) { | 
 |     // A DominatorSet is active.  This code will update it. | 
 |   } | 
 |  | 
 | Implementing Analysis Groups | 
 | ---------------------------- | 
 |  | 
 | Now that we understand the basics of how passes are defined, how they are used, | 
 | and how they are required from other passes, it's time to get a little bit | 
 | fancier.  All of the pass relationships that we have seen so far are very | 
 | simple: one pass depends on one other specific pass to be run before it can | 
 | run.  For many applications, this is great, for others, more flexibility is | 
 | required. | 
 |  | 
 | In particular, some analyses are defined such that there is a single simple | 
 | interface to the analysis results, but multiple ways of calculating them. | 
 | Consider alias analysis for example.  The most trivial alias analysis returns | 
 | "may alias" for any alias query.  The most sophisticated analysis a | 
 | flow-sensitive, context-sensitive interprocedural analysis that can take a | 
 | significant amount of time to execute (and obviously, there is a lot of room | 
 | between these two extremes for other implementations).  To cleanly support | 
 | situations like this, the LLVM Pass Infrastructure supports the notion of | 
 | Analysis Groups. | 
 |  | 
 | Analysis Group Concepts | 
 | ^^^^^^^^^^^^^^^^^^^^^^^ | 
 |  | 
 | An Analysis Group is a single simple interface that may be implemented by | 
 | multiple different passes.  Analysis Groups can be given human readable names | 
 | just like passes, but unlike passes, they need not derive from the ``Pass`` | 
 | class.  An analysis group may have one or more implementations, one of which is | 
 | the "default" implementation. | 
 |  | 
 | Analysis groups are used by client passes just like other passes are: the | 
 | ``AnalysisUsage::addRequired()`` and ``Pass::getAnalysis()`` methods.  In order | 
 | to resolve this requirement, the :ref:`PassManager | 
 | <writing-an-llvm-pass-passmanager>` scans the available passes to see if any | 
 | implementations of the analysis group are available.  If none is available, the | 
 | default implementation is created for the pass to use.  All standard rules for | 
 | :ref:`interaction between passes <writing-an-llvm-pass-interaction>` still | 
 | apply. | 
 |  | 
 | Although :ref:`Pass Registration <writing-an-llvm-pass-registration>` is | 
 | optional for normal passes, all analysis group implementations must be | 
 | registered, and must use the :ref:`INITIALIZE_AG_PASS | 
 | <writing-an-llvm-pass-RegisterAnalysisGroup>` template to join the | 
 | implementation pool.  Also, a default implementation of the interface **must** | 
 | be registered with :ref:`RegisterAnalysisGroup | 
 | <writing-an-llvm-pass-RegisterAnalysisGroup>`. | 
 |  | 
 | As a concrete example of an Analysis Group in action, consider the | 
 | `AliasAnalysis <http://llvm.org/doxygen/classllvm_1_1AliasAnalysis.html>`_ | 
 | analysis group.  The default implementation of the alias analysis interface | 
 | (the `basicaa <http://llvm.org/doxygen/structBasicAliasAnalysis.html>`_ pass) | 
 | just does a few simple checks that don't require significant analysis to | 
 | compute (such as: two different globals can never alias each other, etc). | 
 | Passes that use the `AliasAnalysis | 
 | <http://llvm.org/doxygen/classllvm_1_1AliasAnalysis.html>`_ interface (for | 
 | example the `gvn <http://llvm.org/doxygen/classllvm_1_1GVN.html>`_ pass), do not | 
 | care which implementation of alias analysis is actually provided, they just use | 
 | the designated interface. | 
 |  | 
 | From the user's perspective, commands work just like normal.  Issuing the | 
 | command ``opt -gvn ...`` will cause the ``basicaa`` class to be instantiated | 
 | and added to the pass sequence.  Issuing the command ``opt -somefancyaa -gvn | 
 | ...`` will cause the ``gvn`` pass to use the ``somefancyaa`` alias analysis | 
 | (which doesn't actually exist, it's just a hypothetical example) instead. | 
 |  | 
 | .. _writing-an-llvm-pass-RegisterAnalysisGroup: | 
 |  | 
 | Using ``RegisterAnalysisGroup`` | 
 | ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ | 
 |  | 
 | The ``RegisterAnalysisGroup`` template is used to register the analysis group | 
 | itself, while the ``INITIALIZE_AG_PASS`` is used to add pass implementations to | 
 | the analysis group.  First, an analysis group should be registered, with a | 
 | human readable name provided for it.  Unlike registration of passes, there is | 
 | no command line argument to be specified for the Analysis Group Interface | 
 | itself, because it is "abstract": | 
 |  | 
 | .. code-block:: c++ | 
 |  | 
 |   static RegisterAnalysisGroup<AliasAnalysis> A("Alias Analysis"); | 
 |  | 
 | Once the analysis is registered, passes can declare that they are valid | 
 | implementations of the interface by using the following code: | 
 |  | 
 | .. code-block:: c++ | 
 |  | 
 |   namespace { | 
 |     // Declare that we implement the AliasAnalysis interface | 
 |     INITIALIZE_AG_PASS(FancyAA, AliasAnalysis , "somefancyaa", | 
 |         "A more complex alias analysis implementation", | 
 |         false,  // Is CFG Only? | 
 |         true,   // Is Analysis? | 
 |         false); // Is default Analysis Group implementation? | 
 |   } | 
 |  | 
 | This just shows a class ``FancyAA`` that uses the ``INITIALIZE_AG_PASS`` macro | 
 | both to register and to "join" the `AliasAnalysis | 
 | <http://llvm.org/doxygen/classllvm_1_1AliasAnalysis.html>`_ analysis group. | 
 | Every implementation of an analysis group should join using this macro. | 
 |  | 
 | .. code-block:: c++ | 
 |  | 
 |   namespace { | 
 |     // Declare that we implement the AliasAnalysis interface | 
 |     INITIALIZE_AG_PASS(BasicAA, AliasAnalysis, "basicaa", | 
 |         "Basic Alias Analysis (default AA impl)", | 
 |         false, // Is CFG Only? | 
 |         true,  // Is Analysis? | 
 |         true); // Is default Analysis Group implementation? | 
 |   } | 
 |  | 
 | Here we show how the default implementation is specified (using the final | 
 | argument to the ``INITIALIZE_AG_PASS`` template).  There must be exactly one | 
 | default implementation available at all times for an Analysis Group to be used. | 
 | Only default implementation can derive from ``ImmutablePass``.  Here we declare | 
 | that the `BasicAliasAnalysis | 
 | <http://llvm.org/doxygen/structBasicAliasAnalysis.html>`_ pass is the default | 
 | implementation for the interface. | 
 |  | 
 | Pass Statistics | 
 | =============== | 
 |  | 
 | The `Statistic <http://llvm.org/doxygen/Statistic_8h_source.html>`_ class is | 
 | designed to be an easy way to expose various success metrics from passes. | 
 | These statistics are printed at the end of a run, when the :option:`-stats` | 
 | command line option is enabled on the command line.  See the :ref:`Statistics | 
 | section <Statistic>` in the Programmer's Manual for details. | 
 |  | 
 | .. _writing-an-llvm-pass-passmanager: | 
 |  | 
 | What PassManager does | 
 | --------------------- | 
 |  | 
 | The `PassManager <http://llvm.org/doxygen/PassManager_8h_source.html>`_ `class | 
 | <http://llvm.org/doxygen/classllvm_1_1PassManager.html>`_ takes a list of | 
 | passes, ensures their :ref:`prerequisites <writing-an-llvm-pass-interaction>` | 
 | are set up correctly, and then schedules passes to run efficiently.  All of the | 
 | LLVM tools that run passes use the PassManager for execution of these passes. | 
 |  | 
 | The PassManager does two main things to try to reduce the execution time of a | 
 | series of passes: | 
 |  | 
 | #. **Share analysis results.**  The ``PassManager`` attempts to avoid | 
 |    recomputing analysis results as much as possible.  This means keeping track | 
 |    of which analyses are available already, which analyses get invalidated, and | 
 |    which analyses are needed to be run for a pass.  An important part of work | 
 |    is that the ``PassManager`` tracks the exact lifetime of all analysis | 
 |    results, allowing it to :ref:`free memory | 
 |    <writing-an-llvm-pass-releaseMemory>` allocated to holding analysis results | 
 |    as soon as they are no longer needed. | 
 |  | 
 | #. **Pipeline the execution of passes on the program.**  The ``PassManager`` | 
 |    attempts to get better cache and memory usage behavior out of a series of | 
 |    passes by pipelining the passes together.  This means that, given a series | 
 |    of consecutive :ref:`FunctionPass <writing-an-llvm-pass-FunctionPass>`, it | 
 |    will execute all of the :ref:`FunctionPass | 
 |    <writing-an-llvm-pass-FunctionPass>` on the first function, then all of the | 
 |    :ref:`FunctionPasses <writing-an-llvm-pass-FunctionPass>` on the second | 
 |    function, etc... until the entire program has been run through the passes. | 
 |  | 
 |    This improves the cache behavior of the compiler, because it is only | 
 |    touching the LLVM program representation for a single function at a time, | 
 |    instead of traversing the entire program.  It reduces the memory consumption | 
 |    of compiler, because, for example, only one `DominatorSet | 
 |    <http://llvm.org/doxygen/classllvm_1_1DominatorSet.html>`_ needs to be | 
 |    calculated at a time.  This also makes it possible to implement some | 
 |    :ref:`interesting enhancements <writing-an-llvm-pass-SMP>` in the future. | 
 |  | 
 | The effectiveness of the ``PassManager`` is influenced directly by how much | 
 | information it has about the behaviors of the passes it is scheduling.  For | 
 | example, the "preserved" set is intentionally conservative in the face of an | 
 | unimplemented :ref:`getAnalysisUsage <writing-an-llvm-pass-getAnalysisUsage>` | 
 | method.  Not implementing when it should be implemented will have the effect of | 
 | not allowing any analysis results to live across the execution of your pass. | 
 |  | 
 | The ``PassManager`` class exposes a ``--debug-pass`` command line options that | 
 | is useful for debugging pass execution, seeing how things work, and diagnosing | 
 | when you should be preserving more analyses than you currently are.  (To get | 
 | information about all of the variants of the ``--debug-pass`` option, just type | 
 | "``opt -help-hidden``"). | 
 |  | 
 | By using the --debug-pass=Structure option, for example, we can see how our | 
 | :ref:`Hello World <writing-an-llvm-pass-basiccode>` pass interacts with other | 
 | passes.  Lets try it out with the gvn and licm passes: | 
 |  | 
 | .. code-block:: console | 
 |  | 
 |   $ opt -load lib/LLVMHello.so -gvn -licm --debug-pass=Structure < hello.bc > /dev/null | 
 |   ModulePass Manager | 
 |     FunctionPass Manager | 
 |       Dominator Tree Construction | 
 |       Basic Alias Analysis (stateless AA impl) | 
 |       Function Alias Analysis Results | 
 |       Memory Dependence Analysis | 
 |       Global Value Numbering | 
 |       Natural Loop Information | 
 |       Canonicalize natural loops | 
 |       Loop-Closed SSA Form Pass | 
 |       Basic Alias Analysis (stateless AA impl) | 
 |       Function Alias Analysis Results | 
 |       Scalar Evolution Analysis | 
 |       Loop Pass Manager | 
 |         Loop Invariant Code Motion | 
 |       Module Verifier | 
 |     Bitcode Writer | 
 |  | 
 | This output shows us when passes are constructed. | 
 | Here we see that GVN uses dominator tree information to do its job.  The LICM pass | 
 | uses natural loop information, which uses dominator tree as well. | 
 |  | 
 | After the LICM pass, the module verifier runs (which is automatically added by | 
 | the :program:`opt` tool), which uses the dominator tree to check that the | 
 | resultant LLVM code is well formed. Note that the dominator tree is computed | 
 | once, and shared by three passes. | 
 |  | 
 | Lets see how this changes when we run the :ref:`Hello World | 
 | <writing-an-llvm-pass-basiccode>` pass in between the two passes: | 
 |  | 
 | .. code-block:: console | 
 |  | 
 |   $ opt -load lib/LLVMHello.so -gvn -hello -licm --debug-pass=Structure < hello.bc > /dev/null | 
 |   ModulePass Manager | 
 |     FunctionPass Manager | 
 |       Dominator Tree Construction | 
 |       Basic Alias Analysis (stateless AA impl) | 
 |       Function Alias Analysis Results | 
 |       Memory Dependence Analysis | 
 |       Global Value Numbering | 
 |       Hello World Pass | 
 |       Dominator Tree Construction | 
 |       Natural Loop Information | 
 |       Canonicalize natural loops | 
 |       Loop-Closed SSA Form Pass | 
 |       Basic Alias Analysis (stateless AA impl) | 
 |       Function Alias Analysis Results | 
 |       Scalar Evolution Analysis | 
 |       Loop Pass Manager | 
 |         Loop Invariant Code Motion | 
 |       Module Verifier | 
 |     Bitcode Writer | 
 |   Hello: __main | 
 |   Hello: puts | 
 |   Hello: main | 
 |  | 
 | Here we see that the :ref:`Hello World <writing-an-llvm-pass-basiccode>` pass | 
 | has killed the Dominator Tree pass, even though it doesn't modify the code at | 
 | all!  To fix this, we need to add the following :ref:`getAnalysisUsage | 
 | <writing-an-llvm-pass-getAnalysisUsage>` method to our pass: | 
 |  | 
 | .. code-block:: c++ | 
 |  | 
 |   // We don't modify the program, so we preserve all analyses | 
 |   void getAnalysisUsage(AnalysisUsage &AU) const override { | 
 |     AU.setPreservesAll(); | 
 |   } | 
 |  | 
 | Now when we run our pass, we get this output: | 
 |  | 
 | .. code-block:: console | 
 |  | 
 |   $ opt -load lib/LLVMHello.so -gvn -hello -licm --debug-pass=Structure < hello.bc > /dev/null | 
 |   Pass Arguments:  -gvn -hello -licm | 
 |   ModulePass Manager | 
 |     FunctionPass Manager | 
 |       Dominator Tree Construction | 
 |       Basic Alias Analysis (stateless AA impl) | 
 |       Function Alias Analysis Results | 
 |       Memory Dependence Analysis | 
 |       Global Value Numbering | 
 |       Hello World Pass | 
 |       Natural Loop Information | 
 |       Canonicalize natural loops | 
 |       Loop-Closed SSA Form Pass | 
 |       Basic Alias Analysis (stateless AA impl) | 
 |       Function Alias Analysis Results | 
 |       Scalar Evolution Analysis | 
 |       Loop Pass Manager | 
 |         Loop Invariant Code Motion | 
 |       Module Verifier | 
 |     Bitcode Writer | 
 |   Hello: __main | 
 |   Hello: puts | 
 |   Hello: main | 
 |  | 
 | Which shows that we don't accidentally invalidate dominator information | 
 | anymore, and therefore do not have to compute it twice. | 
 |  | 
 | .. _writing-an-llvm-pass-releaseMemory: | 
 |  | 
 | The ``releaseMemory`` method | 
 | ^^^^^^^^^^^^^^^^^^^^^^^^^^^^ | 
 |  | 
 | .. code-block:: c++ | 
 |  | 
 |   virtual void releaseMemory(); | 
 |  | 
 | The ``PassManager`` automatically determines when to compute analysis results, | 
 | and how long to keep them around for.  Because the lifetime of the pass object | 
 | itself is effectively the entire duration of the compilation process, we need | 
 | some way to free analysis results when they are no longer useful.  The | 
 | ``releaseMemory`` virtual method is the way to do this. | 
 |  | 
 | If you are writing an analysis or any other pass that retains a significant | 
 | amount of state (for use by another pass which "requires" your pass and uses | 
 | the :ref:`getAnalysis <writing-an-llvm-pass-getAnalysis>` method) you should | 
 | implement ``releaseMemory`` to, well, release the memory allocated to maintain | 
 | this internal state.  This method is called after the ``run*`` method for the | 
 | class, before the next call of ``run*`` in your pass. | 
 |  | 
 | Registering dynamically loaded passes | 
 | ===================================== | 
 |  | 
 | *Size matters* when constructing production quality tools using LLVM, both for | 
 | the purposes of distribution, and for regulating the resident code size when | 
 | running on the target system.  Therefore, it becomes desirable to selectively | 
 | use some passes, while omitting others and maintain the flexibility to change | 
 | configurations later on.  You want to be able to do all this, and, provide | 
 | feedback to the user.  This is where pass registration comes into play. | 
 |  | 
 | The fundamental mechanisms for pass registration are the | 
 | ``MachinePassRegistry`` class and subclasses of ``MachinePassRegistryNode``. | 
 |  | 
 | An instance of ``MachinePassRegistry`` is used to maintain a list of | 
 | ``MachinePassRegistryNode`` objects.  This instance maintains the list and | 
 | communicates additions and deletions to the command line interface. | 
 |  | 
 | An instance of ``MachinePassRegistryNode`` subclass is used to maintain | 
 | information provided about a particular pass.  This information includes the | 
 | command line name, the command help string and the address of the function used | 
 | to create an instance of the pass.  A global static constructor of one of these | 
 | instances *registers* with a corresponding ``MachinePassRegistry``, the static | 
 | destructor *unregisters*.  Thus a pass that is statically linked in the tool | 
 | will be registered at start up.  A dynamically loaded pass will register on | 
 | load and unregister at unload. | 
 |  | 
 | Using existing registries | 
 | ------------------------- | 
 |  | 
 | There are predefined registries to track instruction scheduling | 
 | (``RegisterScheduler``) and register allocation (``RegisterRegAlloc``) machine | 
 | passes.  Here we will describe how to *register* a register allocator machine | 
 | pass. | 
 |  | 
 | Implement your register allocator machine pass.  In your register allocator | 
 | ``.cpp`` file add the following include: | 
 |  | 
 | .. code-block:: c++ | 
 |  | 
 |   #include "llvm/CodeGen/RegAllocRegistry.h" | 
 |  | 
 | Also in your register allocator ``.cpp`` file, define a creator function in the | 
 | form: | 
 |  | 
 | .. code-block:: c++ | 
 |  | 
 |   FunctionPass *createMyRegisterAllocator() { | 
 |     return new MyRegisterAllocator(); | 
 |   } | 
 |  | 
 | Note that the signature of this function should match the type of | 
 | ``RegisterRegAlloc::FunctionPassCtor``.  In the same file add the "installing" | 
 | declaration, in the form: | 
 |  | 
 | .. code-block:: c++ | 
 |  | 
 |   static RegisterRegAlloc myRegAlloc("myregalloc", | 
 |                                      "my register allocator help string", | 
 |                                      createMyRegisterAllocator); | 
 |  | 
 | Note the two spaces prior to the help string produces a tidy result on the | 
 | :option:`-help` query. | 
 |  | 
 | .. code-block:: console | 
 |  | 
 |   $ llc -help | 
 |     ... | 
 |     -regalloc                    - Register allocator to use (default=linearscan) | 
 |       =linearscan                -   linear scan register allocator | 
 |       =local                     -   local register allocator | 
 |       =simple                    -   simple register allocator | 
 |       =myregalloc                -   my register allocator help string | 
 |     ... | 
 |  | 
 | And that's it.  The user is now free to use ``-regalloc=myregalloc`` as an | 
 | option.  Registering instruction schedulers is similar except use the | 
 | ``RegisterScheduler`` class.  Note that the | 
 | ``RegisterScheduler::FunctionPassCtor`` is significantly different from | 
 | ``RegisterRegAlloc::FunctionPassCtor``. | 
 |  | 
 | To force the load/linking of your register allocator into the | 
 | :program:`llc`/:program:`lli` tools, add your creator function's global | 
 | declaration to ``Passes.h`` and add a "pseudo" call line to | 
 | ``llvm/Codegen/LinkAllCodegenComponents.h``. | 
 |  | 
 | Creating new registries | 
 | ----------------------- | 
 |  | 
 | The easiest way to get started is to clone one of the existing registries; we | 
 | recommend ``llvm/CodeGen/RegAllocRegistry.h``.  The key things to modify are | 
 | the class name and the ``FunctionPassCtor`` type. | 
 |  | 
 | Then you need to declare the registry.  Example: if your pass registry is | 
 | ``RegisterMyPasses`` then define: | 
 |  | 
 | .. code-block:: c++ | 
 |  | 
 |   MachinePassRegistry RegisterMyPasses::Registry; | 
 |  | 
 | And finally, declare the command line option for your passes.  Example: | 
 |  | 
 | .. code-block:: c++ | 
 |  | 
 |   cl::opt<RegisterMyPasses::FunctionPassCtor, false, | 
 |           RegisterPassParser<RegisterMyPasses> > | 
 |   MyPassOpt("mypass", | 
 |             cl::init(&createDefaultMyPass), | 
 |             cl::desc("my pass option help")); | 
 |  | 
 | Here the command option is "``mypass``", with ``createDefaultMyPass`` as the | 
 | default creator. | 
 |  | 
 | Using GDB with dynamically loaded passes | 
 | ---------------------------------------- | 
 |  | 
 | Unfortunately, using GDB with dynamically loaded passes is not as easy as it | 
 | should be.  First of all, you can't set a breakpoint in a shared object that | 
 | has not been loaded yet, and second of all there are problems with inlined | 
 | functions in shared objects.  Here are some suggestions to debugging your pass | 
 | with GDB. | 
 |  | 
 | For sake of discussion, I'm going to assume that you are debugging a | 
 | transformation invoked by :program:`opt`, although nothing described here | 
 | depends on that. | 
 |  | 
 | Setting a breakpoint in your pass | 
 | ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ | 
 |  | 
 | First thing you do is start gdb on the opt process: | 
 |  | 
 | .. code-block:: console | 
 |  | 
 |   $ gdb opt | 
 |   GNU gdb 5.0 | 
 |   Copyright 2000 Free Software Foundation, Inc. | 
 |   GDB is free software, covered by the GNU General Public License, and you are | 
 |   welcome to change it and/or distribute copies of it under certain conditions. | 
 |   Type "show copying" to see the conditions. | 
 |   There is absolutely no warranty for GDB.  Type "show warranty" for details. | 
 |   This GDB was configured as "sparc-sun-solaris2.6"... | 
 |   (gdb) | 
 |  | 
 | Note that :program:`opt` has a lot of debugging information in it, so it takes | 
 | time to load.  Be patient.  Since we cannot set a breakpoint in our pass yet | 
 | (the shared object isn't loaded until runtime), we must execute the process, | 
 | and have it stop before it invokes our pass, but after it has loaded the shared | 
 | object.  The most foolproof way of doing this is to set a breakpoint in | 
 | ``PassManager::run`` and then run the process with the arguments you want: | 
 |  | 
 | .. code-block:: console | 
 |  | 
 |   $ (gdb) break llvm::PassManager::run | 
 |   Breakpoint 1 at 0x2413bc: file Pass.cpp, line 70. | 
 |   (gdb) run test.bc -load $(LLVMTOP)/llvm/Debug+Asserts/lib/[libname].so -[passoption] | 
 |   Starting program: opt test.bc -load $(LLVMTOP)/llvm/Debug+Asserts/lib/[libname].so -[passoption] | 
 |   Breakpoint 1, PassManager::run (this=0xffbef174, M=@0x70b298) at Pass.cpp:70 | 
 |   70      bool PassManager::run(Module &M) { return PM->run(M); } | 
 |   (gdb) | 
 |  | 
 | Once the :program:`opt` stops in the ``PassManager::run`` method you are now | 
 | free to set breakpoints in your pass so that you can trace through execution or | 
 | do other standard debugging stuff. | 
 |  | 
 | Miscellaneous Problems | 
 | ^^^^^^^^^^^^^^^^^^^^^^ | 
 |  | 
 | Once you have the basics down, there are a couple of problems that GDB has, | 
 | some with solutions, some without. | 
 |  | 
 | * Inline functions have bogus stack information.  In general, GDB does a pretty | 
 |   good job getting stack traces and stepping through inline functions.  When a | 
 |   pass is dynamically loaded however, it somehow completely loses this | 
 |   capability.  The only solution I know of is to de-inline a function (move it | 
 |   from the body of a class to a ``.cpp`` file). | 
 |  | 
 | * Restarting the program breaks breakpoints.  After following the information | 
 |   above, you have succeeded in getting some breakpoints planted in your pass. | 
 |   Next thing you know, you restart the program (i.e., you type "``run``" again), | 
 |   and you start getting errors about breakpoints being unsettable.  The only | 
 |   way I have found to "fix" this problem is to delete the breakpoints that are | 
 |   already set in your pass, run the program, and re-set the breakpoints once | 
 |   execution stops in ``PassManager::run``. | 
 |  | 
 | Hopefully these tips will help with common case debugging situations.  If you'd | 
 | like to contribute some tips of your own, just contact `Chris | 
 | <mailto:sabre@nondot.org>`_. | 
 |  | 
 | Future extensions planned | 
 | ------------------------- | 
 |  | 
 | Although the LLVM Pass Infrastructure is very capable as it stands, and does | 
 | some nifty stuff, there are things we'd like to add in the future.  Here is | 
 | where we are going: | 
 |  | 
 | .. _writing-an-llvm-pass-SMP: | 
 |  | 
 | Multithreaded LLVM | 
 | ^^^^^^^^^^^^^^^^^^ | 
 |  | 
 | Multiple CPU machines are becoming more common and compilation can never be | 
 | fast enough: obviously we should allow for a multithreaded compiler.  Because | 
 | of the semantics defined for passes above (specifically they cannot maintain | 
 | state across invocations of their ``run*`` methods), a nice clean way to | 
 | implement a multithreaded compiler would be for the ``PassManager`` class to | 
 | create multiple instances of each pass object, and allow the separate instances | 
 | to be hacking on different parts of the program at the same time. | 
 |  | 
 | This implementation would prevent each of the passes from having to implement | 
 | multithreaded constructs, requiring only the LLVM core to have locking in a few | 
 | places (for global resources).  Although this is a simple extension, we simply | 
 | haven't had time (or multiprocessor machines, thus a reason) to implement this. | 
 | Despite that, we have kept the LLVM passes SMP ready, and you should too. | 
 |  |