Introduction
 
  
Apache Cocoon is an XML publishing framework. It allows you to define XML
     documents and transformations to be applied on it, to eventually
     generate a presentation format of your choice (HTML, PDF, SVG, etc.).
     Cocoon also gives you the possibility to have logic in your XML files
     (so that the XML file itself can become dynamically generated).
 
  
Cocoon is developed on top of the Avalon Server Framework, which is a
     stable and scalable framework.  You can find out more about Avalon in
     this document: (ref: Avalon White Paper).  I highly suggest reading
     this white paper as it covers many concepts that are key to Cocoon,
     namely Separation of Concerns (SOC) and Inversion of Control (IoC).
     It also covers foundational aspects of the Avalon Framework, so you
     can have a better understanding on how Cocoon is structured.
 
  
Cocoon helps you separate out concern areas for web development.
     The areas addressed are Logic, Content, Style, and Management.  There
     are different mechanisms for each.
 
  
In order to learn how to use Cocoon, first make sure that you
     install it properly, then investigate
     the many samples.
     The following screenshots come from the
     "tutorial" that is provided with Cocoon.
     After you have built the demo webapp as per the installation
     instructions (build webapp) then you can see this tutorial
     in action via the Samples pages.
 
  
Separating Concerns
    
Cocoon is designed to allow Developers, Business Analysts,
       Designers, and Administrators to work with each other without breaking
       the other person's contribution.  The problem with using just JSPs, ASPs,
       or ColdFusion templates is that all of the look, feel, and logic are
       intertwined.  That means that maintenance is much more difficult, and the
       project's true costs are delayed until the customer wants feature
       enhancements or bugs fixed.  This also means that if the site design is
       introduced late in the game, the cost of revamping the site becomes much
       higher.
 
    
Developers
      
Developer's jobs are to create the business logic and object model
         behind the web application.  They are more concerned with functionality
         than with layout or the words displayed on a screen.  These are the
         people that will develop the Actions (Components that only process
         information) and the hooks for how to get the necessary information
         from business objects.
 
    
    
Business Analysts
      
The Business Analysts are the people who are concerned with the words
         displayed on the screen, and to a certain extent, the layout.
         Typically, they will be using the work done by the developer to put
         together a generic markup that will be transformed into the results.
         In small development environments, many times the developer takes on
         both this role and the developer role.  Typically, the business analyst
         will be working with the markup language that goes into the 
         generator.
 
    
    
Designers
      
The designer is the person or group of people who are responsible to
         provide the final look and feel of a site.  The designer does all the
         graphics and HTML code.  In Cocoon, they will be working with the
         Transformers that take an input and structure it in a final
         presentation.
 
    
    
Administrators
      
The administrator is responsible for the sitemap which maps the URI
         space to the different pipelines in Cocoon.  A pipeline is a path from
         a Generator to a Serializer.  This means, that the administrator
         decides that all requests for a resource with a ".html"
         extension starts out as XML and ends up as HTML.  The Administrator
         will work closely with the Designers and the Developers.  In the
         absence of a dedicated administrator, one developer should assume that
         role.  It is important that developers do not get bogged down in this
         one Component.
 
    
  
  
Development Style
    
You have to decide early on whether you will develop from a Business
       Markup perspective, or develop from a Document Markup perspective.  They
       have different ways of approaching the same problem.  Both approaches
       have its tradeoffs.  In the end, you will find that you will need a
       combination of different aspects of the two approaches.
 
    
Business Markup Centric
      
This approach makes the Business Object the center of attention for
         development.  This approach formalizes your business objects, and makes
         sure that you always represent a business object in a standard manner. 
         It's limitations come to bear when you have cases when you need
         two different objects that need to be represented on the same logical
         page.
 
    
    
Document Markup Centric
      
This approach feels the most natural to developers who come from
         backgrounds with scripting languages.  This approach is a bit more
         flexible in that you represent a page logically, with the wording as
         the center of attention.  With this approach, it is up to the developer
         to ensure that the business object is represented in a consistent
         manner.
 
    
    
Hybrid Approach
      
We will develop a hybrid approach to development in this paper.  What
         this means is that we start with a Document Markup Centric approach,
         and add in support for specific Business Markup as it is needed.  In
         the end, this is the most flexible and maintainable method for
         development.
 
    
  
  
The Concept
    
For the sake of this paper, we are going to develop a very simple
       database-backed application that manages users and departments.  Each
       element has a name and an identifier.  A department can have many
       employees, but each employee can only have one department.  We will be
       able to create, change, and delete both employees and departments.
 
    
The SQL
  |   |   | 
 
  | 
CREATE TABLE department {
    department_id INT NOT NULL,
    department_name VARCHAR (64) NOT NULL
};
CREATE TABLE employee {
    employee_id INT NOT NULL,
    employee_name VARCHAR (64) NOT NULL,
    department_id INT NOT NULL
};
ALTER TABLE department ADD
    PRIMARY KEY pkDepartment (department_id);
ALTER TABLE employee ADD
    PRIMARY KEY pkEmployee (employee_id);
ALTER TABLE employee ADD
    FOREIGN KEY department_id (department.department_id);
 |   | 
 
  |   |   | 
 
 
 
    
    
Facilities
      
        
- 
Create Department (need name only)
 
        
- 
Update Department (change name, reassign potential employees to
          department, create employee for department)
 
        
- 
Delete Department
 
        
- 
Find Department (by name, or by ID)
 
        
- 
Create Employee (need name and department-create department if
          needed)
 
        
- 
Update Employee (change name, reassign department-create
          department if needed)
 
        
- 
Delete Employee
 
        
- 
Find Employees (by name, by ID, or by Department)
 
      
 
 
    
    
Layouts
     
Various screenshots
      are available as a separate document, to portray the layout of
      interfaces and results pages - apply your own style.
 
    
  
 
 
Diving In
 
  
In order to do anything in Cocoon, you will need a sitemap.  At this point
     we will not go into detail but we will show you how to put an entry in so
     you can see your stuff.  In most development situations, the sitemap will
     be set up for you.  Since we want to start with a clean slate, take the
     sitemap that comes with Cocoon's 
     samples
     and clear out everything under
     the <map:pipelines> tag.  Next, you will add an entry
     in the same location that looks like this:
 
  |   |   | 
 
  | 
  
<map:pipeline>
   <map:match pattern="">
     <map:redirect-to uri="home.html"/>
   </map:match>
   <map:match pattern="**.xml">
     <map:generate src="docs/{1}.xml"/>
     <map:serialize type="xml"/>
   </map:match>
   <map:match pattern="**.html">
     <map:generate src="docs/{1}.xml"/>
     <map:transform src="stylesheets/apache.xsl"/>
     <map:serialize/>
   </map:match>
   <map:match pattern="images/**.gif">
    <map:read src="resources/images/{1}.gif" mime-type="image/gif"/>
   </map:match>
   <map:match pattern="images/**.jpg">
    <map:read src="resources/images/{1}.jpg" mime-type="image/jpg"/>
   </map:match>
   <map:match pattern="images/**.png">
    <map:read src="resources/images/{1}.png" mime-type="image/png"/>
   </map:match>
   <map:match pattern="resources/**.css">
     <map:read src="resources/styles/{1}.css" mime-type="text/css"/>
   </map:match>
   <map:match pattern="resources/**.js">
     <map:read src="resource/styles/{1}.js"
               mime-type="application/x-javascript"/>
   </map:match>
  <map:handle-errors>
    <map:transform src="stylesheets/system/error2html.xsl"/>
    <map:serialize status-code="500"/>
  </map:handle-errors>
</map:pipeline>
  
 |   | 
 
  |   |   | 
 
 
 
    
What this does is tell the sitemap that we want to capture all URLs
       with a ".xml" extension, and find an equivalent file in the
       "docs" subdirectory.  We are not performing any transformations
       at this time.  The Sitemap is really a site administrator's job to
       maintain.  There are some exceptions to this general rule, but we will
       discuss them when needed.  We will use the Document Markup specified in
       the StyleBook DTD format.
 
   
Creating the Pages
    
Since we are only looking at XML right now, we need to make sure our
       pages conform to the markup standards.  You will see how well this comes
       in handy for debugging XSP (XML Server Pages) markup.  Since we already
       have the Layout specified, and the database created, we will create our
       markup.
 
    
Our home page is going to be really simple: a list of links that take us
       to the main pages.
 
  |   |   | 
 
  | 
  
<document>
  <header>
    <title>Home Page</title>
  </header>
  <body>
    <s1 title="Welcome to Personnel Administrator">
      <p>
        Welcome to our Personnel Administrator.  You
        can perform one of the following functions:
      </p>
      <ul>
        <li>
          <link href="search-dept.html">Search Departments</link>
        </li>
        <li>
          <link href="search-empl.html">Search Employees</link>
        </li>
        <li>
          <link href="create-dept.html">Create Departments</link>
        </li>
        <li>
          <link href="edit-dept.html">Edit a Department</link>
        </li>
        <li>
          <link href="create-empl.html">Create Employee</link>
        </li>
        <li>
          <link href="edit-empl.html">Edit an Employee</link>
        </li>
      </ul>
    </s1>
  </body>
</document>
  
 |   | 
 
  |   |   | 
 
 
 
     
Even though this doesn't look like much right now, we have two
        entries: "**.xml" and "**.html" for the same
        resource.  Look at "home.html", and see how it looks now.
        Quite a difference.  Don't remove the entry for viewing the page
        as XML yet.  We need to use it to debug our XSP pages later.
 
     
Our First Form
       
For now, we are going to skip the search functionality, and jump to
          our "create" templates.  It is important to realize the
          proper method of form handling.  While it is possible to create XSP
          pages that perform the logic for you, this approach is not very
          maintainable.  We also have to choose whether we will directly access
          the database, or encapsulate that logic in objects.
 
       
The tradeoffs are that the direct SQL access is faster to get started,
          but that it is harder to maintain in the end.  You may decide to start
          with the direct SQL access at the beginning of a project, and build
          the objects later.  With that in mind, we will use some functionality
          that Cocoon has built in to make this approach a little easier.
          Cocoon has a group of Database actions that allow you to map form
          fields to dynamically created SQL calls.  It also has a logicsheet
          that makes creating SQL bound pages a little easier.
 
       
Our first form is the "Create a Department" form.  The
          website specification is missing the tags for form building, we will
          provide an example here:
 
  |   |   | 
 
  | 
  
<document>
  <header>
    <title>Department</title>
  </header>
  <body>
    <s1 title="Create a Department">
      <form handler="create-dept.html">
        <p>
          You can create a department by typing in the
          name and pressing the "submit" button.
        </p>
        <p>
          Name: <text name="name" size="30" required="true"/>
        </p>
        <submit name="Create Department"/>
        <note>
          * These fields are required.
        </note>
      </form>
    </s1>
  </body>
</document> 
  
 |   | 
 
  |   |   | 
 
 
 
       
It is important to note that the "submit" tag is transformed
          into an HTML submit button with the name "cocoon-action-ACTIONNAME".
          The "cocoon-action-ACTIONNAME" form parameter is a magic value that
          Cocoon uses to select a specific action from a group of actions that
          only gets executed during that time.  You will find that this page
          displays correctly, but does not do anything yet.  The handler is
          where the navigation goes once you click on the
          "Create Department" button on the screen.  What we are going
          to do is create one confirmation page for all the Department and
          Employee pages.
 
       
Cocoon has a FormValidatorAction that will take care of ensuring the
          input results are acceptable.  It also has the following database
          actions for your convenience: DatabaseAddAction, DatabaseUpdateAction,
          DatabaseDeleteAction, and DatabaseAuthenticatorAction.  We will only
          need the Add, Update, and Delete actions for our simple webapp.  In
          order to prepare them, we create an XML configuration file that tells
          the actions how to map request parameters to database tables and place
          constraints on the parameters.  For the Department form group, it will
          look like this:
 
  |   |   | 
 
  | 
  
<root>
  <!-
      The "parameter" elements identify the root constraints for
      the FormValidatorAction.  We are specifying that the "id"
      parameter is an integer (it limits to "long", "double",
      "boolean", and "string").  We are specifying that the "name"
      parameter is a string that is at least 5 characters--but no
      more than 64 characters.
  -->
  <parameter name="id" type="long"/>
  <parameter name="name" type="string" min-len="5" max-len="64"/>
  <!-
      Each constraint set is used when we are defining a new way
      of validating a form.  We define our constraint sets by
      function.  Since we have the same basic form that is driving
      the FormValidator, we have an update set and an add set.
      Note that you can impose additional constraints than the
      default constraints listed above.  Also, you do not "have"
      to enforce a constraint.  Each "validate" element below
      identifies the parameter constraints we are enforcing.
      For more information view the JavaDocs for 
      AbstractValidatorAction
  -->
  <constraint-set name="update">
    <validate name="name"/>
    <validate name="id" nullable="no" min="1"/>
  </constraint-set>
  <constraint-set name="add">
    <validate name="name"/>
  </constraint-set>
  <!--
       This is where we identify our table mappings so that the
       Database Actions can work their magic.  Note that the
       parameter names are the same as above--as well as the same
       as form parameter names.
       First we tell the Database Actions that we are using the
       "personnel" connection pool we set up in <code>cocoon.xconf</code>.
       This file should be set up by the site administrator.
       We also tell the Database Actions the structure of the table
       we will be populating.  The keys are used to identify which
       columns will be treated as keys--they are treated different
       when the different SQL statements are created.  Note that
       there is a "mode" attribute in the key element.  The mode
       refers to how new keys will be generated.  There are three
       modes: "automatic" keys are generated by the database,
       "manual" keys are generated by manually finding the largest
       value and incrementing it, and finally "form" keys take the
       key value from a parameter on the form.
       Both keys and values serve to map parameter names to table
       columns, converting the value into the native type.  For a
       list of supported types check out the JavaDocs for
       AbstractDatabaseAction.
  -->
  <connection>personnel</connection>
  <table name="department">
    <keys>
      <key param="id" dbcol="department_id" type="int" mode="manual"/>
    </keys>
    <values>
      <value param="name" dbcol="department_name" type="string"/>
    </values>
  </table>
</root>
  
 |   | 
 
  |   |   | 
 
 
 
       
After you create the descriptor file, you will have to create some
          entries in the Sitemap so you can take advantage of the form
          descriptor.  First, the Sitemap has to be able to know how to
          reference the Actions we want.  To do that, alter the
          "map:actions" section to list all the actions we need:
 
  |   |   | 
 
  | 
  
<map:actions>
   <map:action name="dbAdd"
               src="org.apache.cocoon.acting.DatabaseAddAction"/>
   <map:action name="dbDel"
               src="org.apache.cocoon.acting.DatabaseDeleteAction"/>
   <map:action name="dbUpd"
               src="org.apache.cocoon.acting.DatabaseUpdateAction"/>
   <map:action name="form"
               src="org.apache.cocoon.acting.FormValidatorAction"/>
</map:actions>
  
 |   | 
 
  |   |   | 
 
 
 
       
Lastly, we want to create an action set.  An action set is a group of
          actions that will be applied at once.  If the action set entry has an
          "action" parameter, then the specific action is only
          executed when the ACTIONNAME of the magic "cocoon-action-ACTIONNAME"
          request parameter matches the value of the "action" parameter.  For our
          purposes, the action set we are defining is listed below (defined in
          the sitemap):
 
  |   |   | 
 
  | 
  
<map:action-sets>
  <map:action-set name="process">
   <map:act type="form" action="Create Department">
     <map:parameter name="validate-set" value="add"/>
     <map:act type="dbAdd"/>
   </map:act>
   <map:act type="form" action="Update Department">
     <map:parameter name="validate-set" value="update"/>
     <map:act type="dbUpd"/>
   </map:act>
   <map:act type="dbDel" action="Delete Department"/>
  </map:action-set>
</map:action-sets>
  
 |   | 
 
  |   |   | 
 
 
 
       
Now that we have defined the actions we want, with the parameters that
          control them during run-time, we can use it in our pipeline.
 
  |   |   | 
 
  | 
  
<map:match pattern="*-dept.html">
  <map:act set="process">
    <map:parameter name="descriptor"
                   value="context://docs/department-form.xml"/>
    <map:parameter name="form-descriptor"
                   value="context://docs/department-form.xml"/>
    <map:generate type="serverpages" src="docs/confirm-dept.xsp"/>
    <map:transform src="stylesheets/apache.xsl"/>
    <map:serialize/>
  </map:act>
  <map:generate type="serverpages" src="docs/{1}-dept.xsp"/>
  <map:transform src="stylesheets/apache.xsl"/>
  <map:serialize/>
</map:match>
<map:match pattern="*-dept.xml">
  <map:act set="process">
    <map:parameter name="descriptor"
                   value="context://docs/department-form.xml"/>
    <map:parameter name="form-descriptor"
                   value="context://docs/department-form.xml"/>
    <map:generate type="serverpages" src="docs/confirm-dept.xsp"/>
    <map:serialize type="xml"/>
  </map:act>
  <map:generate type="serverpages" src="docs/{1}-dept.xsp"/>
  <map:serialize type="xml"/>
</map:match>
  
 |   | 
 
  |   |   | 
 
 
 
       
This may not seem clear what is happening right now.  The way actions
          work is if they return a null, nothing inside the "map:act"
          entry will execute, and the request processing will flow through to
          the second "map:generate" section.  This is a side affect of
          using the FormValidatorAction.  If we choose to create our own
          business objects and form validation framework, we are not constrained by
          this construct.
 
       
In addition, we changed the type of generator we are using: we have
          made it a "serverpages" (or XSP) generator.  We made the
          transition now so that we can report information on what failed to the
          user.  First, we need to convert our "create-dept.xml" file
          to an XSP page so that we can see the page again (right now we will
          get an error).  To do this, simply add a new tag to the base of the
          document called "xsp:page" declaring the XSP namespace.  The
          change will look like this:
 
  |   |   | 
 
  | 
  
<xsp:page xmlns:xsp="http://apache.org/xsp">
  <!-- The original document will be embedded here -->
</xsp:page>
  
 
 |   | 
 
  |   |   | 
 
 
 
       
To complete the transformation, we usually change the extension to
          ".xsp" so we know what we are dealing with at a glance.
          Create a new file called "confirm.xsp" with the following
          contents:
 
  |   |   | 
 
  | 
  
<xsp:page xmlns:xsp="http://apache.org/xsp">
<document>
  <header>
    <title>Department</title>
  </header>
  <body>
    <s1 title="Department Processed">
      <p>
        You have successfully processed the department.
      </p>
    </s1>
  </body>
</document>
</xsp:page>
  
 |   | 
 
  |   |   | 
 
 
           
      
Adding support for Error Reporting
        
In order to successfully report errors processing the page, add
           another namespace declaration to the "xsp:page" element.
           The final form page will look like this:
 
  |   |   | 
 
  | 
  
<xsp:page xmlns:xsp="http://apache.org/xsp"
          xmlns:xsp-formval="http://apache.org/xsp/form-validator/2.0">
<document>
  <header>
    <title>Department</title>
  </header>
  <body>
    <s1 title="Create a Department">
      <form handler="create-dept.html">
        <p>
          You can create a department by typing in the
          name and pressing the "submit" button.
        </p>
        <p>
          Name: <text name="name" size="30" required="true"/><br />
	       <xsp:logic>
	         if (<xsp-formval:is-toosmall name="name"/>) {
	             <xsp:text>"Name" must be at least 5 characters</xsp:text>
	         } else if (<xsp-formval:is-toolarge name="name"/>) {
	             <xsp:text>"Name" was too long</xsp:text>
	         }
	       </xsp:logic>
        </p>
        <submit name="Create Department"/>
        <note>
          * These fields are required.
        </note>
      </form>
    </s1>
  </body>
</document>
</xsp:page>
  
 |   | 
 
  |   |   | 
 
 
            
      
     
     
Adding Database Support with the ESQL Logicsheet
      
The "Create Employee" page is going to require database
         access so that we know which Department a new employee is assigned to.
         This is fairly easy to accomplish with the ESQL Logicsheet.  Again,
         when you use the ESQL logicsheet, you lose some of your separation of
         concerns.
 
  |   |   | 
 
  | 
  
<xsp:page xmlns:xsp="http://apache.org/xsp"
          xmlns:xsp-formval="http://apache.org/xsp/form-validator/2.0"
          xmlns:esql="http://apache.org/cocoon/SQL/v2">
<document>
  <header>
    <title>Employee</title>
  </header>
  <body>
    <s1 title="Create an Employee">
      <form handler="create-empl.html">
        <p>
          You can create a department by typing in the
          name and pressing the "submit" button.
        </p>
        <p>
          Name: <text name="name" size="30" required="true"/><br />
	       <xsp:logic>
	         if (<xsp-formval:is-null name="name"/>) {
	             <xsp:text>"Name" cannot be empty</xsp:text>
	         } else if (<xsp-formval:is-toolarge name="name"/>) {
	             <xsp:text>"Name" was too long</xsp:text>
	         }
	       </xsp:logic>
        </p>
        <p>
          Department:
          <select name="department">
            <esql:connection>
              <!-- declare the connection pool we are using -->
              <esql:pool>personnel</esql:pool>
              <!-- query execution blocks can be repeated -->
              <esql:execute-query>
                <!-- Find all departments and order them -->
                <esql:query>
                  SELECT department_id, department_name
                  FROM department ORDER BY department_name
                </esql:query>
               <!-- What to do with the results -->
                <esql:results>
                  <!--
                       A successful query that returns results
                       executes this block.  You can also embed
                       more "execute-query" blocks inside the
                       row-results.  That way you can have queries
                       that filter information based on the results
                       of other queries.
                  -->
                  <esql:row-results>
                    <option>
                      <xsp:attribute name="name">
                        <esql:get-string column="department_id"/>
                      </xsp:attribute>
                      <esql:get-string column="department_name"/>
                    </option>
                  </esql:row-results>
                  <!--
                       Other result types are "no-results" and
                       "error-results".  A successful query that
                       does not return results (an empty resultset)
                       will use the XML embedded in the "no-results"
                       section.  An unsuccessful query that throws
                       an exception will use the XML embedded in
                       the "error-results" section.
                  -->
                </esql:results>
              </esql:execute-query>
            </esql:connection>
          </select>
        </p>
        <submit name="Create Employee"/>
        <note>
          * These fields are required.
        </note>
      </form>
    </s1>
  </body>
</document>
</xsp:page>
  
 |   | 
 
  |   |   | 
 
 
 
      
As you can see ESQL is flexible and powerful, but the cost of that
         flexibility is a loss of readability.  Using a logicsheet to wrap
         information in a business object is another alternative.  Notice how
         ESQL works:
 
       
        
- 
First, we specify our connection information which will apply to
              all queries in the ESQL structure.
 
        
- 
Next, we specify our first query we are going to use.  Note that
              you can nest queries as well as have more than one in an
              "esql:connection" element.
 
        
- 
Lastly, we specify how we process the results.  There are three
              different types of results: "esql:row-results", 
              "esql:no-results", and "esql:error-results".
              This allows you to handle different scenarios easily.  It is
              inside the individual results elements that we can nest new
              queries to process.
 
       
 
 
       
A Note About Actions
        
Actions are the bread and butter of logic processing in Cocoon.
           There are a number of approaches that you can take when developing
           Actions.  You can create a specific action for each piece of
           business logic.  This approach is very heavy handed and requires you
           to spend a lot of development time creating actions.
 
        
The preferred method for creating actions is to provide a generic
           action that can handle a wide range of specific actions.  The
           Database Actions and Validator Actions are examples of this approach.
           They will read a configuration file specified by a parameter, and
           they will modify the specific results based on the configuration
           file.  In order to take advantage of this for your own Actions, you
           can extend the AbstractComplimentaryConfigurationAction.  Basically
           what it does is encapsulate the logic for reading and caching the
           Configuration information for your Action.
 
       
     
   
   
Redirects
      
Most web developers agree that redirecting a user based on input is a
         valuable and necessary part of web development.  In Cocoon there are
         only two locations where you can issue redirects: the Sitemap and
         Actions.  In essence, Cocoon does require you to plan so that redirects
         are only used when necessary.
 
      
One approach that is good to use is to require all traffic to go
         through a URL controlling action.  The Action will test to see if the
         user is logged in, and if not will send them to the login page.
         Another derivation on this approach is to test for a user's role,
         and if they do not have access redirect them to a different page.
 
   
   
Writing an Action
     
Writing an action is as simple as writing a Component that conforms to
        the Action interface.  Be sure to examine the different Actions that are
        in the org.apache.cocoon.acting package - you might find some abstract
        actions that you can extend.  Actions are Avalon Components, so you may
        want to read Avalon's Whitepaper for more information.
 
     
 
  | Actions will return a map that contains values that the sitemap
           administrator can use in the sitemap.  If the Action returns a null,
           then anything inside the "map:act" element will not be
           executed. | 
 
 
     
Return Values
      
The Action interface specifies that it returns a Map.  This Map is
         used for value substitution in the sitemap, and communicating
         information to other Actions.  When an Action is specified in the
         sitemap, it uses the following syntax:
 
  |   |   | 
 
  | 
  
<map:act type="my-action">
  <map:generate src="{source}"/>
  <map:transform src="doc2{theme}"/>
  <map:serialize/>
</map:act>
  
 |   | 
 
  |   |   | 
 
 
 
      
The above code snippet assumes you have an Action with the name
         "my-action" already specified.  It also assumes that there
         are two "parameters" returned from the action in the Map. The
         sitemap queries the returned Map for the "source" and
         "theme" values, and substitutes their values in place of the
         curly braces that referenced it.  In other words, when it sees the
         "map:generate" with an src attribute of "{source}"
         it looks in the Map.  For our discussion, let us say the value stored
         is "index.xml".  The Sitemap will perform the substitution
         so that the src attribute now containts "index.xml".
 
      
In the case that the above the action might return a null value.  In
         that case, everything inside the "map:act" element is
         skipped.  You can use this to good advantage like the *ValidatorActions
         do.  If everything is validated correctly, they return a Map.  If there
         is an error, they return a null, and place the information in Request
         attributes.
 
     
   
 
 
Cocoon Supplied Components
 
  
Cocoon supplies a number of different Components for your use.  The types
     of Components we will discuss here are Generators, Transformers,
     Serializers, Readers, and Actions.  This are the important Components that
     allow you to do you job.
 
  
Generators
    
A Generator will create SAX events for a SAX stream-whether it reads from
       an input stream or it generates it on the fly.  All built in generators
       are in the package "org.apache.cocoon.generation".
 
    
DirectoryGenerator
      
Reads a directory, and builds an XML document based on the contents.
         You can pass parameters to it to control how it behaves (note
         parameter names are case sensitive):
 
      
        
- 
dateFormat - a format string that you would use in the Java
              SimpleDateFormat object
 
        
- 
depth - the maximum number of directories deep the generator will
              look (defaults to 1)
 
        
- 
root - a regular expression to find the root directory
 
        
- 
include - a regular expression to declare the files/directories
              that will be included in the list
 
        
- 
exclude - a regular expression to declare the files/directories
              that will not be included in the list
 
      
 
 
      
When you use this Generator, you must have the Jakarta Regexp package
         installed in your WEB-INF/libs directory.  Also, the DirectoryGenerator
         is not Cacheable so the results will be generated fresh each time.
 
      
The resulting XML looks like this:
 
  |   |   | 
 
  | 
  
<?xml version="1.0"?>
<directory xmlns="http://apache.org/cocoon/directory/2.0"
           name="C:\path\dir\"
           lastModified="135432153351"
           date="11 Jun 2001">
  <file name="C:\path\dir\file.xml" lastModified="135432153351"
        date="11 Jun 2001"/>
</directory>
  
 |   | 
 
  |   |   | 
 
 
 
    
    
FileGenerator
      
This generator and the ServerPagesGenerator will be your most used
         generators.  The FileGenerator reads an XML file from an input source,
         and converts it into a SAX stream.
 
      
When you use this Generator, you must have a JAXP 1.1 compliant parser
         installed in your WEB-INF/libs directory.  You may also use the Xerces
         parser bypassing the JAXP requirement.  The FileGenerator is Cacheable,
         so the results will only be re-read when the file changes.
 
    
    
FragmentExtractorGenerator
      
This generator is used in conjunction with the
         FragmentExtractorTransformer (more on that in the transformers
         section).  The FragmentExtractorTransformer splits an XML document into
         smaller parts so you can treat each smaller part as a unique document.
         To see this in action, check out the Cocoon supplied 
         samples
         and click on the SVG Welcome page.
 
      
This Generator caches the results from the FragmentExtractorTransformer
         for quick retrieval later.  It is Cacheable, so the fragments are
         generated once and the cached version is read from that point
         forward.
     
    
    
HTMLGenerator
      
This generator is used to read in an HTML file that may not be properly
         formatted to comply with XML standards.  The result is properly
         formatted XHTML.
 
      
This generator requires the Tidy.jar file installed in the
         WEB-INF/libs directory.  The HTMLGenerator is Cacheable, so the results
         can be cached for application speedup.
       
    
    
ImageDirectoryGenerator
      
This generator is an extension of the DirectoryGenerator, so it has the
         same requirements.  It extends the markup to include two new attributes
         for the "file" element: "height" and
         "width".  The ImageDirectoryGenerator reads every GIF and
         JPEG file to get the dimensions.
 
      
This generator is not Cacheable (just like the DirectoryGenerator).
 
    
    
JspGenerator
      
This generator executes a JSP file and parses the result.  The JSP must
         generate valid XML, and be a file in the context.
 
      
This generator requires a JAXP 1.1 compliant parser or Xerces if your
         environment will not allow you to install one.  It is also not
         cacheable so the results are generated each time.
 
    
    
PhpGenerator
      
This generator functions just like the JspGenerator, but with PHP
         templates.  The PHP must generate valid XML, and be a file in the
         context.
 
      
This generator requires a JAXP 1.1 compliant parser and the
         phpservlet.jar file that comes from http://php.net.  Install the files
         in the WEB-INF/libs directory.  The PhpGenerator is not Cacheable.
 
    
    
RequestGenerator
      
This generator converts the Request object into an XML representation.
         It is best used for debugging purposes.  The resulting XML follows:
 
  |   |   | 
 
  | 
  
<request xmlns="http://xml.apache.org/cocoon/requestgenerator/2.0"
         target="index.html" source="context://docs/index.xml">
  <requestHeaders>
    <header name="HOST_NAME">johny-bravo.infoplanning.com</header>
    <!-- repeat for each header -->
  </requestHeaders>
  <requestParameters>
    <parameter name="form-param">
      <value>1</value>
      <!-- repeat for each value in "form-param" -->
    </parameter>
    <!-- repeat for each parameter -->
  </requestParameters>
  <configurationParameters>
    <parameter
       name="configurations">context://WEB-INF/cocoon.xconf</parameter>
    <!-- repeat for each parameter -->
  </configurationParameters>
</request>
  
 |   | 
 
  |   |   | 
 
 
 
      
The RequestGenerator does not have any special requirements for
         libraries, and it is not Cacheable.
 
    
    
ScriptGenerator
      
The ScriptGenerator uses the Bean Scripting Framework (BSF) and an
         associated interpreter to generate valid XML.  If you add language
         support, you will have to embed the following configuration
         information:
 
  |   |   | 
 
  | 
  
<add-languages>
  <!-- repeat the following for each language: -->
  <language name="kawa-scheme"
            src="org.gnu.kawa.bsf.engines.KawaEngine">
    <extension>scm</extension>
    <!-- repeat for each file extension -->
  </language>
</add-languages>
  
 |   | 
 
  |   |   | 
 
 
 
      
The ScriptGenerator requires that you have the bsf.jar in your
         WEB-INF/libs directory along with any jars for the script interpreters
         you use.  The ScriptGenerator is not Cacheable.
 
    
    
ServerPagesGenerator
      
The ServerPagesGenerator is the XML Server Pages (XSP) engine.  It
         automatically compiles a new Generator at runtime based on an input
         XML file.
 
      
This generator requires that you have a JAXP 1.1 compliant parser and
         XSLT engine installed in your WEB-INF/libs directory.  It also requires
         you to have the JDK's tools.jar file in your classpath.  If you
         reference any packages, they must also be in your classpath.  The
         created generator is not Cacheable.
 
    
    
StatusGenerator
      
The StatusGenerator is another debug tool.  It provides status
         information for the Cocoon engine.  The resultant XML is in the
         following format:
 
  |   |   | 
 
  | 
  
<statusinfo xmlns="http://apache.org/cocoon/status/2.0"
            xmlns:xlink="http://www.w3.org/1999/xlink"
            host="johnny-bravo.infoplanning.com"
            date="7/16/2001 1:16:42 pm">
  <group name="vm">
    <group name="memmory">
      <value name="total"><line>5213255</line></value>
      <value name="free"><line>12321211</line></value>
    </group>
    <group name="jre">
      <value name="version"><line>1.3.1</line></value>
      <value name="java-vendor"
             xlink:type="simple"
             xlink:href="http://java.sun.com/jdk/1.3/">
        <line>Sun Microsystems Inc.</line>
      </value>
    </group>
    <group name="operating-system">
      <value name="name"><line>Windows 2000</line></value>
      <value name="architecture"><line>x86</line></value>
      <value name="version"><line>5.0</line></value>
    </group>
  </group>
  <value name="classpath">
    <line>C:\tomcat\lib\tomcat.jar</line>
    <line>C:\jdk1.3.1\lib\tools.jar</line>
  </value>
</statusinfo>
  
 |   | 
 
  |   |   | 
 
 
 
      
The results are not cacheable, and do not require any special
         libraries.
 
    
    
StreamGenerator
      
The StreamGenerator is used to convert the Request's InputStream
         into a SAX XML stream.  Alternately, it will accept the magic form
         parameter "form-name" and read the input stream that the
         parameter points to.
 
      
This generator requires the JAXP 1.1 compliant parser (or Xerces).  It
         is not cacheable.
 
    
    
VelocityGenerator
      
The VelocityGenerator is used to convert the output from the Velocity
         template engine to a valid XML stream.
 
      
This generator requires Jakarta Velocity and a JAXP 1.1 compliant
         parser installed in WEB-INF/libs.  It is not Cacheable.
 
    
  
  
Transformers
    
Transformers read a SAX stream, manipulate the XML stream, and send the
       results to the next Component in the chain.  All built in generators are
       in the package "org.apache.cocoon.generation".
 
    
CIncludeTransformer
      
The CIncludeTransformer looks for instances of the
         "ci:include" element, and will embed another XML resource in
         your document.  That resource can be in the sitemap so you can include
         the results of processed XSP pages.  An example follows:
 
  |   |   | 
 
  | 
  
<document xmlns:ci="http://apache.org/cocoon/include/1.0">
  <ci:include src="cocoon://my-resource.xml"
              element="body"
              ns="http://mycompany.com/my-resource/1.0"
              prefix="res"/>
</document>
  
 |   | 
 
  |   |   | 
 
 
 
      
The Transformer will read the results from the sitemap, and embed it
         into this document with a new root element "body" using a new
         namespace (xmlns:res="http://mycompany.com/my-resource/1.0").
         The results are not cached.
 
    
    
FilterTransformer
      
The FilterTransformer will look for instances of an element you specify
         using parameters, and will not forward any SAX events for that element
         or any child elements.  You can pass parameters to it to control how it
         behaves (note parameter names are case sensitive):
 
      
        
- 
element-name - The name of the element to filter
 
        
- 
count - the number of times the element will be filtered
 
        
- 
blocknr - the element number that filtering begins
 
      
 
 
    
    
FragmentExtractorTransformer
      
This is transformation half of the FragmentExtractor.  This transformer
         sieves an incoming stream of xml with embedded SVG images and replaces
         the images with a xlink locator pointing to the image.  Ultimately this
         could be much more general, but currently it is mainly an SVG
         extraction.
 
    
    
I18nTransformer
      
This is Cocoon's port of Infozone Group's I18nProcessor.  The
         word i18n is a shorthand for the longer word
         "internationalization" (starts with 'i', ends with
         'n', and has 18 letters in the middle).  The
         internationalization transformer allows you to look up references by
         key in an XML dictionary.  This allows you to support your same
         business processes in many different countries.  You have to pass
         parameters to it so that it knows how to process i18n requests:
 
      
        
- 
default_lang - The default language if the requested language does
              not exist (two character country code)
 
        
- 
avalailable_lang_X - Language available by the dictionary (two
              character country code).  Replace the 'X' in the
              attribute with a number (1, 2, 3).
 
        
- 
src - The location of the dictionary file.
 
      
 
 
      
The I18nTransformer reads the request parameter "lang" to
         determine which language to display to the user.  To translate text
         either embed the text inside the "i18n:text" element, or the
         attribute name inside the "i18n:attr" attribute.
 
  |   |   | 
 
  | 
  
<document xmlns:i18n="http://apache.org/cocoon/i18n/2.0">
  <body>
    <s1 title="Test Title" i18n:attr="title">
      <p>
       <i18n:text>This is replaceable text.</i18n:text>
      </p>
    </s1>
  </body>
</document>
  
 |   | 
 
  |   |   | 
 
 
          
    
    
LDAPTransformer
      
The LDAPTransformer is a class that can be plugged into a pipeline to
         transform the SAX events which passes through this transformer into
         queries an responses to/from a LDAP interface.
 
    
  
 
 
The Sitemap
 
  
This section is meant primarily as a reference for the Sitemap Manager.
     The person in this role needs to have a better understanding of the sitemap
     than any other role.  The sitemap is a relatively new concept, and as such
     is subject to refinement.  There have been a couple of proposals to replace
     it with something else, but nothing has been started yet.
 
  
The Sitemap is composed of three major parts: component declaration,
     resource declaration, and pipeline declaration.  You will only use a few
     different types of components in the sitemap: Generators, Transformers,
     Serializers, Readers, Matchers, Selectors, and Actions.  Generators create
     XML and pass the results in a SAX stream.  Transformers read a SAX stream
     and manipulate the results on the way through.  Serializers read a SAX
     stream, and convert it into the servlet's output stream.  Readers read
     an input stream and copy the results to the servlet's output stream.  
     Matchers and Selectors are used to choose how to process an incoming
     request.  Lastly, Actions are used to perform logic only functions (no
     display logic).
 
  
Below is the root element of all sitemaps:
 
  |   |   | 
 
  | 
  
<map:sitemap xmlns:map="http://apache.org/cocoon/sitemap/1.0">
</map:sitemap>
  
 
 |   | 
 
  |   |   | 
 
 
 
    
Choosing your Components
      
As previously discussed, you may choose a number of components to use
         in your own system.  This section identifies the different components
         you can use, and what they do.  Before we begin, I must state that
         every component is declared in the "map:components" element
         of the Sitemap:
 
  |   |   | 
 
  | 
  
<map:components>
</map:components>
  
 
 |   | 
 
  |   |   | 
 
 
 
      
Generators
        
All generators are declared within the "map:generators"
           element that is a child of the "map:components"
           element:
 
  |   |   | 
 
  | 
  
<map:generators>
  <map:generator name="file"
                 src="org.apache.cocoon.generation.FileGenerator"/>
</map:generators>
  
 |   | 
 
  |   |   | 
 
 
            
        
Most Generators do not have configuration information, so the
           "map:generator" element is left empty.  If there were
           configuration information to pass to the generator, it would be
           placed inside the element.  As you can see in the sitemap snippet
           above, you declare a generator with the "map:generator"
           element, a "name" attribute, and a "src"
           attribute.  The "name" attribute is how you will refer to
           this specific type of generator from this point forward.  The
           "src" attribute is the fully qualified class name of the
           Generator class.  In fact this construct is the same for all
           component types - the only thing that changes is the elements that
           declare the type of Component we are dealing with.
 
      
             
 
 |