Defining multiple document properties

You can define multiple document properties in a block of data in a PDF file. A block of data can be text, DataMatrix barcodes encoded as images, or both.
    Note:
  • Read the document property overview section to ensure you understand how document properties are used in RICOH ProcessDirector so you can take full advantage of your RICOH ProcessDirector feature.
To define multiple document properties:
  1. Open a PDF file in Adobe Acrobat Professional and either load a control file that contains a page group definition or define a page group.
  2. Left-click just above the top left corner of the block of data that you want to capture. Drag the mouse to draw a box around the block of data. You can later view the extracted values to verify your selection.
      Note:
    • Make the box you draw big enough to capture the longest occurrence with the most lines of data in your PDF files. Some characters in a PDF file have a larger white space buffer than other characters. For example, the left edge of a large capital letter might have up to a tenth of an inch of white space buffer that you might need to select in order to capture that letter or line.
  3. Select Define Multiple Properties from the popup menu.
  4. Type a Name for the block of multiple document properties. Do not use spaces or special characters (such as @, #, $, %, or - (dash)) in the name. You can use periods and underscores.
  5. Define which type of data to extract values from.
    • If you selected an area that only contains text, select Text under Select from.
    • If you selected an area that only contains barcodes, select Barcode image under Select from.
    • If you selected an area that contains both text and barcodes, select both Text and Barcode image.

      The text data is placed before the barcode data in the extracted string without an indicator of where the text data ends and the barcode data begins.

  6. Specify the page in each document from which document property data will be extracted. Do either of these steps:
    • Select Pages based on a rule, and then select a rule from the list. The default rule is First Front Only. You can also:
      • Click the Add content icon icon to define a new rule. See Defining a rule for more information.
      • Click the Rules Manager icon icon to go to the Rules Manager.
        Important:
      • The Last Back, Last Front, and Last Page rules do not work with the extraction of document property data.
    • Select Specific pages and type the page in each document that you want.

      If you specify multiple pages, RICOH ProcessDirector Plug-in for Adobe Acrobat extracts the document property data from the last specified page in each document. Examples:

      • You specify pages 2–4. If a document has four or more pages, the document property data is extracted from page 4. If a document has three pages, the document property data is extracted from page 3. If a document has two pages, the document property data is extracted from page 2.
      • You specify pages 2,4. If a document has four or more pages, the document property data is extracted from page 4. If a document has 2–3 pages, the document property data is extracted from page 2.
      • You specify pages 2–n. Because n represents the last page, the document property data is extracted from the last page if the document has two or more pages.
          Important:
        • If you specify only page n, RICOH ProcessDirector Plug-in for Adobe Acrobat does not extract the document property data from any page in a document.

  7. Use the Document Properties section of the definition window to select a document property and define the property extraction rule. This section contains the full text of the first line of the text data you selected. If you choose to edit an existing text block, this section contains all of the document properties you have defined for the text block. Follow these steps to define a new document property and the modifier extraction rule for it.
    1. Click the add icon (Add document property icon) to add a new document property definition row.
    2. Select a RICOH ProcessDirector document property from the Property list. You can define your own document property by typing in a document property name; however, you cannot use that document property in RICOH ProcessDirector. You should only define your own document properties when you are only using RICOH ProcessDirector Plug-in for Adobe Acrobat to export document property values to a text file.
    3. Select the Line of the text block from which you want to extract the selected document property. You can select the line using a top-down or a bottom-up reference. To select a line using a top-down reference, select 1 through n (where n is a positive whole number). To select a bottom-up reference, select Last or select Last - x (where x is the number of rows up from the last row). Instead of selecting the row value from the list, you can enter the row number directly into the Line field.
    4. Select the edit icon (Edit line icon) to display a Modify Text window where you define one or more modifier extraction rules to extract the exact document property you need.
    5. Choose one of the following modifiers:

      Content modifiers

      Modifier Action
      Remove Character Type one character or a blank character (use the space bar to type a blank character) that you want to remove from the value. The character is case-sensitive. Then select one of these buttons:
      • Remove all instances of the character

        The specified character is removed from all positions in the value.

        For example, an account number is: 324-1443255-11. You can type a - to remove all - characters from the value, producing 324144325511.

      • Remove leading characters

        The specified character is removed from the beginning of the value. For example, if you type a blank character, all blank characters are removed from the beginning of the value.

      • Remove trailing characters

        The specified character is removed from the end of the value. For example, if you type a blank character, all blank characters are removed from the end of the value.

      • Remove leading and trailing characters

        The specified character is removed from the beginning and end of the value. For example, if you type a blank character, all blank characters are removed from the beginning and end of the value.

      Substring by Position Select Beginning of Line or End of Line from the Starting From list. Select a number for First Position to indicate the location of the first character in the text value. Select a number for Number to Retain to indicate how many characters are retained.
      Substring by Delimiter Type a character or a blank character in the Delimiter field to indicate where the text value is split into separate string segments. The character and the text string are case-sensitive.

      Select Beginning of Line or End of Line from the Starting From drop-down menu.

      Select a number for First Position to define the position of the delimiter in the text string.

      Select a number for Number to Retain to define the number of text string segments to retain.

      These examples show how to select text string segments by specifying a delimiter:

      • For the account number 324-1443255-11, you can use - as the delimiter to split the value into these three text strings: 324, 1443255, and 11. Select Beginning of Line. To select the second and third text strings (1443255 and 11), select 2 for both First Position and Number to Retain.

      • For the mailing address Eldorado Springs CO 80025, you can use a blank character as the delimiter to split the value into these four text strings: Eldorado, Springs, CO, and 80025. Select End of Line.

        • To select the zip code, select 1 for both First Position and Number to Retain.

        • To select the state, select 2 for First Position and 1 for Number to Retain.

        • To select the city, select 3 for First Position and 10 for Number to Retain. By specifying 10 for Number to Retain, you can select city names with up to ten words.

      Pad with Character Select Beginning of Line or End of Line from the Padding Location list. Enter a character or a blank character as the padding character into the Character to Pad with field.

      Enter a number in the Minimum Padded Text Length field to define the minimum length of the text string. If the number of characters in the text string is less than this minimum length, padding characters are added until the text string equals the minimum length.

      When you use a modifier to define a text extraction rule, the Text to Modify field at the top of the Modify Text window contains the selected line plus any edits you make to the line. The Modified Value field to the right of a modifier displays the text that results when that modifier is applied to the text it received from either the modifier above it or the Text to Modify field (if you are defining the first modifier).

    6. Continue to apply modifiers until you extract the value you want from the selected line. Click the Add icon icon to add a new modifier. The Final Text field below the list of modifiers displays the final modified value, after all modifier extraction rules are applied.
      For the selected modifier, the Modifier Initial Text field at the bottom of the window displays the value before the modifier is applied. The Modified Text field displays the value after the modifier is applied.
    7. Use the modifier management icons near the top of the window to delete and reorder the modifier extraction rules. Use the Trash can icon icon to delete the selected modifier extraction rules. Use the up and down arrow icons to reorder the rules. The rules are applied to the line in order from top to bottom.
    8. Click the OK button to save the line extraction rule.
  8. Continue to define the other document properties you need to extract from a line in the block. You can select the same line that you used in another document property. If you need to delete a document property or want to rearrange their order, place a check mark in the box in front of a document property, and use the Trash can icon icon and the up down arrow icons.
  9. When you have finished defining document properties, click OK.
  10. Click Ricoh View Document Property Values to verify that the properties have the content you want.
  11. Optional: You can edit the text block definition by double-clicking its box or by right-clicking the box and selecting Edit.
  12. When you are ready to save all your enhancements to the PDF file, including the new document properties definition, click Ricoh Save Control File.
  13. Move the control file to a directory location used by a RICOH ProcessDirector server and include its name and location in a RICOH ProcessDirector IdentifyPDFDocuments step. This control file must also contain the page group definition that defines the documents in the PDF files processed by that step.