The sample pipeline consists of seven separate tasks and one interactive utility:
% source ~/.cshrc

Performing that step will also source opus_login.csh, because the installation script now adds a line to your ~/.cshrc file that sources opus_login.csh. This change is due to the new start-up procedures for processes.
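The added line looks something like this (a sketch only; the actual location of opus_login.csh depends on where OPUS was installed on your system):

source /usr/opus/definitions/opus_login.csh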
Next run the Process Manager (PMG) as a background task:
% pmg >& /dev/null &

You will get a Motif application window that looks something like this:
Now start up the Observation Manager (OMG) to monitor datasets as they progress through the sample pipeline. Type:
% omg >& /dev/null &

Another Motif application window should appear on your display:
The OMG can monitor only a single path at a time; by default it monitors the sample pipeline path, g2f. The first time you bring up the OMG, no datasets will be listed, since you haven't started any processes yet.
Next go back to the PMG and start the sample pipeline processes. From the "File" menu, click on "Select Process".
You should bring up one copy of all the processes listed:
Note that "g2f" is the name of one of the pipeline processes, as well as the name we've chosen for our sample path. Don't let that confuse you. This simply means we have the following files in the OPUS_DEFINITIONS_DIR directory:
g2f.path
g2f.resource

The path file describes the directories for all the processes we plan to bring up in the sample pipeline; the resource file describes the attributes of the single process g2f.
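If you want to inspect either file, one way (a sketch; the osfile_stretch_file utility is described in more detail later in this document) is:

% more `osfile_stretch_file OPUS_DEFINITIONS_DIR:g2f.path`
% more `osfile_stretch_file OPUS_DEFINITIONS_DIR:g2f.resource`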
See the question under the PMG section about bringing up a pipeline process for more details about selecting processes. Once you have finished your process selection, click on the "Start" button. You should see a number of processes listed in the PMG, all in the IDLE state. They are waiting for work to do.
To start data flowing through the pipeline, the input GIF data files need to be moved to the input directory for the sample pipeline. This input directory is defined in the resource file for the gifin process, under the INPATH entry, which looks something like:
INPATH = gif_data    ! Directory where the input files are found
                     ! This entry is in gifin.resource

Find the corresponding entry for "gif_data" in your g2f.path file:
gif_data = ~/opus_test/g2f/input/

So in this example the gifin process will look for input data files in the directory ~/opus_test/g2f/input (the translation of INPATH for the gifin process).
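You can confirm the translation yourself with something like the following (a sketch; osfile_stretch_file is explained later in this document):

% set pathname = `osfile_stretch_file OPUS_DEFINITIONS_DIR:g2f.path`
% grep gif_data $pathname
gif_data = ~/opus_test/g2f/input/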
You might wonder why you have to move the data from the configured area to your local data tree -- why the software doesn't just look for the files in the configured tree. There are two reasons.
First, the data files get renamed during pipeline processing. If more than one user is trying to run the pipeline from the same configured tree at the same time, the renaming of these files would conflict between users. Therefore, it's safer and cleaner to move the files to be processed into a user's local area. (Or, if you're running the pipeline from the CD-ROM, you cannot rename the files in place!)
The other reason for copying the input data files into your own directory is that by doing so you can choose just how much data you want to process at a given time. There are 256 GIF data files. You may not want to drop them in the pipeline all at once! You certainly can do that, but you may want to start out with just a few until you are satisfied that your environment is set up correctly.
We have provided an interactive utility to help you move the data. The utility is called dat2gif (it will also rename the files from "*.dat" to "*.gif" during the move). It requires two input parameters; the script will prompt you for them. Wildcards are accepted:
% dat2gif
File(s) to copy: /home/joe/gif/gif95*
Path name (e.g., g2f): g2f

As files appear in the gifin input directory, the pipeline processes listed in the PMG should begin to list dataset names -- the status fields should change from the IDLE state to indicate work is being done.
At the same time, datasets should start to appear on the OMG display as well, indicating which processing stages have completed (c), are awaiting processing (w), are currently processing (p), or have encountered an error (e).
When you have reached this stage you are running an OPUS pipeline!
How much disk space do I need to run the sample pipeline?
The sample pipeline uses only flat files for input and output. However, there is nothing to prevent you from writing your own database-dependent applications. (The HST pipeline uses a database extensively, for example.) But in order to keep the sample pipeline simple, we have removed any database dependency.
First of all, the public release images (the original GIF files in the sample pipeline) are often composites, not raw science data. They have been processed to make a scientific point, but in so doing, some of the original signal has been modified or lost.
Second, the header information in the FITS files is simulated. Since the original public release images were often composites of separate exposures it was not possible to correlate the image with a specific observation. The keyword values are taken from an exposure which might be similar to one of the composite observations.
First, on the same machine, bring up several copies of the g2f task. Use the Process Manager (PMG) to select the same task on the same machine several times. The pipect task will monitor the pipeline throughput and produce a summary report when it is terminated. That report, described in the first question, as well as the wall clock time required to complete the sample dataset, should demonstrate the benefit of multiple instances.
It is still possible to swamp the resources of a single machine with too many tasks. Try bringing up several copies of the g2f process on different machines to determine what mix of tasks and machines best suits your configuration.
This will delete all the data from the processing directories. You will not have to terminate or restart any of the pipeline processes to rerun the data, but note that you will still be using the same pipect.log file as in the initial run. (pipect.log is the output of the pipect process -- the pipeline process that tracks pipeline statistics.) If you wish to start with a new pipect.log file, you will have to terminate your pipect process and then manually delete (or rename, if you wish to save it) the file:
~/opus_test/[path]/fits/pipect.log

(where [path] is the path the data is being processed through, e.g., "g2f"). You can then start up a new pipect process. All that you need to do now is run the dat2gif command to copy the gif*.dat files into the input directory as you did for the initial run of the data.
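Put together, a rerun with a fresh log might look like this (a sketch for the g2f path, run after terminating the pipect process from the PMG; the /home/joe file names are the same illustrative ones used earlier):

% mv ~/opus_test/g2f/fits/pipect.log ~/opus_test/g2f/fits/pipect.log.run1
% dat2gif
File(s) to copy: /home/joe/gif/gif95*
Path name (e.g., g2f): g2f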
You will want to make sure that the pipeline output directories are clean before rerunning the data; otherwise, anomalous pipeline behavior will result.
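If you need to clean them by hand, something like the following would do it (a sketch; adjust the directory names to match your own path file, and note that the exact set of output file types is an assumption based on the sample tasks described in this document):

% rm ~/opus_test/g2f/input/*.gif
% rm ~/opus_test/g2f/fits/*.fits ~/opus_test/g2f/fits/*.lis ~/opus_test/g2f/fits/*.hdr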
The observation which was being processed when the task exited can easily be identified. One way is to examine messages in the process log file just before the crash. It is good practice to have each process print the name of the exposure it is beginning to process for this kind of troubleshooting. Another way to determine what the process was doing when it exited is to use the OMG. The OMG column for the "absent" process should be marked with an "x" on one of the lines on the display.
Often the problem can be traced to "bad data", an unexpected value in the data stream which was not handled correctly by the software. If you have confidence that no other observation is likely to have the same problem, then you can use the PMG to start up another copy of the failed process. Or, if you already have multiple instances of the failed process running in the pipeline, then you probably will not notice any consequence of the failed process besides the failure of a single observation; other instances will just have to do the work of the failed process.
gif9508x01
gif9508x02
gif9508x03
gif9508x04
gif9508x05

We intentionally inserted a few error cases to demonstrate how the Observation Status Files (OSF) are set in the case of a processing error.
You can check this by listing the files in your OPUS_HOME_DIR directory:
% ls $OPUS_HOME_DIR/*_*

The process status files probably all contain at least one underscore.
Alternatively, to determine where these files are kept, search the path file for the OPUS_OBSERVATIONS_DIR definition:
% set pathname = `osfile_stretch_file OPUS_DEFINITIONS_DIR:g2f.path`
% grep OBS $pathname
OPUS_OBSERVATIONS_DIR = /home/mydir/obs
% ls /home/mydir/obs/

The OPUS utility osfile_stretch_file finds the first disk file g2f.path under the "stretched" environment variable OPUS_DEFINITIONS_DIR (similar to a Unix path). Since OPUS_DEFINITIONS_DIR can be defined to stretch through one or more local directories and then through the OPUS system directories, the utility searches the directories in the stretch for the first occurrence of the file. This allows the user to create local copies of some OPUS system files that override the official copies in the OPUS system directory tree.
To force the use of the sort option of glob, set an environment variable in your login shell or in your OPUS_DEFINITIONS_DIR:opus_login.csh file:
setenv OPUS_SORT_FILES
This will cause OPUS to perform all file searches using the sorted glob (which collates in LC_COLLATE order; see your operating system documentation for more information). This will typically result in alphabetically ordered searches.
However, you cannot necessarily run multiple copies of every process. For instance, if you attempt to start up more than one gifin task, you will get only one instance of it. This is because gifin is restricted in the OPUS_DEFINITIONS_DIR:pmg_restrictions.dat file.
Then, in the Process Manager (PMG), when selecting the processes to run in your pipeline, specify the new path you have defined.
The easiest way to do this is to save your pipeline as a file, edit the path names in that file, save it under a new name, and load that pipeline definition in the PMG, as sketched below.
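For example, assuming the PMG saved your pipeline definition to a file called mypipeline.pmg (a hypothetical name; use whatever name you chose when saving), you could substitute a new path name for g2f in one step:

% sed 's/g2f/mypath/g' mypipeline.pmg > mypath_pipeline.pmg

Here mypath stands for the path you have defined. Check the result before loading it into the PMG, since g2f is also the name of a pipeline process, and a blanket substitution will touch those entries as well.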
Note that a "block" is 512 bytes. An easy way to view the resource file is described in the Process Resource Files document.
OUTPATH = fits_data ! Directory where output files are written
And in the g2f.path file (if that's the path you are running in), "fits_data" is defined as:
fits_data = ~/opus_home/g2f/fits/

So in this example the g2f process will place output data files in the directory ~/opus_home/g2f/fits (the translation of OUTPATH for the g2f process).
An easy way to view the resource file is described in the Process Resource Files document. An easy way to look at your path file is to bring up the "Select Process..." option in the "File" menu of the Process Manager.
Then double-click on the name of the path you are interested in.
xv can display the FITS files as well as the GIF files. The FITS files can also be viewed with a true FITS display package: SAOimage can handle FITS files, as can IDL. xv will also decompress compressed images, so it can be used to view the compressed FITS files as well.
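For example (a sketch; the file names below are the sample dataset names used elsewhere in this document, and the names on your disk may differ):

% xv gif9508x01.gif &
% xv picasso_raw.fits &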
% fitsverify picasso_raw.fits

FITS++ Verification Program Version 1.10
Wed Aug 20 11:27:18 1997
===================================================================
FITS Verification for file: picasso_raw.fits
===================================================================
Summary contents of FITS file: picasso_raw.fits
  0: Primary Array ( BYTE ) 2 dims [370,495]
     183150 bytes, 243 header lines, 71 FITS blocks
No special records.
===================================================================
No problems were encountered.

Note that the version number may change over time.
We have included in the distribution a task called listhead which will list the headers of the FITS file. The task runs as part of the sample pipeline and produces ASCII files of header keywords and values in the output directory.
The task can also be run interactively. For example, just type:
% listhead 9707_raw.fits

This command will produce a file called 9707_raw.lis which contains the keyword lines in ASCII format. The sample pipeline would name the ASCII header file for this example gif9707.hdr.
The listhead task can now handle wildcards in the source argument, as well as directory/logical names in the destination argument:

listhead \*.fits
listhead \*.fits N_DADS_DIR
listhead OCAL:\*.fits O_DADS_DIR
listhead o3s41010q.fits -f MYDISK:o3s41010q.hdr

The first form simply produces *.lis files. The "\" is required before the wildcard. The "-f filename" syntax is REQUIRED to use an output filename other than inputfilename.lis.
The normal pipeline processing is linear and sequential:

IN------>KW------>FT------->LH------->CZ

OPUS allows you to have any number of these processes up at a time:
                  FT
IN------>KW------>FT------->LH------->CZ
                  FT                  CZ
                  FT

In addition we have provided another sample task that runs in parallel with the getkw (KW) task. Since the KW and the HD tasks do not require exclusive access to the same resources, they can and do run simultaneously:
         HD------>FT
IN------>KW------>FT------->LH------->CZ

The g2f (FT) process will wait until both the HD and the KW tasks are complete for a dataset before proceeding.
OSF_RANK = 1                   ! First Trigger
OSF_TRIGGER1.FT = w            ! Need a 'Wait' in FT stage
OSF_TRIGGER1.KW = c            ! Need a 'Complete' flag in KW column
OSF_TRIGGER1.HD = c            ! Need a 'Complete' flag in HD column
OSF_TRIGGER1.DATA_ID = gif     ! Also need the Data_id set to GIF
By way of example, assume you wanted the following three pipelines:
production pipeline:
         HD------>FT
IN------->KW------>FT-------->LH------->CZ
                   FT                   CZ
                   FT

quicklook pipeline:

IN------->KW------>FT-------->LH

reprocessing pipeline:

         HD------>FT
IN------->KW------>FT-------->CZ

By slightly modifying the process resource files, and changing their names to make them distinct, you can set up a variety of pipelines which can be run simultaneously.
The TASK line of your new file must point to the name of the new resource file. So, for example, if you are copying and renaming the g2f.resource file to myg2f.resource, you need to change the task line from

TASK = < g2f -p $PATH_FILE >

to

TASK = < g2f -p $PATH_FILE -r myg2f >

Also, the process resource files must not refer to stages which are not present in the path. Thus if the g2f process ordinarily refers to the HD process:
OSF_TRIGGER1.FT = w            ! Need a 'Wait' in FT stage
OSF_TRIGGER1.KW = c            ! Need a 'Complete' flag in KW column
OSF_TRIGGER1.HD = c            ! Need a 'Complete' flag in HD column

and if one pipeline does not contain an HD task, then OPUS will complain violently about the third line above. To ensure that the task will run in such a pipeline, modify your new resource file and remove any reference to the HD task. If the task is mentioned in more than one resource file, be sure to copy, rename, and modify all of them.
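In the quicklook pipeline above, for example, the trigger block in the renamed resource file would reduce to the following (a sketch showing only the removal of the HD reference; any other entries in your resource file stay as they are):

OSF_TRIGGER1.FT = w            ! Need a 'Wait' in FT stage
OSF_TRIGGER1.KW = c            ! Need a 'Complete' flag in KW column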
The next step is to determine how the data in each of the pipelines is to flow, that is, which directories will be used. In the simple sample pipeline you might just want three different input directories and three different output directories. In that case you need to create those directories in your environment. For example, if you started with:
Input directory:  /home/my_name/opus_test/g2f/input/
Output directory: /home/my_name/opus_test/g2f/fits/

You might then create a unique set of directories for each path:
Input directories:
  /home/my_name/opus_test/prod/input/
  /home/my_name/opus_test/quick/input/
  /home/my_name/opus_test/repro/input/

Output directories:
  /home/my_name/opus_test/prod/fits/
  /home/my_name/opus_test/quick/fits/
  /home/my_name/opus_test/repro/fits/
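One way to create these directories (a sketch; it also creates the obs directories that the path files below will reference):

% mkdir -p /home/my_name/opus_test/prod/input /home/my_name/opus_test/prod/fits /home/my_name/opus_test/prod/obs
% mkdir -p /home/my_name/opus_test/quick/input /home/my_name/opus_test/quick/fits /home/my_name/opus_test/quick/obs
% mkdir -p /home/my_name/opus_test/repro/input /home/my_name/opus_test/repro/fits /home/my_name/opus_test/repro/obs

The next step is to define three different path files. You can use the g2f.path as a template for the three paths. You might start out with: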
STAGE_FILE = OPUS_DEFINITIONS_DIR:g2f_pipeline.stage
OPUS_OBSERVATIONS_DIR = /home/my_name/opus_test/g2f/obs/
gif_data = /home/my_name/opus_test/g2f/input/
fits_data = /home/my_name/opus_test/g2f/fits/
hdr_data = /home/my_name/opus_test/g2f/fits/
sample_db = /home/my_name/opus/db/

You would want to create three paths with the names of the different directories substituted:
prod.path
STAGE_FILE = OPUS_DEFINITIONS_DIR:prod_pipeline.stage
OPUS_OBSERVATIONS_DIR = /home/my_name/opus_test/prod/obs/
gif_data = /home/my_name/opus_test/prod/input/
fits_data = /home/my_name/opus_test/prod/fits/
hdr_data = /home/my_name/opus_test/prod/fits/
sample_db = /home/my_name/opus/db/

quick.path
STAGE_FILE = OPUS_DEFINITIONS_DIR:quick_pipeline.stage
OPUS_OBSERVATIONS_DIR = /home/my_name/opus_test/quick/obs/
gif_data = /home/my_name/opus_test/quick/input/
fits_data = /home/my_name/opus_test/quick/fits/
hdr_data = /home/my_name/opus_test/quick/fits/
sample_db = /home/my_name/opus/db/

repro.path
STAGE_FILE = OPUS_DEFINITIONS_DIR:repro_pipeline.stage
OPUS_OBSERVATIONS_DIR = /home/my_name/opus_test/repro/obs/
gif_data = /home/my_name/opus_test/repro/input/
fits_data = /home/my_name/opus_test/repro/fits/
hdr_data = /home/my_name/opus_test/repro/fits/
sample_db = /home/my_name/opus/db/

Finally, since each path has a different number of steps, each needs its own path-specific pipeline.stage file. The prod_pipeline.stage file would be identical to the g2f_pipeline.stage file, and only has to be copied:
% cd /home/my_name/opus_test/definitions
% set fname = `osfile_stretch_file OPUS_DEFINITIONS_DIR:g2f_pipeline.stage`
% cp $fname prod_pipeline.stage

However, the other two paths have fewer steps and require modification of the g2f_pipeline.stage file. For example, the quick_pipeline.stage file would appear as:
NSTAGE = 4
STAGE01.TITLE = IN
STAGE01.DESCRIPTION = "GIF INIT"
STAGE02.TITLE = KW
STAGE02.DESCRIPTION = "Database select"
STAGE03.TITLE = FT
STAGE03.DESCRIPTION = "GIF to FITS"
STAGE04.TITLE = LH
STAGE04.DESCRIPTION = "List FITS Header"
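By the same pattern, the repro_pipeline.stage file might look like the following (a sketch; the stage order and the HD and CZ descriptions are assumptions based on the g2f and quicklook examples above):

NSTAGE = 5
STAGE01.TITLE = IN
STAGE01.DESCRIPTION = "GIF INIT"
STAGE02.TITLE = KW
STAGE02.DESCRIPTION = "Database select"
STAGE03.TITLE = HD
STAGE03.DESCRIPTION = "List GIF Header"
STAGE04.TITLE = FT
STAGE04.DESCRIPTION = "GIF to FITS"
STAGE05.TITLE = CZ
STAGE05.DESCRIPTION = "Compress FITS"

Then you need to change which path is being monitored in the OMG. Cleaning, copying, or deleting the data you create in your new path will also require corresponding configuration files.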