Data Analysis Techniques Using Hydrocarbon and Carbonyl Compounds
Note: EPA no longer updates this information, but it may be useful as a reference or resource.
Definitions
1996
List of PAMS Target Volatile Organic Compounds
VOC Data Retrieval
Importance of VOC Data Validation
Information Critical to VOC
Data Analysis
VOC Data Validation Tasks
Tips and Tricks for VOC QC and Data
Analysis
Examples of Data QC
Example Data Validation
Tool: VOCDat
Spatial and Temporal Characteristics
Summary
References
[Workbook Table of Contents] [Top
of VOC Data Analysis] [Previous
Section] [Next Section]
PAMS Target Species
55 C2-C12 hydrocarbons
3 carbonyl compounds
NMHC - Nonmethane Hydrocarbons
Sum of identified species and unidentified mass from C2 through
C12
NMOC - Nonmethane Organic Compounds
Sum of NMHC and carbonyl compounds
VOC - Volatile Organic Compounds
Used in this presentation interchangeably with NMOC
|
Note that many definitions of NMHC, NMOC, and VOC exist; for example, definitions vary widely depending on analytical techniques. |
[Workbook Table of Contents] [Top of VOC Data Analysis] [Previous Section] [Next Section]
| AIRS No. | Abbreviation | Compound | Class |
| 43206 | acety | Acetylene | Olefin |
| 43203 | ethyl | Ethylene | Olefin |
| 43202 | ethan | Ethane | Paraffin |
| 43205 | prpyl | Propylene | Olefin |
| 43204 | propa | Propane | Paraffin |
| 43214 | isbta | Isobutane | Paraffin |
| 43280 | 1bute | 1-Butene | Olefin |
| 43212 | nbuta | n-Butane | Paraffin |
| 43216 | t2bte | trans-2-Butene | Olefin |
| 43217 | c2bte | cis-2-Butene | Olefin |
| 43221 | ispna | Isopentane | Paraffin |
| 43224 | 1pnte | 1-Pentene | Olefin |
| 43220 | npnta | n-Pentane | Paraffin |
| 43243 | ispre | Isoprene | Olefin |
| 43226 | t2pne | trans-2-Pentene | Olefin |
| 43227 | c2pne | cis-2-Pentene | Olefin |
| 43244 | 22dmb | 2,2-Dimethylbutane | Paraffin |
| 43242 | cypna | Cyclopentane | Paraffin |
| 43284 | 23dmb | 2,3-Dimethylbutane | Paraffin |
| 43285 | 2mpna | 2-Methylpentane | Paraffin |
| 43230 | 3mpna | 3-Methylpentane | Paraffin |
| 43246 | 2m1pe | 2-Methyl-1-Pentene | Olefin |
| 43231 | nhexa | n-Hexane | Paraffin |
| 43262 | mcpna | Methylcyclopentane | Paraffin |
| 43247 | 24dmp | 2,4-Dimethylpentane | Paraffin |
| 45201 | benz | Benzene | Aromatic |
| 43248 | cyhxa | Cyclohexane | Paraffin |
| 43263 | 2mhxa | 2-Methylhexane | Paraffin |
| 43291 | 23dmp | 2,3-Dimethylpentane | Paraffin |
| 43249 | 3mhxa | 3-Methylhexane | Paraffin |
| 43250 | 224tmp | 2,2,4-Trimethylpentane | Paraffin |
| 43232 | nhept | n-Heptane | Paraffin |
| 43261 | mcyhx | Methylcyclohexane | Paraffin |
| 43252 | 234tmp | 2,3,4-Trimethylpentane | Paraffin |
| 45202 | tolu | Toluene | Aromatic |
| 43960 | 2mhep | 2-Methylheptane | Paraffin |
| 43253 | 3mhep | 3-Methylheptane | Paraffin |
| 43233 | noct | n-Octane | Paraffin |
| 45203 | ebenz | Ethylbenzene | Aromatic |
| 45109 | m/pxy | m/p-Xylene | Aromatic |
| 45220 | styr | Styrene | Aromatic |
| 45204 | oxyl | o-Xylene | Aromatic |
| 43235 | nnon | n-Nonane | Paraffin |
| 45210 | ispbz | Isopropylbenzene | Aromatic |
| 45209 | npbz | n-Propylbenzene | Aromatic |
| 45208 | 124tmb | 1,2,4-Trimethylbenzene | Aromatic |
| 45207 | 135tmb | 1,3,5-Trimethylbenzene | Aromatic |
| 45211 | oetol | o-Ethyltoluene | Aromatic |
| 45212 | metol | m-Ethyltoluene | Aromatic |
| 45213 | petol | p-Ethyltoluene | Aromatic |
| 45218 | mdeben | m-Diethylbenzene | Aromatic |
| 45219 | pdeben | p-Diethylbenzene | Aromatic |
| 45225 | 123tmb | 1,2,3-Trimethylbenzene | Aromatic |
| 43238 | ndec | n-Decane | Paraffin |
| 43954 | nundc | n-Undecane | Paraffin |
| 43502 | form | Formaldehyde | Carbonyl |
| 43551 | acet | Acetone | Carbonyl |
| 43503 | aceta | Acetaldehyde | Carbonyl |
| 43000 | sum | Sum of target NMHC | |
| 43102 | NMOC | Total NMOC |
Abbreviations from the PAMS manual.
[Workbook Table of Contents] [Top of VOC Data Analysis] [Previous Section] [Next Section]
AIRS Requirements (for importing data to VOCDat):
- Use Report #370 Raw Data Conversion
- Select Card Image File (not workfile)
- Select Action Code = I (for insert)
- Unmark "Generate Workfile" and "Exceptional Data Options"
Example entry for AIRS data retrieval of PAMS hydrocarbon and carbonyl compound data.
|
State |
County |
Site |
Parameter |
Method |
Interval |
Begin Date |
End Date |
| 25 |
013 |
0008 |
L | 940601 | 940930 |
- For sites approved as a PAMS, monitor type=P will download all data from the sites
Example entry for AIRS data retrieval of PAMS hydrocarbon and carbonyl compound data.
| State |
County |
Site |
Parameter |
Method |
Interval |
Begin Date |
End Date |
Monitor Type |
| 09 | 940601 | 940930 | P |
Turbochrome
- CT DEP has documented how to prepare specially formatted *.TX0 files from Perkin-Elmer Auto-GC data
[Workbook Table of Contents] [Top of VOC Data Analysis] [Previous Section] [Next Section]
- Remove calibration data from ambient data.
- Identify operating problems (e.g., cold trap temperature problems).
- Correct misidentification problems.
- Identify potential contamination problems (e.g., shelter off-gassing, handling problems, nearby sources).
|
Result is a more robust database for analyses. |
[Workbook Table of Contents] [Top of VOC Data Analysis] [Previous Section] [Next Section]
Figure 1

[Workbook Table of Contents] [Top of VOC Data Analysis] [Previous Section] [Next Section]
- Sample collection specifications and special features.
- Sampling location description.
- Audit, blank collection, collocated sampler descriptions.
- Sample analysis and instrument calibration descriptions.
- Example calculations of concentrations and any data conversions.
- Laboratory quality control (QC) descriptions.
- Reported units, site, date, sample start and end times, specification of daylight or standard time.
- Treatment of missing data and data below detection.
[Workbook Table of Contents] [Top of VOC Data Analysis] [Previous Section] [Next Section]
- Assess audit results (accuracy).
- Assess laboratory and field blank results.
- Assess collocated sample results (overall precision) and replicate analyses (analytical precision).
- Compare reported speciation to other databases (including special studies data).
- Prepare univariate statistics of concentration and weight fraction:
- Stratify data by date, time of day, and sampling location.
- Determine completeness of data. - Employ graphical procedures including scatter, box-whisker, time series, and fingerprint plots.
- Employ internal consistency checks using ratios of individual species or species group concentrations to other species, NMHC, and CO.
- Example guidelines for flagging samples:
- Carbon fraction of a species exceeds 20 percent of the NMHC or is 3s above the mean of that species.
- Total unidentified NMHC exceeds 15 percent (or user-defined) or is negative (i.e., reported total NMHC is less than the sum of identified species).
- Normally abundant species present in low concentrations when concentrations of other species are high.
[Workbook Table of Contents] [Top of VOC Data Analysis] [Previous Section] [Next Section]
Overall
Total VOC --> Species groups --> Individual species
Inspect every species
Time Series
Inspect time series for the following:
- Large "jumps" or "dips" in the concentrations
- Periodicity of peaks, calibration carryover
- Expected diurnal behavior (i.e., isoprene)
- Expected relationships among species
- High single-hour concentrations of less abundant species
Scatter Plots
Prepare scatter plots of the following:
- Total NMOC vs. species group totals, vs. individual species
- Benzene vs. Toluene, Acetylene, Ethane
- Species that elute close together
- Isomers
- Other
Fingerprints
Prepare and inspect fingerprint plots for the following:
- Identify calibration data.
- Investigate hours surrounding suspect and invalid data.
- Obtain overall view of diurnal changes.
Additional Data
To further investigate outliers, use:
- Wind direction data
- Other air quality data (e.g., ozone, NOx)
- Subsets of data (e.g., high ozone days only)
- Industrial or agricultural operating schedules
- Traffic patterns
- Other
[Workbook Table of Contents] [Top of VOC Data Analysis] [Previous Section] [Next Section]
EXAMPLES OF DATA QC
- Species group time series - example of misidentification
- Species time series - example of contamination
- Species time series - change in species reporting
- Species time series - example of misidentification
- Scatter plot - example of misidentification
- Fingerprint - example of "typical" data
- Fingerprint - example of calibration data
- Species time series - example of calibration "carryover"
[Workbook Table of Contents] [Top of VOC Data Analysis] [Previous Section] [Next Section]
Figure 2

Time series plot of several species groups at Stafford, CT in 1994. Example of misidenitification of a parrafin for an unidentified peak. (Level 0, preliminary data, CT DEP)
[Workbook Table of Contents] [Top of VOC Data Analysis] [Previous Section] [Next Section]
Figure 3

[Workbook Table of Contents] [Top of VOC Data Analysis] [Previous Section] [Next Section]
Figure 4

[Workbook Table of Contents] [Top of VOC Data Analysis] [Previous Section] [Next Section]
Figure 5
[Workbook Table of Contents] [Top of VOC Data Analysis] [Previous Section] [Next Section]
Figure 6

[Workbook Table of Contents] [Top of VOC Data Analysis] [Previous Section] [Next Section]
Figure 7

[Workbook Table of Contents] [Top of VOC Data Analysis] [Previous Section] [Next Section]
Figure 8

[Workbook Table of Contents] [Top of VOC Data Analysis] [Previous Section] [Next Section]
VOCDat is a Windows-based, menu-driven program providing a graphical platform to:
- Display VOC data.
- Perform QC tasks on the data.
- Begin data analyses.
VOCDat was initially designed for auto-GC (hourly data) but may also be used for data collected over other intervals and in canisters.
VOCDat provides the following features:
- Import AIRS format or Turbochrome files.
- Edit data QC codes on-screen (keep log of changes).
- Compare data among sites.
- Prepare and print graphical displays.
- Export to AIRS format.
Figure 9



VOCDat allows the analyst to rapidly gain knowledge of the database. |
[Workbook Table of Contents] [Top
of VOC Data Analysis] [Previous
Section] [Next Section]
- Histogram
- Summary statistics
- Composition - box plots by time of day, date
- Inter-species relationships - scatter plot matrices
- Species
- Comparison between sites
- Change with time of day
- "Aged" vs. "Fresh" fingerprints
- Isoprene and acetylene diurnal profiles
- Surface vs. aloft VOC - Use of data subsets (stratifying data)
[Workbook Table of Contents] [Top of VOC Data Analysis] [Previous Section] [Next Section]
Figure 10

[Workbook Table of Contents] [Top of VOC Data Analysis] [Previous Section] [Next Section]
Figure 11

[Workbook Table of Contents] [Top of VOC Data Analysis] [Previous Section] [Next Section]
Figure 12

[Workbook Table of Contents] [Top of VOC Data Analysis] [Previous Section] [Next Section]
Figure 13

Box plot of total NMHC by hour (CST) during June 1996 at Chicago, IL (top) and Gary, IN (bottom). Note that nine data values greater than 500 ppbC were omitted from the Chicago box plot for clarity. (Level 1, AIRS data)
[Workbook Table of Contents] [Top of VOC Data Analysis] [Previous Section] [Next Section]
Figure 14

Box plots of total NMHC by date during June 1996 at Chicago, IL (top) and Gary, IN (bottom). Note that nine data values greater than 500 ppbC were omitted from the Chicago box plot for clarity. (Level 1, AIRS data)
[Workbook Table of Contents] [Top of VOC Data Analysis] [Previous Section] [Next Section]
Figure 15

[Workbook Table of Contents] [Top of VOC Data Analysis] [Previous Section] [Next Section]
Figure 16

[Workbook Table of Contents] [Top of VOC Data Analysis] [Previous Section] [Next Section]
Figure 17

[Workbook Table of Contents] [Top of VOC Data Analysis] [Previous Section] [Next Section]
Figure 18

[Workbook Table of Contents] [Top of VOC Data Analysis] [Previous Section] [Next Section]
Figure 19

[Workbook Table of Contents] [Top of VOC Data Analysis] [Previous Section] [Next Section]
Figure 20

[Workbook Table of Contents] [Top of VOC Data Analysis] [Previous Section] [Next Section]
Figure 21

[Workbook Table of Contents] [Top of VOC Data Analysis] [Previous Section] [Next Section]
Figure 22

[Workbook Table of Contents] [Top of VOC Data Analysis] [Previous Section] [Next Section]
Figure 23

[Workbook Table of Contents] [Top of VOC Data Analysis] [Previous Section] [Next Section]
Most of the QC and data analysis techniques shown may be applied to all four PAMS site types.
| Analysis/Procedure | Objectives |
| Data Validation: Scatter plots Time series Fingerprints Box plots Summary statistics |
Prepare robust database: identify outliers, invalid data; investigate diurnal behavior, relationships, patterns |
| Frequency Distributions | Overall view of database |
| Spatial and Temporal: Bar, line, box plots Maps |
Identify and explore spatial, temporal variations in data |
| Inter-Site Comparisons: Scatter, relational plots |
Assess transport, emission sources, species relationships |
| Inter-Species Comparisons: Scatter plot matrices Linear regression |
Investigate species relationships, emission sources |
[Workbook Table of Contents] [Top of VOC Data Analysis] [Previous Section] [Next Section]
Bastable H.G., Rogers D.P., and Schorran D.E. (1990) Tracers of opportunity and pollutant transport in Southern California. Atmos. Environ. 24B, 137-151.
Henry R.C., Lewis C.W., and Collins J.F. (1994) Vehicle-related hydrocarbon source compositions from ambient data: the GRACE/SAFER method. Environ. Sci. Technol. 28, 823-832.
LADCO (1995) Lake Michigan Ozone Study: 1994 data analysis report, version 1.1. Report prepared by Lake Michigan Air Directors Consortium, Des Plaines, MI, May.
LADCO (1996) Lake Michigan Ozone Study: 1995 data analysis report, version 1.1. Report prepared by Lake Michigan Air Directors Consortium, Des Plaines, MI, April.
Lewis C.W., Henry R.C., and Shreffler J.H. (1996) An exploratory look at hydrocarbon data from the enhanced ozone monitoring network., (submitted for publication).
Lindsey C.G., Dye T.S., Main H.H., Korc M.E., Blumenthal D.L., Roberts P.T., Ray S.E., and Arthur M. (1997) Air quality and meteorological data analyses for the 1994 NARSTO-Northeast Air Quality Study. Final report in preparation for Electric Power Research Institute, Palo Alto, CA by Sonoma Technology, Inc., Santa Rosa, CA, STI-94362-1511-FR.
Lurmann F.W. and Main H.H. (1992) Analysis of the ambient VOC data collected in the Southern California Air Quality Study. Report prepared for the California Air Resources Board, Sacramento, CA by Sonoma Technology, Inc., Santa Rosa, CA, STI-99120-1161-FR, Contract No. A823-130, February.
Magliano K.L. (1996) Descriptive analysis and reconciliation of emissions and ambient hydrocarbon data. Draft SJVAQS/AUSPEX technical topic team #5 report prepared by California Air Resources Board, Sacramento, CA.
Main H.H. and Roberts P.T. (1993) Validation and analysis of the Lake Michigan Ozone Study ambient VOC data. Draft final report prepared for the Lake Michigan Air Directors Consortium, Des Plaines, IL by Sonoma Technology, Inc., Santa Rosa, CA, STI-90217-1352-DFR, April.
McLaren R., Singleton D.L., Lai J.Y.K., Khouw B., Singer E., Wu Z., and Niki H. (1996) Analysis of motor vehicle sources and their contribution to ambient hydrocarbon distributions at urban sites in Toronto during the Southern Ontario oxidants study. Atmos. Environ. 30, 2219-2232.
Nelson P.F. and Quigley S.M. (1983) The m, p-xylenes: ethylbenzene ratio, a technique for estimating hydrocarbon age in ambient atmospheres. Atmos. Environ. 17, 659-662.
NESCAUM (1994) Preview of 1994 ozone precursor concentrations in the northeastern U.S. 5/1/94 draft report prepared by the Ambient Monitoring and Assessment Committee of the Northeast States for Coordinated Air Use Management, Boston, MA.
Stoeckenius T.E., Ligocki M.P., Cohen B.L., Rosenbaum A.S., and Douglas S.G. (1994a) Recommendations for analysis of PAMS data. Final report prepared by Systems Applications International, San Rafael, CA, SYSAPP94-94/011r1, February.
Stoeckenius T.E., Ligocki M.P., Shepard S.B., and Iwamiya R.K. (1994b) Analysis of PAMS data: application to summer 1993 Houston and Baton Rouge data. Draft report prepared by Systems Applications International, San Rafael, CA, SYSAPP94-94/115d, November.
Systems Applications International, Sonoma Technology Inc., EarthTech, Alpine Geophysics, and A.T. Kearney (1995) Gulf of Mexico Air Quality Study. Vol. I: summary of data analysis and modeling. Final report prepared for U.S. Department of the Interior, Minerals Management Service, Gulf of Mexico OCS Region, New Orleans, LA, OCS Study MMS-95-0038.
Zielinska B., Sagebiel J.C., Harshfield G., Gertler A.W., and Pierson W.R. (1996) Volatile organic compounds up to C20 emitted from motor vehicles; measurement methods. Atmos. Environ. 30, 2269-2286.
[Workbook Table of Contents] [Top of VOC Data Analysis] [Previous Section] [Next Section]
![[logo] US EPA](http://www.epa.gov/epafiles/images/logo_epaseal.gif)