Emissions gridding surrogates and related files associated with the "new"
surrogates developed in 2003 and revised in spring 2004

* If you identify issues or have comments on the data, use the contact below. *

Contact: Madeleine Strum, EPA
         Office of Air Quality Planning and Standards
         Emissions, Monitoring and Analysis Division
         Air Quality Modeling Group
         strum.madeleine@epa.gov
         919 541 2383

-----------------------------------------------------------------------
HISTORY OF FILE CHANGES
-----
Original files posted August 2003

Change 3: MAY 2004
A) replaced the ports surrogate (800) with a new ports surrogate which
   contains additional ports not in the previous databases (new shapefile
   posted too)
B) replaced the navigable waterways surrogate (810) with an
   activity-weighted navigable waterways surrogate which uses data on
   activity on waterway links (new shapefile posted too). 810 is now
   gapfilled with the old navigable waterways surrogate (no number) as
   secondary, and then with ports and water as tertiary and quaternary
C) updated geodatabase for US per above changes and change E
D) changed water surrogate (350) (in SMOKE input files only) to remove a
   gridcell in 38009 which is miscoded due to an error in the shapefile.
   Note that we didn't correct the shapefile
E) replaced airport area surrogate (700) to correct base data gridding
   errors
F) added new categories to the surrogate cross reference file for U.S.,
   changed surrogate assignments for some categories
G) replaced Canadian surrogates due to errors detected in the gridded
   data; replaced geodatabase file
H) added new categories to Canadian cross reference file
I) added land area files formatted for SMOKE's biogenic processing
   ("bgpro" files)
J) updated surrogate documentation workbook, data dictionary and
   surrogate processes document to reflect above changes

Change 2: OCTOBER 2003
Major changes were
A) change the rural area surrogate (400) to rural land area.
   Rural land area was computed by subtracting the water area (using NLCD
   data) from the rural area (based on census data)
B) change the secondary/tertiary/quaternary surrogates used for
   gapfilling for many of the primary surrogates
C) change gap filling program routines (efficiency and easier QA)

Updated files are:
1) rural land area shape file: replaces rural area
2) data dictionary for US shape files: added rural land area and removed
   rural area
3) surrogate documentation workbook: changed references to "rural area"
   to "rural land area" and put in proper definition of rural land area,
   revised shapefile names, documented new choices of secondary and
   tertiary surrogates for gap filling, added information on quaternary
   surrogates, and made corrections to information on how we developed
   the airport points file
4) surrogate_development_process document: corrected gapfilling
   information in Table 1, changed rural area to rural land area, and
   added limitations of the airport area and airport points surrogates
5) gap_filling spreadsheet: changed due to new secondary, tertiary and
   quaternary surrogates to use for gap filling for some of the
   surrogates
6) SMOKE-ready surrogate (amgpro) files: rural land area change and gap
   filling changes
7) updated geodatabase: rural land area change
8) updated SAS programs (new gapfilling.sas program)

Change 1: SEPTEMBER 2003
A) corrected shapefiles for the airport area surrogate (bad FIPS codes)
   and roads surrogates (missing roads in Chatham County, Georgia).
   New airport area shapefile is dated 08/29/03, new roads shapefile is
   dated 08/21/03.
B) Updated geodatabase to reflect item A. New geodatabase is dated
   09/11/03.
C) corrected SMOKE files for US due to items A and B and a modification
   to the previous gap-filling approach. Under the new approach, we
   gap-fill ALL surrogates for all counties in which the primary
   surrogate was missing, regardless of whether the surrogate is needed
   for that county for the particular inventory being processed.
   Thus, even though most counties in the US have no orchards/vineyards,
   the orchards/vineyards surrogate is complete for all counties, because
   for ALL counties with no orchards/vineyards, the orchards/vineyards
   surrogate is gap-filled with agricultural land. Users need to be aware
   that for gap-filled surrogates, the actual surrogate name (e.g.,
   orchards/vineyards) is only applicable to counties that have that
   geographic feature. For counties without that geographic feature, the
   surrogate is actually the secondary or tertiary surrogate listed in
   the surrogate codes spreadsheet. The gap-filling spreadsheet lists,
   for each surrogate, which counties were gap-filled. The result of the
   gap-filling change is larger file sizes.
   New US SMOKE files (4, 12, 36 km grids) are dated 09/08/03.
D) corrected SMOKE file for 36 km grid for Canada due to errors in header
   line. New Canada SMOKE file (36 km grid) is dated 09/15/03.
E) Updated gap-filling spreadsheet related to item C. New gap-filling
   spreadsheet is dated 09/25/03.
F) Updated gap-filling SAS program to gap-fill all surrogates. New
   program called "gap_filling.sas" replaces program called
   "tertiary.sas".
G) Updated SMOKE cross-reference (amgref) files to add onroad SCCs that
   were not present in the previous (version 3 draft) HAP inventory and
   thus were not contained in the previous cross reference file. New
   SMOKE-ready Xref file is dated 09/15/03.
H) Updated SCC-to-surrogate cross-reference (surrogate assignments)
   spreadsheet per item G. New spreadsheet is dated 09/15/03.
I) Modified Surrogate_development_process.pdf (item 3 below) to reflect
   gap-filling procedural changes (item C above) and add a summary of
   available US surrogates. New documentation file is dated 09/25/03.
J) Replaced census-based Indian reservation boundaries shape file with
   data from the Bureau of Indian Affairs (BIA). Zip file is dated
   9/22/03.
K) Changes to surrogate documentation workbook to reflect updated
   secondary and tertiary surrogates and to document updated tribal lands
   shape file. New file is dated 9/25/03.
L) Updated data dictionary to remove census-based Indian reservations.
   New file is dated 9/24/03.

-----------------------------------------------------------------------

1) SMOKE-READY FILES: GRIDDED SURROGATE RATIOS IN SMOKE FORMAT

   amgpro.12km_041204.canadian.gz
   amgpro.12km_041604.us.gz
   amgpro.36km_041204.canadian.gz
   amgpro.36km_041604.us.gz
   amgpro.4km_041204.canadian.gz
   amgpro.4km_041604.us.gz

   DESCRIPTION: These are the surrogate profiles in SMOKE format. The
   header in each file gives a description of the grid. The SMOKE
   G_GRIDPATH and GRIDDESC files will need to be modified to be
   consistent with the header. All three grid resolutions are in Lambert
   Conformal projection and cover the same domain. The SMOKE utility,
   srgtool, can be used to window or resolve the 4km amgpro file to a
   coarser grid resolution for other modeling grids. The grid definition
   for the 4km grid is as follows:

      Projection       Lambert Conformal
      Units            Meters
      X Origin         -2736000
      Y Origin         -2088000
      X Cell Length    4000
      Y Cell Length    4000
      Columns          1332
      Rows             1008
      Alpha            33
      Beta             45
      Gamma            -97
      X Center         -97
      Y Center         40

   The U.S. SMOKE files are gap-filled, which means that if a particular
   surrogate was not available for a county, a secondary or tertiary
   surrogate was used in its place. For example, most counties have no
   orchards/vineyards in them. However, the "orchards/vineyards"
   surrogate has data for every county because we gap-filled using the
   "agricultural land" surrogate for counties with no orchards/vineyards.
   Thus, the "orchards/vineyards" surrogate is only "orchards/vineyards"
   for a few counties. The gap-filling spreadsheet we supply (item 6
   below) lists, for each surrogate, which counties were gap-filled.

   The Canadian profiles were developed by re-gridding 10km gridded
   surrogates provided by Environment Canada.
   The Canadian surrogate cross-reference file was also supplied by
   Environment Canada. These surrogates were not gapfilled; therefore, a
   SMOKE default surrogate may be required for the Canadian profiles (use
   900, population). Canadian emissions are spatially allocated from
   Province to grid, so a Province level inventory must be used with
   these surrogates.

   More documentation on the surrogates is presented in items 3 through
   7, as described below.

2) SMOKE-READY FILES: SCC-TO-SURROGATE CROSS-REFERENCES

   amgref_us_051704
   amgref_canada_041604

   DESCRIPTION: These two files contain the US and Canadian SCC-surrogate
   cross-references, in SMOKE format.

3) DOCUMENTATION: PROCESS FOR DEVELOPING SURROGATES

   Surrogate_development_process052804.doc

   DESCRIPTION: This document outlines the process used to create
   SMOKE-ready surrogate profiles. ArcGIS software was used to create a
   geodatabase file of the gridded surrogate areas by spatially
   intersecting the 4km grid with the shape files. SAS software programs
   were then used to calculate the surrogate ratios, gap fill areas
   without surrogates, format the final files and perform quality
   assurance checks. This document also summarizes the available
   surrogates and their codes. The approach for developing the Canadian
   surrogates is also included in this document.

4) DOCUMENTATION: SURROGATE DATA, DEFINITIONS AND CODES

   Surrogate_Documentation_Workbook052804.xls

   DESCRIPTION: This Excel workbook provides information about the
   surrogates as described below.

   The first worksheet in the workbook (Sources of US 2000 Surrogates)
   documents the US ArcGIS shapefiles. These are posted on EPA's CHIEF
   website: go to
   http://www.epa.gov/ttn/chief/emch/spatial/newsurrogate.html
   and scroll down to "data files" to get to the ftp site. This
   worksheet provides information on the surrogate data source, vintage,
   geographic extent, resolution, and other information. The associated
   ArcGIS Shapefile name is also identified.
   The second worksheet in the workbook (Airport Points Documentation)
   describes the data sources and procedures used to develop the "Airport
   Point" surrogate.

   The third worksheet in the workbook (Surrogate Defn.s) contains
   information on the surrogate ratios in the SMOKE gridded ratio files
   (item 1), which were derived from the shapefiles. This worksheet
   contains the ratio definition and computational method for those that
   are combined surrogates, such as "Housing Change and Population
   (140)".

   The fourth worksheet in the workbook (Surrogate Codes) lists the
   surrogates and the surrogate cross reference codes. Associated
   secondary and tertiary surrogates are also identified.

   The fifth worksheet in the workbook (roadway surrogates) describes the
   particular TIGER CFCC codes used to develop the roadway surrogates and
   shows how they are mapped to Federal Highway Road Classes.

5) DOCUMENTATION: SCC-TO-SURROGATE ASSIGNMENTS

   Surrogate_assignments_us_51904.xls

   DESCRIPTION: This spreadsheet lists the SCC, SCC Description,
   associated surrogate, and the surrogate code for the US emissions
   inventory, as well as the description of the surrogate used previously
   for criteria modeling (prior to the development of these new
   surrogates). Note that all of the SCCs in the 1999 NEI (version 2
   criteria and version 3 toxics) are contained in this spreadsheet. A
   few SCCs from version 3 criteria and the 2002 draft may also be
   included (these were added in May 2004).

6) DOCUMENTATION: GAP-FILLING INFORMATION

   Gap_Filling052504.xls

   DESCRIPTION: This Excel file identifies counties in the US that
   required secondary and tertiary surrogates. These were applied to fill
   in surrogate ratios for counties where the primary surrogate is zero
   for that particular county. Gap filling was performed for every
   surrogate that had at least some counties which did not have a value
   for that surrogate, regardless of whether the surrogate is needed for
   that county for the particular inventory being processed.
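
The gap-filling idea described in item 6 can be illustrated with a minimal
Python sketch. This is not the SAS program distributed with these files.
The function name gap_fill, the surrogate keys, the county identifiers and
the cell ratios are all hypothetical, and the in-memory layout (surrogate
-> county -> grid cell -> ratio) is an assumption made only for
illustration.

```python
from typing import Dict, List, Tuple

Cell = Tuple[int, int]                       # (column, row) of a grid cell
CountyRatios = Dict[str, Dict[Cell, float]]  # county id -> cell -> ratio


def gap_fill(ratios: Dict[str, CountyRatios],
             chain: List[str]) -> Tuple[CountyRatios, Dict[str, str]]:
    """Fill the primary surrogate (chain[0]) from the fallbacks in chain[1:].

    Returns the filled ratios and, for each county, the name of the
    surrogate that actually supplied the data.
    """
    # Every county known to any surrogate in the chain gets an entry.
    counties = set()
    for name in chain:
        counties.update(ratios.get(name, {}))

    filled: CountyRatios = {}
    used: Dict[str, str] = {}
    for county in sorted(counties):
        for name in chain:
            cells = ratios.get(name, {}).get(county)
            if cells:  # first surrogate in the chain with data wins
                filled[county] = dict(cells)
                used[county] = name
                break
    return filled, used


# Hypothetical data: county "00001" has orchards, county "00002" does not.
example = {
    "orchards_vineyards": {"00001": {(10, 20): 1.0}},
    "agricultural_land": {
        "00001": {(10, 20): 0.5, (11, 20): 0.5},
        "00002": {(30, 40): 0.25, (31, 40): 0.75},
    },
}
filled, used = gap_fill(example, ["orchards_vineyards", "agricultural_land"])
print(used)
```

In this sketch, county "00002" ends up carrying agricultural land ratios
under the orchards/vineyards name, which is why the gap-filling
spreadsheet should be consulted before interpreting a surrogate by its
name alone.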
   Gap-filling was not done for the Canadian surrogates.

7) DOCUMENTATION: REVIEW OF SURROGATE DATA SOURCES

   Review_of_existing_data_sources.pdf

   DESCRIPTION: An extensive review of existing sources of data that
   could potentially be used as emissions surrogates was performed prior
   to the development of the surrogates. This technical memorandum (in
   Adobe Acrobat format) identifies the origin, vintage, quality and
   completeness of these sources. Note that not all of the surrogates in
   the SMOKE ratio files (item 1) are covered in it, as some were found
   after this document was completed. The first spreadsheet (Sources of
   2000 US Surrogates) in the Surrogate_Documentation_Workbook.xls (item
   4 above) refers to page numbers in this document.

8) DOCUMENTATION: CANADIAN SURROGATES SUPPLIED BY ENVIRONMENT CANADA

   Canada_Emissions_Distribution_Techniques.doc
   Canadian_Surrogates.xls
   Canada_SIC_SCC.xls

   DESCRIPTION: A different set of surrogates was used for Canada to be
   compatible with their emissions inventory. The base data for most of
   the surrogates are proprietary and were not available to the US EPA
   for gridded surrogate development. Shape files of Canadian 10km
   gridded surrogates were provided by Environment Canada and then
   re-gridded to the 4km grid described above.

   The document "Canada Emissions Distribution Techniques" describes the
   techniques used by Environment Canada in the development of their 10km
   gridded surrogates.

   The Excel workbook, Canadian_Surrogates.xls, cross-references the
   emissions source category (SIC for Canada) to the surrogate profile
   number. This was provided by Environment Canada and was used to
   develop the surrogate profile, with the exception of the onroad
   vehicles (documented in item 6).

   The Excel workbook, Canada_SIC_SCC.xls, lists the SIC (used by
   Environment Canada) with the corresponding SCC. This crosswalk was
   extracted from the emissions inventory supplied by Environment Canada.
   Canadian surrogate profiles represent the ratio of the grid cell
   surrogate to the total province surrogate value. Ratios will sum to
   less than 1 for those provinces not entirely within the grid domain.
   The surrogate profiles identified above can only be used with a
   province level emission inventory.

Note: SAS is a registered trademark of SAS Institute, Inc. ArcInfo and
ArcGIS are registered trademarks of Environmental Systems Research
Institute, Inc.
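
As a rough illustration of the province-level ratios described under item
8, the following Python sketch divides each in-grid cell value by the
total for the whole province. The function name province_ratios, the cell
indices and the numbers are hypothetical and are not taken from the
distributed files.

```python
from typing import Dict, Tuple

Cell = Tuple[int, int]  # (column, row) of a grid cell


def province_ratios(cell_values: Dict[Cell, float],
                    province_total: float) -> Dict[Cell, float]:
    """Ratio of each grid cell's surrogate value to the province total."""
    if province_total <= 0:
        raise ValueError("province total must be positive")
    return {cell: value / province_total
            for cell, value in cell_values.items()}


# Hypothetical province: total surrogate value 100, of which only 75
# falls inside the modeling grid (two cells).
in_grid = {(1, 1): 25.0, (2, 1): 50.0}
ratios = province_ratios(in_grid, province_total=100.0)
total_ratio = sum(ratios.values())
print(ratios, total_ratio)
```

Because the denominator is the total for the whole province rather than
for the in-grid part only, the ratios in this example sum to less than 1,
so the share of the province outside the domain is not forced onto the
grid.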