Causes of Haze Assessment (COHA)

Analysis by state

Analysis by class I

Analysis by region

Descriptive maps

PMF Modeling

For best results,
please use:

Internet Explorer 5
(or higher)

Netscape 6
(or higher)

more info...

PMF Modeling

Introduction

In order to identify the sources of aerosols in the Fish and Wildlife Class I Areas in the central and eastern United States, Positive Matrix Factorization (PMF) receptor model is applied to the 24-hr integrated aerosol chemical composition data obtained through the Interagency Monitoring of Protected Visual Environments (IMPROVE) program. PMF is applied to each site to generate profiles of source factors. Normalized source profiles and the quantitative source contributions for each resolved factor are calculated.

Model Description

PMF is a statistical method that identifies a user specified number of source profiles (i.e. relative composition particle species for each source) and source strengths for each sample period that reduce the difference between measured and PMF fitted mass concentration. Equations 1 – 4 show the major steps of the model calculation.

(1)

where: X (n * Sp) = a matrix of observed fine particulate species concentrations with the dimensions of number of observations by the number of species

G (n * f) = a matrix of source contributions by observation day with the dimensions of number of observations by the number of factors

F (f * Sp) = a matrix of source profiles with the dimensions of number of factors by the number of species

E (n * Sp) = a matrix of random errors with the dimensions of number of observations by number of species

The F and G matrices of the final solution are then normalized according to the following equations to determine the quantitative source contributions (Ci, µg/m3)) and profiles for each source (Si, µg/µg).

(2)

where: Si = the row of the source profile matrix for source i

Fij = the source profile value for specie j of source i

FMi = the calculated average total fine mass contribution for source i

(3)

where: Ci = the column of the source contribution matrix for source i

Gki = the source contribution on day k for source i

FMi = the calculated average total fine mass contribution for source i

The FM_i is determined by regression total PM_2.5mass concentrations in the kth sample (m_k) against estimated source contribution values.

(4)

Input Data

1. Download aerosol PM10 and PM2.5 mass and chemical speciation data including analytical uncertainty and minimum detection limit from the VIEWS web site.

2. Data are screened to remove the days when PM10 or PM2.5 mass concentration or concentration of any major aerosol component is missing.

Table 1 lists all the concentrations measured by IMPROVE and the fraction of the measurement values that are below the minimum detection limit based on all the data available from all the sites. The components with measurement values below the detection limit more than 50% of the time, except major components including EC3, OC1, Chloride and Sodium as marked gray, are not included in the PMF modeling and marked red in the table. As sulfate and sulfur are highly correlated (slope = 3, R2 = 0.96), only sulfur is included in the PMF modeling. Therefore, a total of 29 species are included in the PMF modeling.

3. Data value and associated uncertainty are calculated as:

If data is missing Then

data value = geometric mean of the measured values
uncertainty = 4 * geometric mean of the measured values

Else if data bellows detection limit

data value = 1/2 * detection limit
uncertainty = 5/6 * detection limit

Else

data value = measured data
uncertainty = analytical uncertainty + 1/3 * detection limit

Table 1. Fraction of the measurement values that are below the minimum detection limit. Red:>50% below detection limit, not included in PMF, Gray: >50% below detection limit, still included in PMF and Yellow: not included in PMF

Major Control Parameters

1. Running Mode: Robust Mode, the value of outlier threshold distance a = 4.0 (i.e. if the residue exceeds 4 times of the standard deviation, a measured value is considered outlier). The least squares formulation thus becomes:

2. Error Mode (decides the standard deviation of the data Sij):
EM = -12 (based on observed value) Sij = Tij

3. FPEAK and FKEY Matrix (controls the rotation) – default: 0 (central), may try different numbers

Determination of the number of sources (factors)

Then the following methods are used to decide how many factors (sources) should be chosen for each area.

1. The regression coefficients FMi should be positive. If the regression produces a negative coefficient, it suggests that maybe too many source factors have been used.

2. The sum of the scaled profiles should be less than unity. If any of them is much higher than unity (i.e. >2) it suggests that too few sources may have been chosen.

3. Other PMF output parameters:

a. Q value

d. Largest element in Rotmat matrix

4. Judgement based on literature, known CMB source profiles, and experience

Output Data

1. Source (factor) profiles for each site.

Source (factor) profiles (see example as shown in Figure 2) which represent the fraction of each aerosol component to total PM2.5 mass for PMF defined sources (factors) are calculated for each site.

Figure 2. Example source (factor) profile (unit: mg/mg) based on PMF modeling

2. Contribution of each source (factor) to aerosol mass and light extinction for each sampling day.

For each IMPROVE site in the Fish and Wildlife Class I Areas in the eastern and central US, contributions of each source (factor) to aerosol mass and light extinction for each sampling day are calculated (see example as shown in Figure 3).

Figure 3. Contributions of major aerosol sources to PM2.5 mass during the sampling days of 2004 (example).

Naming Factors

Generally, source factors are identified based on the PMF generated source profiles (signatures). The biomass smoke factor is dominated by OC/EC with significant amount of K, while secondary sulfate and nitrate factors are dominated by sulfate and nitrate, respectively. Major dust components such as silicon, calcium, iron, and potassium are present in the dust factor. Vanadium and Nickel, which are almost exclusively from oil combustion, are used as the signatures of the oil combustion source factor (e.g. shipping emissions). A mobile emission factor is identified with large fraction of OC/EC, and metals such as Zn, Pb and Cd. Generally, EC dominates in diesel emissions and OC dominates in gasoline emissions. Se, As and Sb are tracers for coal combustion sources. Of course, large amount of OC/EC and sulfate always exit in coal combustion emissions. Sodium and chlorine (or chloride) are the major aerosol species in sea salt aerosols. Emissions from Kraft recovery boilers of paper mills have been shown to consist largely of sodium and sulfate particles (i.e. Na₂SO₄) with lesser but significant amounts of Chlorine and Potassium.

Page last updated 20 September 2007