APPENDIX

 

Gap (Filling) Strategies

 

> Missing Data and Data Rejection Criteria: Multiple Causes

System/Sensor Breakdown

Raw data (off-scale, out of ensemble range)

Spikes

Anemometer/Rotation Angle

Missing Correction Terms (Pressure, Temperature, Profile measurements for CO2 or heat storage)

Stationarity Tests

Integral Turbulence Characteristics

Foot-print/Source Weight

Biological Constraints (Growing season - Leafless period, Bud Break - Leaf Senescence)

Physical Constraints (Available Energy, Energy Balance Closure)

 

About 36 % gaps and rejected data are found in the seasonal and daily courses of NEE at Walker Branch 1997 (Fig.1). The rejection or gap probability decreases using the CO2 flux measured above the canopy and not corrected for storage in the canopy air layer or drift. Those terms are determined independently and can add additional gap percentages.

Figure 1: Gaps and rejected data (white) in the seasonal and daily courses of NEE at Walker Branch 1997.

 

 

> Documentation of Applied Rejection Criteria

 

Applied rejection criteria for flux (CO2 Flux, LE, H, Momentum Flux), storage (CO2, LE, H in air layer, H in biomass), and main meteorology variables should be documented. Documentation would be useful in interpreting gap periods, decision which gap filling strategy is most applicable, and investigating and possibly reducing rejection causes. A binary flag system of rejection criteria for each variable would allow tracking multiple causes of data rejection. Each bin of the binary code represents one particular reason for data rejection. Instead of the binary value the corresponding decimal number could be stored as in the flag column of each variable:

1

0

0

1

0

1

 

 

> Linear Interpolation

 

Linear interpolation is often being used for small gaps (2-3 half-hourly means missing), and especially useful to complete meteorological variables as temperature or relative humidity. Applied to NEE data the results are conflicting: depending on the size of the gaps more or less reasonable results are obtained (Fig.2).

 

Figure 2: Linear interpolation used to fill gaps in seasonal and daily courses of NEE at Walker Branch 1997.

 

 

> Mean Day Course based on half-hourly or hourly data

 

This gap filling method uses mean daily courses to fill gaps in half-hourly or hourly data. Differences in methods are based on the length of the time interval, which is used to calculate the mean daily course (usually 4 to 15 days). Further, a so-called running mean can be applied (i.e. the 15 days previous to the gap are used to calculate the daily course). On the other hand, the entire time period can be cut into short periods, where gaps within one period are filled with the representative half-hourly value from the mean daily course for that period.

Whereas monthly intervals - especially during changing weather, bud-break or leaf-fall periods - are supposed to be too large (see standard deviations in Fig. 3), 15 day or shorter periods are often applied and on an average seem to show smaller standard deviations (Fig.4). But interpretation of this figure remains somewhat difficult, as the standard deviations in shorter time periods are strongly dependent on weather changes during the investigated period.

 

Figure 3: July mean day course of NEE at Walker Branch 1997.

 

Figure 4: Frequency distribution of the standard deviations for mean half-hourly values of NEE; the period for taking the mean were 28, 24, 20, 16, 12, 8, and 4 days during 1997 at Walker Branch.

Figure 5: Running bi-weekly mean daily course to fill gaps in seasonal and daily courses of NEE at Walker Branch 1997 on a hourly data basis.

 

Figure 6: Running bi-weekly mean daily course to fill gaps in seasonal and daily courses of NEE at Walker Branch 1997 on a half-hourly data basis.

 

 

> Semi-empirical Gap Filling

 

Gap filling for NEE:

Response to Temperature and Photosynthetic Photon Flux Density as described by average NEE for assorted environmental conditions

Carbon fluxes from each site observed during daytime, low atmospheric stability, and periods of similar leaf area were sorted into 2 degree C air temperature classes at the time of observation and plotted against incident photosynthetic photon flux density (PPFD) measured above the canopy. As shown in the examples of Figure 7, the light response curves apparently show high scatter, indicated by the standard deviations. High scatter in the data is expected and depends on additional factors not yet included in the regression analysis, as effects of daytime, seasonality of leaf physiology, water availability, or inhomogeneity in fetch/footprint area of the tower site. Nevertheless, the data provide good estimates of the estimated respiration rate as light decreases to zero (Rday), of the quantum yield a' as the primary slope, and of the maximum carbon uptake (Fcsat) at light saturation. The light dependence for each 2 degree temperature class is described via a modified Michaelis-Menten type function with three parameters: Rday, the respiration term combining leaf, bole and soil respiration, a', the quantum yield of the canopy, and Fcsat, the saturation value in the absence of respiration.

 

(Eq. 1)

Parameter estimates were obtained from nonlinear regression analyses applied to the data of each temperature class. Responses studied in this manner provide a large data set describing the parameters Rday, Fcsat, and a' as a function of temperature.

Figure 7: Mean light response of NEE sorted in air temperature classes at Walker Branch 1997 on a half-hourly data basis. Bars indicate the standard deviation of the mean NEE within one light and temperature class

The temperature dependency of the respiration term, Rday, may be described by an exponential function of air temperature, although Rday show slightly decreased values at higher temperatures.

(Eq. 2)

where fair and Eair are scaling constant and activation energy, respectively. TK is air temperature in Kelvin, and R is the gas constant (8.31 J K-1 mol-1). A concern in the analysis of the respiration data is that a clear separation of the different respiration sources (leaves, bole, and soil) is not possible. The contribution of these sources will change over time, and in response to different developmental factors (e.g. air, soil, and bole temperature, soil water potential, etc.). Nevertheless, we decided to include only the response to air temperature.

The temperature dependence of the potential rate of carbon uptake at saturating light, Fcsat, is described by:

(Eq. 3)

where Tair is air temperature (K), R is the gas constant (8.31 J K-1 mol-1), D Ha(Fcsat) is the activation energy, D Hd(Fcsat) the energy of deactivation, D S(Fcsat) is an entropy term and c(Fcsat) is a scaling constant.

Values of a', the quantum yield of the canopy, may be directly estimated from the slope of the light-response-curve of canopy carbon dioxide exchange. In this analysis a' were determined together with Rday and Fcsat for each temperature class using Eq. 1 and non-linear regression methods, and showed an optimum curve with temperature. Whereas at leaf level the light use efficiency (a ) at saturating CO2 shows no temperature response, Ehleringer and Björkman (1977) showed leaf quantum yield as temperature dependent. The quantum yield of a canopy, a', is an operationally determined parameter based on stand gas exchange data. Therefore light distribution effects are included in the estimate of the quantum yield, and a’ may be expected to decrease for inhomogeneous, clumped canopy architecture.

Describing the optimum temperature response of a', an equation analogous to the response of Fcsat is used:

(Eq. 4)

where Tair is air temperature (K), R is the gas constant (8.31 J K-1 mol-1), D Ha(a') is the activation energy, D Hd(a') the energy of deactivation, D S(a') is an entropy term and c(a') is a scaling constant.

Semi-empirical gap filling provides preservation of flux responses to meteorological drivers (as temperature, incident light, or vapor pressure deficit), but also shows the need of complete meteorological data sets (compare white bars in Fig. 8).

 

 

 

Figure 8: Semi-empirical fill of gaps in seasonal and daily courses of NEE at Walker Branch 1997 on a half-hourly data basis. Gaps in meteorological drivers are shown in white. The annual sum of NEE is calculated based on semi-empirical filled data, but using running bi-weekly day courses for periods, where air temperature or PAR was missing.

 

One aspect reducing the still large standard deviations of NEE within one light and temperature class (see Fig. 7) is consideration of the relative amounts of diffuse and direct radiation. During overcast conditions, all available radiation is diffuse, whereas under a clear sky a high proportion of the radiation is direct. For a given radiation amount above the vegetation the diffuse portion penetrates much deeper in the canopy, and increases the contribution of lower canopy parts to the systems carbon uptake. The direct portion of the light, as a parallel beam, reaches only the sunlit area in the canopy, which decreases exponentially with canopy depth. This part of the light cannot be used as efficiently as diffuse radiation, which reaches also shaded leaves. In terms of light responses of canopy NEE this implies consideration of different shapes for the response curve according to the sky conditions. Not all sites do measure diffuse and total available radiation. So other methods to distinguish clear, cloudy and overcast time periods have to be considered. The method here calculates for each half-hour the potential radiation at the top of the atmosphere depending on latitude, longitude, Julian day, and daytime. The ratio between the measured total PPFD and this potential value depends on the solar elevation angle (see Fig. 9), and is used as an indicator of the sky conditions. The threshold lines are site-specific, somehow arbitrary and more an engineering approach. The lower line is set to 50% of the upper line. More sophisticated suggestions are welcome.

The effect on the light responses of NEE under clear, cloudy and overcast sky conditions for Tharandt, a Norway spruce site in Germany (EUROFLUX), and Walker Branch, the oak-hickory-maple stand in Tennessee (AmeriFlux) is shown in Fig. 10. The data in this figure is also pre-sorted in temperature classes. For the AmeriFlux sites Howland, Shidler, and Little Washita Fig. 11 summarizes the shift of maximum NEE towards lower light intensities at a given temperature range when sky conditions are cloudy or overcast. Using temperature, and radiation driven semi-empirical gap-filling therefore should also consider the "quality" of light as indicated by sky conditions. The implications of a relative higher net photosynthesis during periods with large diffuse proportion on estimates of future carbon uptake are worth considering, as global climate change models predict increasing cloudiness.

 

Figure 9: Ratio between the measured photosynthetic active radiation above the canopy and the potential radiation (wavelengths 400-700 nm) at the top of the atmosphere, and its dependence on the solar elevation angle. Also indicated the threshold lines for distinguishing clear, cloudy and overcast sky conditions.

 

Figure 10: Light responses of NEE under clear, cloudy and overcast sky conditions for Tharandt, a Norway spruce site in Germany (EUROFLUX), and Walker Branch, an oak-hickory-maple stand in Tennessee (AMERIFLUX). As indicated data represent also a pre-sorted temperature class.

 

 

Figure 11: Temperature and light responses of NEE under clear, cloudy and overcast sky conditions for Howland, Shidler, and Little Washita data (AmeriFlux). Under cloudy or overcast conditions all sites show a shift of maximum NEE towards lower light intensities.

 

Application of the differing filling methods for annual estimates of NEE and Evapotranspiration can result in big differences for the annual sum. Not considering linear filling methods, depending on the percentage of rejected data annual carbon gain in g C m-2 yr-1 can differ by 30-50, but up to 70 grams, especially for stands where ca. 50% of the data are rejected or missing (see Table 1). Evapotranspiration is more conservative, but 15-20 mm yr-1 differences are typical values (Table 2).

 

Annual Estimates of NEE

g C m-2 yr-1

WBW95

WBW96

WBW97

LO97

TH97

HL96

HV92

HV93

HV94

HV95

HV96

Linear Interpolation

630

852

966

450

757

695

502

113

197

459

319

Original time step

Running Biweekly Mean

552

724

904

324

605

321

299

408

253

363

300

Biweekly Mean

530

719

905

262

608

318

269

270

250

356

287

% rejected data

44.3

36.4

28.9

54.6

35.7

45.2

39.7

45.6

19.0

24.9

17.8

Time step first merged by factor 2

Running Biweekly Mean

537

708

892

311

616

303

292

390

246

357

291

Biweekly Mean

506

716

891

242

626

307

264

275

245

354

279

% rejected data

33.6

26.4

19.4

52.8

29.5

36.8

35.3

41.8

15.0

20.2

12.7

Semi-Empirical Method

510

693

855

301

620

249

363

396

222

347

288

Semi-Emp., but for diff. Sky conditions

508

689

876

308

629

253

         

Semi-Empirical, but nighttime using soil temperature response

605

786

946

242

635

-

         

 

Table 1. Annual sums of NEE for different sites and years in g C m-2 yr-1. Gaps were filled by differing filling strategies.

 

Annual Estimates of Evapotranspiration

mm yr-1

WBW95

WBW96

WBW97

LO97

TH97

HL96

HV92

HV93

HV94

HV95

HV96

Linear Interpolation

624

567

634

488

530

439

502

398

620

593

402

Original time step

Running Biweekly Mean

596

562

609

431

492

383

386

627

591

529

491

Biweekly Mean

606

563

613

399

480

397

390

606

582

527

362

% rejected data

50.5

17.2

22.5

40.0

34.3

40.8

34.4

56.4

24.7

25.7

43.4

Time step first merged by factor 2

Running Biweekly Mean

601

560

606

428

487

375

380

604

590

525

485

Biweekly Mean

603

562

609

400

477

391

384

587

580

524

352

% rejected data

39.3

10.0

14.1

38.2

28.1

32.6

29.5

53.6

20.8

20.9

39.7

Semi-Empirical Method

588

554

566

414

488

360

388

648

573

539

480

Semi-Emp., but for diff. Sky conditions

587

559

569

414

488

362

         

Semi-Empirical, but nighttime using soil temperature response

617

562

576

417

505

-

         

Table 2. Annual sums of evapotranspiration for different sites and years in mm yr-1. Gaps were filled by differing filling strategies.