APPENDIX
Gap (Filling) Strategies
> Missing Data and Data Rejection Criteria: Multiple Causes
System/Sensor Breakdown
Raw data (off-scale, out of ensemble range)
Spikes
Anemometer/Rotation Angle
Missing Correction Terms (Pressure, Temperature, Profile measurements for CO2 or heat storage)
Stationarity Tests
Integral Turbulence Characteristics
Foot-print/Source Weight
Biological Constraints (Growing season - Leafless period, Bud Break - Leaf Senescence)
Physical Constraints (Available Energy, Energy Balance Closure)
About 36 % gaps and rejected data are found in the seasonal and daily courses of NEE at Walker Branch 1997 (Fig.1). The rejection or gap probability decreases using the CO2 flux measured above the canopy and not corrected for storage in the canopy air layer or drift. Those terms are determined independently and can add additional gap percentages.

> Documentation of Applied Rejection Criteria
Applied rejection criteria for flux (CO2 Flux, LE, H, Momentum Flux), storage (CO2, LE, H in air layer, H in biomass), and main meteorology variables should be documented. Documentation would be useful in interpreting gap periods, decision which gap filling strategy is most applicable, and investigating and possibly reducing rejection causes. A binary flag system of rejection criteria for each variable would allow tracking multiple causes of data rejection. Each bin of the binary code represents one particular reason for data rejection. Instead of the binary value the corresponding decimal number could be stored as in the flag column of each variable:
|
1 |
0 |
0 |
1 |
0 |
1 |
> Linear Interpolation
Linear interpolation is often being used for small gaps (2-3 half-hourly means missing), and especially useful to complete meteorological variables as temperature or relative humidity. Applied to NEE data the results are conflicting: depending on the size of the gaps more or less reasonable results are obtained (Fig.2).

Figure 2: Linear interpolation used to fill gaps in seasonal and daily courses of NEE at Walker Branch 1997.
> Mean Day Course based on half-hourly or hourly data
This gap filling method uses mean daily courses to fill gaps in half-hourly or hourly data. Differences in methods are based on the length of the time interval, which is used to calculate the mean daily course (usually 4 to 15 days). Further, a so-called running mean can be applied (i.e. the 15 days previous to the gap are used to calculate the daily course). On the other hand, the entire time period can be cut into short periods, where gaps within one period are filled with the representative half-hourly value from the mean daily course for that period.
Whereas monthly intervals - especially during changing weather, bud-break or leaf-fall periods - are supposed to be too large (see standard deviations in Fig. 3), 15 day or shorter periods are often applied and on an average seem to show smaller standard deviations (Fig.4). But interpretation of this figure remains somewhat difficult, as the standard deviations in shorter time periods are strongly dependent on weather changes during the investigated period.

Figure 3: July mean day course of NEE at Walker Branch 1997.

Figure 4: Frequency distribution of the standard deviations for mean half-hourly values of NEE; the period for taking the mean were 28, 24, 20, 16, 12, 8, and 4 days during 1997 at Walker Branch.

Figure 5: Running bi-weekly mean daily course to fill gaps in seasonal and daily courses of NEE at Walker Branch 1997 on a hourly data basis.

Figure 6: Running bi-weekly mean daily course to fill gaps in seasonal and daily courses of NEE at Walker Branch 1997 on a half-hourly data basis.
> Semi-empirical Gap Filling
Gap filling for NEE:
Response to Temperature and Photosynthetic Photon Flux Density as described by average NEE for assorted environmental conditions
Carbon fluxes from each site observed during daytime, low atmospheric stability, and periods of similar leaf area were sorted into 2 degree C air temperature classes at the time of observation and plotted against incident photosynthetic photon flux density (PPFD) measured above the canopy. As shown in the examples of Figure 7, the light response curves apparently show high scatter, indicated by the standard deviations. High scatter in the data is expected and depends on additional factors not yet included in the regression analysis, as effects of daytime, seasonality of leaf physiology, water availability, or inhomogeneity in fetch/footprint area of the tower site. Nevertheless, the data provide good estimates of the estimated respiration rate as light decreases to zero (Rday), of the quantum yield a' as the primary slope, and of the maximum carbon uptake (Fcsat) at light saturation. The light dependence for each 2 degree temperature class is described via a modified Michaelis-Menten type function with three parameters: Rday, the respiration term combining leaf, bole and soil respiration, a', the quantum yield of the canopy, and Fcsat, the saturation value in the absence of respiration.
(Eq. 1)
Parameter estimates were obtained from nonlinear regression analyses applied to the data of each temperature class. Responses studied in this manner provide a large data set describing the parameters Rday, Fcsat, and a' as a function of temperature.

Figure 7: Mean light response of NEE sorted in air temperature classes at Walker Branch 1997 on a half-hourly data basis. Bars indicate the standard deviation of the mean NEE within one light and temperature class
The temperature dependency of the respiration term, Rday, may be described by an exponential function of air temperature, although Rday show slightly decreased values at higher temperatures.
(Eq. 2)
where fair and Eair are scaling constant and activation energy, respectively. TK is air temperature in Kelvin, and R is the gas constant (8.31 J K-1 mol-1). A concern in the analysis of the respiration data is that a clear separation of the different respiration sources (leaves, bole, and soil) is not possible. The contribution of these sources will change over time, and in response to different developmental factors (e.g. air, soil, and bole temperature, soil water potential, etc.). Nevertheless, we decided to include only the response to air temperature.
The temperature dependence of the potential rate of carbon uptake at saturating light, Fcsat, is described by:
(Eq. 3)
where Tair is air temperature (K), R is the gas constant (8.31 J K-1 mol-1), D Ha(Fcsat) is the activation energy, D Hd(Fcsat) the energy of deactivation, D S(Fcsat) is an entropy term and c(Fcsat) is a scaling constant.
Values of a', the quantum yield of the canopy, may be directly estimated from the slope of the light-response-curve of canopy carbon dioxide exchange. In this analysis a' were determined together with Rday and Fcsat for each temperature class using Eq. 1 and non-linear regression methods, and showed an optimum curve with temperature. Whereas at leaf level the light use efficiency (a ) at saturating CO2 shows no temperature response, Ehleringer and Björkman (1977) showed leaf quantum yield as temperature dependent. The quantum yield of a canopy, a', is an operationally determined parameter based on stand gas exchange data. Therefore light distribution effects are included in the estimate of the quantum yield, and a’ may be expected to decrease for inhomogeneous, clumped canopy architecture.
Describing the optimum temperature response of a', an equation analogous to the response of Fcsat is used:
(Eq. 4)
where Tair is air temperature (K), R is the gas constant (8.31 J K-1 mol-1), D Ha(a') is the activation energy, D Hd(a') the energy of deactivation, D S(a') is an entropy term and c(a') is a scaling constant.
Semi-empirical gap filling provides preservation of flux responses to meteorological drivers (as temperature, incident light, or vapor pressure deficit), but also shows the need of complete meteorological data sets (compare white bars in Fig. 8).

Figure 8: Semi-empirical fill of gaps in seasonal and daily courses of NEE at Walker Branch 1997 on a half-hourly data basis. Gaps in meteorological drivers are shown in white. The annual sum of NEE is calculated based on semi-empirical filled data, but using running bi-weekly day courses for periods, where air temperature or PAR was missing.
One aspect reducing the still large standard deviations of NEE within one light and temperature class (see Fig. 7) is consideration of the relative amounts of diffuse and direct radiation. During overcast conditions, all available radiation is diffuse, whereas under a clear sky a high proportion of the radiation is direct. For a given radiation amount above the vegetation the diffuse portion penetrates much deeper in the canopy, and increases the contribution of lower canopy parts to the systems carbon uptake. The direct portion of the light, as a parallel beam, reaches only the sunlit area in the canopy, which decreases exponentially with canopy depth. This part of the light cannot be used as efficiently as diffuse radiation, which reaches also shaded leaves. In terms of light responses of canopy NEE this implies consideration of different shapes for the response curve according to the sky conditions. Not all sites do measure diffuse and total available radiation. So other methods to distinguish clear, cloudy and overcast time periods have to be considered. The method here calculates for each half-hour the potential radiation at the top of the atmosphere depending on latitude, longitude, Julian day, and daytime. The ratio between the measured total PPFD and this potential value depends on the solar elevation angle (see Fig. 9), and is used as an indicator of the sky conditions. The threshold lines are site-specific, somehow arbitrary and more an engineering approach. The lower line is set to 50% of the upper line. More sophisticated suggestions are welcome.
The effect on the light responses of NEE under clear, cloudy and overcast sky conditions for Tharandt, a Norway spruce site in Germany (EUROFLUX), and Walker Branch, the oak-hickory-maple stand in Tennessee (AmeriFlux) is shown in Fig. 10. The data in this figure is also pre-sorted in temperature classes. For the AmeriFlux sites Howland, Shidler, and Little Washita Fig. 11 summarizes the shift of maximum NEE towards lower light intensities at a given temperature range when sky conditions are cloudy or overcast. Using temperature, and radiation driven semi-empirical gap-filling therefore should also consider the "quality" of light as indicated by sky conditions. The implications of a relative higher net photosynthesis during periods with large diffuse proportion on estimates of future carbon uptake are worth considering, as global climate change models predict increasing cloudiness.

Figure 9: Ratio between the measured photosynthetic active radiation above the canopy and the potential radiation (wavelengths 400-700 nm) at the top of the atmosphere, and its dependence on the solar elevation angle. Also indicated the threshold lines for distinguishing clear, cloudy and overcast sky conditions.

Figure 10: Light responses of NEE under clear, cloudy and overcast sky conditions for Tharandt, a Norway spruce site in Germany (EUROFLUX), and Walker Branch, an oak-hickory-maple stand in Tennessee (AMERIFLUX). As indicated data represent also a pre-sorted temperature class.



Figure 11: Temperature and light responses of NEE under clear, cloudy and overcast sky conditions for Howland, Shidler, and Little Washita data (AmeriFlux). Under cloudy or overcast conditions all sites show a shift of maximum NEE towards lower light intensities.
Application of the differing filling methods for annual estimates of NEE and Evapotranspiration can result in big differences for the annual sum. Not considering linear filling methods, depending on the percentage of rejected data annual carbon gain in g C m-2 yr-1 can differ by 30-50, but up to 70 grams, especially for stands where ca. 50% of the data are rejected or missing (see Table 1). Evapotranspiration is more conservative, but 15-20 mm yr-1 differences are typical values (Table 2).
Annual Estimates of NEE
|
g C m-2 yr-1 |
WBW95 |
WBW96 |
WBW97 |
LO97 |
TH97 |
HL96 |
HV92 |
HV93 |
HV94 |
HV95 |
HV96 |
|
Linear Interpolation |
630 |
852 |
966 |
450 |
757 |
695 |
502 |
113 |
197 |
459 |
319 |
Original time step
|
Running Biweekly Mean |
552 |
724 |
904 |
324 |
605 |
321 |
299 |
408 |
253 |
363 |
300 |
|
Biweekly Mean |
530 |
719 |
905 |
262 |
608 |
318 |
269 |
270 |
250 |
356 |
287 |
|
% rejected data |
44.3 |
36.4 |
28.9 |
54.6 |
35.7 |
45.2 |
39.7 |
45.6 |
19.0 |
24.9 |
17.8 |
Time step first merged by factor 2
|
Running Biweekly Mean |
537 |
708 |
892 |
311 |
616 |
303 |
292 |
390 |
246 |
357 |
291 |
|
Biweekly Mean |
506 |
716 |
891 |
242 |
626 |
307 |
264 |
275 |
245 |
354 |
279 |
|
% rejected data |
33.6 |
26.4 |
19.4 |
52.8 |
29.5 |
36.8 |
35.3 |
41.8 |
15.0 |
20.2 |
12.7 |
|
Semi-Empirical Method |
510 |
693 |
855 |
301 |
620 |
249 |
363 |
396 |
222 |
347 |
288 |
|
Semi-Emp., but for diff. Sky conditions |
508 |
689 |
876 |
308 |
629 |
253 |
|||||
|
Semi-Empirical, but nighttime using soil temperature response |
605 |
786 |
946 |
242 |
635 |
- |
Table 1. Annual sums of NEE for different sites and years in g C m-2 yr-1. Gaps were filled by differing filling strategies.
Annual Estimates of Evapotranspiration
|
mm yr-1 |
WBW95 |
WBW96 |
WBW97 |
LO97 |
TH97 |
HL96 |
HV92 |
HV93 |
HV94 |
HV95 |
HV96 |
|
Linear Interpolation |
624 |
567 |
634 |
488 |
530 |
439 |
502 |
398 |
620 |
593 |
402 |
Original time step
|
Running Biweekly Mean |
596 |
562 |
609 |
431 |
492 |
383 |
386 |
627 |
591 |
529 |
491 |
|
Biweekly Mean |
606 |
563 |
613 |
399 |
480 |
397 |
390 |
606 |
582 |
527 |
362 |
|
% rejected data |
50.5 |
17.2 |
22.5 |
40.0 |
34.3 |
40.8 |
34.4 |
56.4 |
24.7 |
25.7 |
43.4 |
Time step first merged by factor 2
|
Running Biweekly Mean |
601 |
560 |
606 |
428 |
487 |
375 |
380 |
604 |
590 |
525 |
485 |
|
Biweekly Mean |
603 |
562 |
609 |
400 |
477 |
391 |
384 |
587 |
580 |
524 |
352 |
|
% rejected data |
39.3 |
10.0 |
14.1 |
38.2 |
28.1 |
32.6 |
29.5 |
53.6 |
20.8 |
20.9 |
39.7 |
|
Semi-Empirical Method |
588 |
554 |
566 |
414 |
488 |
360 |
388 |
648 |
573 |
539 |
480 |
|
Semi-Emp., but for diff. Sky conditions |
587 |
559 |
569 |
414 |
488 |
362 |
|||||
|
Semi-Empirical, but nighttime using soil temperature response |
617 |
562 |
576 |
417 |
505 |
- |
Table 2. Annual sums of evapotranspiration for different sites and years in mm yr-1. Gaps were filled by differing filling strategies.