TMCnet News

Determining Functional Relations in Multivariate Oceanographic Systems: Model II Multiple Linear Regression [Journal of Atmospheric and Oceanic Technology]

[July 24, 2014]

Determining Functional Relations in Multivariate Oceanographic Systems: Model II Multiple Linear Regression [Journal of Atmospheric and Oceanic Technology]

(Journal of Atmospheric and Oceanic Technology Via Acquire Media NewsEdge) ABSTRACT A method for estimating multivariate functional relationships between sets of measured oceanographic, meteorological, and other field data is presented. Model II regression is well known for describing functional relationships between two variables. However, there is little accessible guidance for the researcher wishing to apply model II methods to a multivariate system consisting of three or more variables. This paper describes a straightforward method to extend model II regression to the case of three or more variables.

The multiple model II procedure is applied to an analysis of the optical spectral scattering coefficient measured in the coastal ocean. The spectral scattering coefficient is regressed against both suspended mineral particle concentration and suspended organic particle concentration. The regression coefficients from this analysis provide adjusted estimates of the mineral particle scattering cross section and the organic particle scattering cross section. Greater accuracy and efficiency of the coefficients from this analysis, compared to semiempirical coefficients, is demonstrated. Examples of multivariate data are presented that have been analyzed by partitioning the variables into arbitrary bivariate models. However, in a true multivariate system with correlated predictors, such as a coupled biogeochemical cycle, these bivariate analyses yield incorrect coefficient estimates and may result in large unexplained variance. Employing instead a multivariate model II analysis can alleviate these problems and may be a better choice in these situations.

(ProQuest: ... denotes formulae omitted.) 1. Introduction Much oceanographic field research seeks to establish a functional relation, that is, find the slope and intercept of the line that best fits the bivariate scatter between two field-observed variables. A good example of this is found in Tett et al. (1975), where particulate carbon and particulate phosphorus are compared. Model II regression has often been suggested in this type of situa- tion to estimate the functional relationship between two variables (Ricker 1973; Sokal and Rohlf 1995; Warton et al. 2006). The literature is substantial regarding esti- mating a linear functional relationship, dating back to at least Pearson (1901), and several different methods have been proposed. A model II analysis was applied by Sverdrup (1916) in analyzing meteorological variables as early as 1916. Ricker (1973) proposed the use of model II regression in fishery studies. Laws and Archie (1981) asserted the advisability of using model II regression for various field studies, such as the investigation of oxygen consumption per body mass, that is, the metabolic rate, by oceanic zooplankton in the study of Ikeda (1970). Laws and Archie (1981) pointed out that studies of metabolic rate per body mass, phytoplankton ratios of carbon to chlorophyll, and morphometric analyses, all of which are determined from regression slopes, lend themselves to more efficient analyses by model II regression. Further studies of the variations of particulate carbon body mass regressed against chlorophyll concentration of phyto- plankton as discussed by Banse (1977) are also amenable to model II analysis.

In virtually all of the studies mentioned above, arbi- trary bivariate pairings of variables were modeled, even though the situations investigated actually involved three variables. For example, the studies of Ikeda (1970) involved oxygen uptake, body mass, and environmental temperature, and Banse (1977) considered the varia- tions in cellular particulate phosphorus content and other environmental variables in addition to chlorophyll concentration and particulate organic carbon mass. The multivariate nature of oceanographic investigations is becoming increasingly recognized, as in the studies of Fichot et al. (2008), while Gallie and Murtha (1992) applied a multiple regression analysis to the biogeo- optical properties of inland waters that will be instructive to oceanographers. Stavn and Richter (2008) applied model II multiple regression to determine the mass- specific scattering cross sections of suspended matter in the ocean, both inorganic and organic, to allow more accurate determinations of these materials by satellite remote sensing. All of these situations come under the subject of the interaction of physical properties with coupled biogeochemical cycles (Schlesinger et al. 2011).

In this paper a method for determining model II multiple regression estimates is presented. The method is applied to the determination of optimum mass-specific optical scattering cross sections for suspended matter in the coastal ocean. Section 2 contains a brief background on the development of model II regression methods. Section 3 introduces the notation for the bivariate case, applied to estimate the relation between fish mass and egg production. Section 4 extends the method to more than two variables and provides an example for the three- variable case: determining optical scattering cross sec- tions from oceanographic field measurements. Section 5 contains a discussion of theoretical issues of modeling relations and suggests other possible areas of application of model II multiple regression in oceanography.

2. Model II regression methods The most widely recommended methods for model II regression are the major axis (MA) method and the standardized (or reduced) major axis (SMA), also re- ferred to as the geometric mean method. These methods are well developed for the case of two variables (e.g., Laws 1997; Sokal and Rohlf 1995; Warton et al. 2006), but not for three or more variables. Thus, the primary goal of this paper is to present a method of estimating the functional relationship between any number of variables. Many authors have discussed aspects of re- gression for describing functional relationships, often in a broader context (see Anderson 1984; Kendall and Stuart 1977). However, we do not intend for this to be a general discussion of the various issues of estimation arising in those contexts. Instead, our goal is to outline a straightforward procedure for estimating the pa- rameters in a multivariate functional relationship.

Pearson (1901) showed that the best-fitting line to a system of points is in the direction of the major axis of the corresponding ellipse. Kermack and Haldane (1950) later showed that Pearson's method was not invariant to changes in scale of the variables and proposed the use of a "reduced" major axis, where the major axis method is applied to standardized variables. Ricker (1973) showed that geometric mean regression is identical to reduced major axis regression but far easier to compute. Jolicoeur (1975), however, argued against the reduced (or "stan- dard") major axis method, and Sprent and Dolby (1980) suggested using the line that bisects the minor angle between the two model I (ordinary least squares) re- gressions as the estimate of the functional relationship. Warton et al. (2006) argue that the term "standardized major axis" is best used to describe the major axis method applied to standardized data, and this is the term that will be used in this paper.

With the exception of Pearson (1901), who also con- sidered the three-variable case, all of the previously mentioned references address only the case of relating two variables. Pearson's 1901 paper is considered the foundation of principal component analysis (PCA), and PCA provides a framework for estimating functional relationships that can easily be extended to more than two variables. While the relationship between PCA and model II methods is generally appreciated, rarely are model II estimates presented using the results of a PCA analysis. Rencher (2002) described the relation between PCA and model II estimates for the two-variable case. In section 3, we present the notation for the two-variable case, and in section 4 we derive estimates for models with three or more variables.

3. Model II regression using PCA a. Two-variable case The two-variable case is considered here to introduce the notation required for extension to the multivariate case. One advantage of using PCA is that most statistical software packages (e.g., SAS, R) contain procedures to perform PCA, thus allowing model II estimates to be calculated without special programs.

Consider two variables, Y1 and Y2, for which it is de- sired to estimate the line that best fits the bivariate scatter between the two variables. This is accomplished by minimizing the sum of the squared perpendicular distances from each point to the regression line. To de- termine the equation of the line that best describes the functional relationship, let y1 and y2 be the sample means, and s12 and s22 the sample variances of Y1 and Y2, respectively; and let s12 be the sample covariance be- tween Y1 and Y2. Let a01 = (a11, a12 ) and a02 = (a21, a22) be the eigenvectors associated with the eigenvalues, l0 = (l1, l2), of the sample covariance matrix S,respec- tively. Then the principal components are the variables ...

and ...

Geometrically, the ordered pairs (y1, y2) might be characterized by an ellipse that contains the points. If this ellipse is rotated until the major (longer) axis is horizontal, then the new coordinates of the points are (z1, z2), with respect to the new horizontal axis, z1, and the new vertical axis (perpendicular to z1), z2. The major axis is the line passing through the point y0 = (y1, y2)in the direction determined by the first eigenvector, a01 = (a11, a12), which has a slope equal to a12/a11. The equation of the major axis can also be found by setting the second principal component, z2, to zero. Then, ...

and ... (1) Thus, the major axis is the line with slope 2a21/a22 and intercept y2 1 (a21/a22)y1. The slope given by a12/a11 = 2a21/a22 is the slope associated with the MA method.

The following example of biological oceanographic data provides an illustration of the ideas described above: Example 1-Sokal and Rohlf (1995) reported weights (Y1) of unspawned female cabezon (a California marine fish, Scorpaenichthys marmoratus) and the number of eggs subsequently produced (Y2) for 11 fish. The data are given in Table 1.

For the data from Table 1, y1 530.364, y2 576.545, s21593.355, s225418.873, and s125174.382. Then a015 (0.398, 0.917) and a02 = (0.917, 20.398) are the eigen- vectors associated with the eigenvalues, l0 = (494.6, 17.5), and the major axis has slope (20.917)/(20.398) = 2.304 andintercept76.5451(0.917330.364)/(20.398)56.586, and thus the equation is y2 = 6.586 1 2.304y1, illustrated in Fig. 1.

b. Using standardized data When the variables have substantially different vari- ances, the variable with the larger variance will dominate the eigenvector, in the sense that the weight assigned to that variable will be much larger. This can occur if Y1 and Y2 are measured in different units, or when the units are the same but the variation in observations is much larger for one of the variables. In example 1, notice that s2Y = 93.4, while s2Y = 418.9. The resulting slope estimate from the major axis regression is 2.30, which is much closer to the model I estimate obtained by regressing Y1 on Y2 (slope = 2.38) than the model 1 estimate obtained by regressing Y2 on Y1 (slope = 1.90). While the true slope of the relationship is unknown, this property of MA regression has often been criticized (e.g., Ricker 1973; Sokal and Rohlf 1995). In such a case it is often recommended that the variables be standardized before computing the model II estimates. This is equivalent to performing principal component analysis on the correlation, rather than covariance, matrix. In this case, as before, the equation of the major axis is ...

and ... (2) where a021 and a022 now represent the eigenvector com- ponents from the correlation matrix. With standardized data, variables contribute equally to the principal com- ponents, and the magnitudes of a021 and a022 will always be the same, and thus the slope estimator reduces to 6(s2/s1). This is the SMA estimator.

If standardized data are used for the Cabezon weight and egg production data in example 1, then the eigenvectors of the correlation matrix are a1 = (0.707, 0.707) and a2 = (0.707, 20.707). The slope is then ..., and the intercept is .... Substituting this value into Eq. (2) yields the equation y2 512.22712.118y1, illustrated in Fig. 1. The slope estimate, 2.118, can be shown to be the geometric mean of the two least squares slopes.

4. Extension to more than two variables a. PCA with three variables The results of the previous section can be extended to the case of three variables, Y1, Y2,andY3. Geometrically, the data now form a three-dimensional cloud, and if the variables are jointly correlated, then the cloud will be shaped like an ellipsoid (more or less shaped like an American football). Let y1, y2 , and y3 be the sample means and s12, s22, and s32 the sample variances of Y1, Y2 and Y3, respectively; and let s12, s13, and s23 be the sample covariances between Y1 and Y2, Y1 and Y3, and Y2 and Y3, respectively. Let a01 = (a11, a12, a13), a02 = (a21, a22, a23), and a03 = (a31, a32, a33) be the eigen- vectors of the sample covariance matrix S. Then the principal components are the variables ... and To derive the coefficients of the function relating Y3 to Y1 and Y2, set z3 = 0 and solve for y3.

...

and ... (3) The following example involving oceanographic field data is used to illustrate the method: Example 2-Oceanographic field data were collected on the optical scattering coefficient at a wavelength of 555 nm (b555), the particulate inorganic matter concen- tration (PIM), and the particulate organic matter con- centration (POM) of surface samples, from 24 field stations at Mobile Bay, Alabama (see Stavn and Richter 2008). The data are given in Table 2.

For the data from Table 2, y1 = 6.849, y2 = 1.738, y3 = 5.724; s12 = 20.777, s22 = 0.950, s32 = 14.696; and s12 52.533, s13516.471, s23 52.550. Then a015(0.764, 0.105, 0.637), a025(0.598, 20.485, 20.637), and a035 (20.242, 20.868, 0.434) are the eigenvectors associated withthe eigenvalues, l05(34.864, 1.175, 0.383).The co- efficient associated with PIM is 2(20.242/0.434) = 0.558, the coefficient associated with POM is 2(20.868/ 0.434) = 2.000, and the intercept is (20.242/4.334) (6.849)1(20.868/0.434)(1.738)521.571, which yields the MA model II equation b555 521.571 1 0.558 3 PIM1 2.0003 POM.

b. Using standardized data The principal components for the standardized data are ... and The equation of the plane determined by the first two principal components can be found by setting z3 = 0.

...

and ... (4) Thus, the slope coefficients are (2a031/a033 )(s3/s1) for Y1 and (2a32/a33)(s3/s2) for Y2. Unlike the two-variable case [Eq. (2)], the ratio of eigenvector elements will usually not equal 1, and thus the slope estimates do not reduce to the ratio of sample standard deviations.

Returning to example 2: The correlation matrix for Y1, Y2, and Y3 is ...

which has eigenvectors a01 = (0.594, 0.515, 0.618), a02 = (0.47420.845,20.249), and a035(0.650, 0.145,20.746) associated with the eigenvalues, l0 = (2.476, 0.479, 0.046). Then the slope coefficient associated with PIMis (0.650/20.746)(3.834/4.558)520.733, theslope coefficient associated with POM is (0.145/20.746)(3.834/ 0.975) 520.764, and the intercept is (0.650/20.746) (3.834/4.558)(6.849) 1 (0.145/20.746)(3.834 /0.975) (1.738) 1 5.724 520.625, which, when substituted into Eq. (4), yields the SMA model II equation b555 = 20.625 10.7333 PIM 1 0.7643 POM.

These values of b555 are compared with values de- termined in the same way from the Southwest Pass, Mississippi River, a theoretical calculation for suspended coccolithophore plates, and values of a claimed empirical proxy for the slope coefficient in Table 3.TheRcode (RCoreTeam2012) used to perform the calculations in sections 4a and 4b is included in the appendix.

c. Interpretation of slope coefficients As in model I multiple regression, each of the co- efficients of y1 and y2 can be interpreted as a conditional slope, holding the other variable constant. Geometrically, Eq. (4) represents a plane hovering above the plane de- termined by y1 and y2, as illustrated in Fig. 2. Condi- tioning on a value of y2, for example, is equivalent to taking a slice (which is a line) out of the plane and placing it on the y1y3 plane, where the value (2a31/a33) gives the slope of that line; that is, it gives a description of the linear relationship between y1 and y3 for a fixed value of y2.As is the case in model I regression, the interpretation of the conditional slopes becomes more complicated when the variables are correlated, as it may not be possible to change one variable value while holding the other constant.

d. Extension to more than three variables The results of sections 4a and 4b can be extended to more than three variables. In general, suppose there are p $ 2 observed variables, Y1, Y2, ..., Yp. The MA equation, using the covariance matrix, becomes ... (5) and thus the conditional slope of yk is (2api/app). When using the correlation matrix, the SMA equation becomes ... (6) e. Adjusted versus unadjusted coefficients When considering the analysis of environmental data, collected over various nonconstant conditions of envi- ronment, it becomes necessary to factor in other rel- evant variables in addition to a single variable pair of interest. Therefore, most models for environmental data are actually multivariate in nature. Coefficients adjusted for other variables are easy to obtain using model I regression, but in situations where model II regression is more appropriate, researchers have usually resorted to fitting separate one-predictor models to obtain co- efficient estimates (Banse 1977; Ikeda 1970; Steele and Baird 1961). In such a situation, if the explanatory var- iables are related, then regression applied in turn to variable pairs will result in incorrect estimates of the regression coefficients. Multiple predictor models provide more appropriate estimates than separate one- predictor models, especially when there is substantial dependence between the predictors.

Consider the coefficients obtained from Eq. (4) in section 4b, using standardized data. If separate one- predictor model II regression models, using Eq. (2), are fit to the data of Table 2, then the unadjusted slope co- efficients for PIM and POM are 0.841 and 3.934, re- spectively. Recall that the estimates obtained from the SMA method were 0.733 and 0.764, respectively, which are substantially different. This is not surprising, since the variables PIM and POM are moderately correlated (r = 0.57). Note also that the same phenomenon occurs using model I regression, where the multiple model I regression model estimates the coefficients to be 0.690 and 0.846 for PIM and POM, respectively, compared to 0.793 and 2.686, respectively, for the separate model I univariate regression models.

f. Confidence intervals for slope coefficients Once slope estimates are obtained, it might be desir- able to provide confidence limits on the true parameter values. Anderson (1963) discussed approximate confi- dence intervals for eigenvalues under asymptotic mul- tivariate normality. Jolicoeur (1968) proposed a small sample interval for the two-variable case under bivariate normality. Clarke and Van Gorder (2013) derived the density of the ratio of the true regression coefficient to that of the true geometric mean coefficient and use it to construct confidence intervals for the true regression coefficient, also for the two-variable case. Alternatively, nonparametric multivariate bootstrapping may be used to construct confidence limits for the slope coefficients. Advantages of the bootstrap technique are that no as- sumptions regarding the form of the joint distribution of the variables are necessary, and it can be used to cal- culate confidence limits using estimates obtained from both standardized and unstandardized data.

Consider the two-variable case discussed in section 3a, and let the true slope coefficient be denoted by u.To construct the bootstrap distribution, randomly sample, with replacement, n bivariate pairs from the original sample and compute the slope coefficient for each sample. Let the quantity ... denote the slope coefficient associated with a particular bootstrap sample. This is repeated for many random samples from the original sample and for the values of ... collected to obtain an estimate of the sampling distribution of .... Quantiles of this bootstrap sampling distribution can be used to provide confidence limits for the true co- efficient. For 100(1 2 a)% confidence, the interval is (...). Recall from example 1 in section 3a, the estimated coefficient from the MA method was calcu- lated as .... Using 10 000 bivariate bootstrap samples, a 95% confidence interval for the true co- efficient is found to be 1.443 = u = 3.163.

For the multivariate case, the same process can be used. For example, for the three-variable case described in example 2 of section 4a random triplets (PIM, POM, b555) are selected, with replacement, and the slope co- efficients for PIM and POM calculated each time. This results in two bootstrap distributions, one for each of the coefficients. Then the appropriate quantiles of the bootstrap sampling distributions provide confidence limits. This process can be extended to any number of variables. In example 2, the estimates using the MA method [Eq. (3)] were 0.558 and 2.000 for PIM and POM, respectively. Again, using 10 000 multivariate bootstrap samples, the 95% confidence intervals are 0.159 = uPIM = 0.754 for the coefficient of PIM and 0.750 = uPOM = 6.382 for the coefficient of POM.

Note that in example 2, the variability in the observed values of PIM was substantially higher than that of POM (the variance of PIM is almost 24 times as large). The result is a much larger slope estimate for POM com- pared to PIM, and a great deal of variability in the con- fidence interval estimates. As was discussed in section 3b, when the variability of the variables is substantially dif- ferent, calculations on standardized values may yield more useful results. Using the SMA method [Eq. (4)], the slope for PIM is estimated as 0.733, with 95% con- fidence interval 0.558 = uPIM = 0.825, while the slope estimate for POM is 0.764, with 95% confidence interval 0.121=uPOM =1.730.

5. Discussion The importance of multivariate models in oceanog- raphy and environmental studies is becoming increasingly recognized, and more studies are taking advantage of this point of view. In situations where determining the best functional relation is the goal, the results in section 4 provide straightforward extensions of the MA and SMA methods that can be used to find slope coefficients to estimate a functional relation in a multivariate sys- tem. There are many examples in the literature where the parameters of a multivariate model have been esti- mated using estimates from a series of bivariate models. However, this can result in incorrect estimates, since when the variables in the system are correlated, the slope coefficient obtained in a bivariate model is af- fected by the omitted variables.

There have been attempts to perform analyses of the ocean environment similar to that possible with model II multiple regression by a bivariate approximation to the mass-specific scattering cross section called the mass- specific scattering coefficient, which is equal to bl/TSS, the ratio of the total volume scattering coefficient to the total suspended solids (the sum of suspended mineral matter and suspended organic matter) (Babin et al. 2003). There are two problems with this empirical ratio: the volume total scattering coefficient, bl , is not parti- tioned into mineral and organic components, and TSS is not partitioned into mineral and organic components. This empirical ratio has then been recorded for coastal waters and open ocean waters, the average value of each oceanic system being 1.0 and 0.5 m2 g21 at 555 nm, re- spectively, declared as a mass-specific scattering co- efficient for open ocean and coastal ocean, respectively (Babin et al. 2003). This assertion has been based on assuming that coastal ocean water contains "mostly suspended inorganic matter" and that the open ocean water contains "mostly suspended organic matter." The mass-specific scattering coefficient has then been used in the same manner as the true optical mass-specific scat- tering cross sections. The determinations of the mass- specific scattering cross sections for the northern Gulf of Mexico are illustrated in Table 3. We can see immedi- ately that the empirical ratio reported in Babin et al. (2003) provides no insight into the optical properties and therefore the efficacy of this bivariate parameter for modeling and prediction of photon budgets, etc., of the northern coastal Gulf of Mexico.

The assertion that the approximation to the mass- specific scattering cross section provided by the coastal average of b555/TSS is 0.5 m2 g21 simply does not stand up to scrutiny. Stavn and Richter (2008) used multivar- iate model II methods to determine the functional re- lations of the optical volume scattering coefficient of the ocean hydrosol and the concentration of suspended mineral and organic matter in the ocean. The model II multivariate analysis yields the mass-specific optical scattering cross sections of suspended mineral and or- ganic matter. These optical cross sections allow the in- version of the suspended mass of mineral and organic matter by their separate contributions to the remote sensing reflectance signal detected by satellites and aircraft. Their magnitude and spectral slope are related to the modal size of the suspended matter size distri- bution and their fractal dimensions, respectively (Stavn 2012). Furthermore, these cross sections allow the par- titioning of the hydrosol scattering coefficient into the scattering coefficient due to suspended mineral matter and the scattering coefficient due to suspended organic matter. From these relations it is possible to derive al- gorithms and models of photon penetration and re- flectance, determine a photon budget, and quantitatively account for the components of the remote sensing signal. The values of s(PIM) reported in Stavn and Richter (2008) vary from 0.57 to 0.76 m2 g21 for the northern Gulf of Mexico in a spectral range comparable to the results re- ported by Babin et al. (2003). Furthermore, the organic values in Table 3 are significantly less than the organic value of 1.0 m2 g21 reported for the empirical ratio, while the Southwest Pass organic value actually approaches the value of the empirical ratio reported for suspended minerals. The closest value of Table 3 to that reported for suspended organic matter (Babinetal.2003)isthe 0.98 m2 g21 calculated for lith plates by Stavn and Richter (2008), except that the lith plates are essentially pure mineral, CaCO3, that would be expected from coccoli- thophores growing in the open ocean. We see, then, that the empirical ratio bl/TSS does not supply unequivocal information about the optical nature nor the type of suspended matter in the coastal ocean.

Model II methods are often recommended when all variables are measured subject to error (Sokol and Rholf 1995). However, as Warton et al. (2006) point out, this is not the most effective justification for using model II methods. Rather, model II methods should be pre- ferred when the goal is to estimate the linear relation that best describes the scatter of the variables. An ex- ample of this use was the estimation of scattering co- efficients discussed earlier in this section. Alternatively, if the goal is prediction of one variable value based on the values of one or more predictors, then model I re- gression is usually preferred.

It should also be noted that in addition to potential measurement error, which also may include sampling error, equation error may also exist. If the equation is misspecified, then estimates for both model I and model II methods will generally be biased. Further research is needed to better understand properties of multiple model II estimates.

6. Conclusions A straightforward procedure for estimating parame- ters in a model II multiple regression has been pre- sented. The utilization of MA and SMA and where each may be most appropriate was investigated. The pro- cedure was applied to determine the mass-specific scattering cross section of suspended mineral and or- ganic matter in the ocean. Many field situations pres- ently investigated or analyzed as bivariate regressions are actually embedded in a multivariate system. Multi- variate estimation is preferred in these situations to obtain more accurate estimates.

Acknowledgments. SJR wishes to acknowledge sup- port, in part, by a University of North Carolina at Greensboro Research Assignment. RHS wishes to ac- knowledge the valuable discussions on regression from Charles C. Trees, Center for Hydro-Optics and Remote Sensing, San Diego State University, San Diego Cal- ifornia. In addition there was the valuable support of a Research Assignment Leave from the University of North Carolina at Greensboro; a National Research Council Research Associateship through NASA, Ap- plied Sciences Directorate, Stennis Space Center, Mississippi; a summer fellowship from the American Society for Engineering Education through the Naval Research Laboratory, Code 7323, Coastal and Mar- ginal Seas Section, Stennis Space Center, Mississippi; early support from ONR Grant N00014-97-0812; and the Naval Research Lab Project to Richard W. Gould Jr., "Predicting Coastal Bio-Optical Response to Atmospheric/Oceanographic Forcing," Stennis Space Center, Mississippi.

REFERENCES Anderson, T. W., 1963: Asymptotic theory for principal compo- nent analysis. Ann. Math. Stat., 34, 122-148, doi:10.1214/ aoms/1177704248.

_____, 1984: Estimating linear statistical relationships. Ann. Stat., 12, 1-45.

Babin, M., A. Morel, V. Fournier-Sicre, F. Fell, and D. Stramski, 2003: Light scattering properties of marine particles in coastal and open ocean waters as related to the particle mass con- centration. Limnol. Oceanogr., 48, 843-859, doi:10.4319/ lo.2003.48.2.0843.

Banse, K., 1977: Determining the carbon-to-chlorophyll ratio of natural phytoplankton. Mar. Biol., 41, 199-212, doi:10.1007/ BF00394907.

Clarke, A. J., and S. Van Goder, 2013: On fitting a straight line to data when the "noise" in both variables is unknown. J. Atmos. Oceanic Technol., 30, 151-158, doi:10.1175/ JTECH-D-12-00067.1.

Fichot, C. G., S. Sathyendranath, and W. L. Miller, 2008: SeaUV and SeaUVC: Algorithms for the retrieval of UV/visible dif- fuse attenuation coefficients from ocean color. Remote Sens. Environ., 112, 1584-1602, doi:10.1016/j.rse.2007.08.009.

Gallie, E. A., and P. A. Murtha, 1992: Specific absorption and backscattering spectra for suspended mineral and chlorophyll a in Chilko Lake, British Columbia. Remote Sens. Environ., 39, 103-118, doi:10.1016/0034-4257(92)90130-C.

Ikeda, T., 1970: Relationships between respiration rate and body size in marine plankton animals as a function of the temper- ature of habitat. Bull. Fac. Fish. Hokkaido Univ., 21, 91-112.

Jolicoeur, P., 1968: Interval estimation of the slope of the major axis of a bivariate normal distribution in the case of a small sample. Biometrics, 24, 679-682, doi:10.2307/2528326.

_____, 1975: Linear regression in fishery research: Some comments. J. Fish. Res. Board Can., 32, 1491-1494, doi:10.1139/f75-171.

Kendall, M. G., and A. Stuart, 1977: Inference and Relationship. Vol. 2, The Advanced Theory of Statistics, 4th ed. Griffin, 224 pp.

Kermack, K. A., and J. B. S. Haldane, 1950: Organic correla- tion and allometry. Biometrika, 37, 30-41, doi:10.1093/ biomet/37.1-2.30.

Laws, E. A., 1997: Mathematical Models for Oceanographers. John Wiley & Sons, 343 pp.

_____, and J. W. Archie, 1981: Appropriate use of regression anal- ysis in marine biology. Mar. Biol., 65, 13-16, doi:10.1007/ BF00397062.

Pearson, K., 1901: On lines and planes of closest fit to systems of points in space. Philos. Mag., 2, 559-572, doi:10.1080/ 14786440109462720.

R Core Team, 2012: R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. [Available online at http://www.R-project.org/.] Rencher, A. C., 2002: Methods of Multivariate Analysis. 4th ed. John Wiley & Sons, 708 pp.

Ricker, W. E., 1973: Linear regressions in fishery research. J. Fish. Res. Board Can., 30, 409-434, doi:10.1139/f73-072.

Schlesinger, W. H., J. J. Cole, A. C. Finzi, and E. A. Holland, 2011: Introduction to coupled biogeochemical cycles. Front. Ecol. Environ., 9, 5-8, doi:10.1890/090235.

Sokal, R. R., and F. J. Rohlf, 1995: Biometry: The Principles and Practice of Statistics in Biological Research. 3rd ed. W.H. Freeman & Company, 885 pp.

Sprent, P., and G. R. Dolby, 1980: The geometric mean functional relationship. Biometrics, 36, 547-550, doi:10.2307/2530224.

Stavn, R. H., 2012: Mass-specific scattering cross sections of suspended sediments and aggregates: Theoretical limits and applications. Opt. Express, 20, 201-219, doi:10.1364/ OE.20.000201.

_____, and S. J. Richter, 2008: Biogeo-optics: Particle optical prop- erties and the partitioning of the spectral scattering coefficient of ocean waters. Appl. Opt., 47, 2660-2679, doi:10.1364/ AO.47.002660.

Steele, J. H., and I. E. Baird, 1961: Relations between primary production, chlorophyll and particulate carbon. Limnol. Oce- anogr., 6, 68-78, doi:10.4319/lo.1961.6.1.0068.

Sverdrup, H., 1916: Druckgradient, Wind und Reibung an der Erdoberfläche. Ann. Hydrogr. Marit. Meteor., 44, 413-427.

Tett, P., J. C. Cottrell, D. O. Trew, and B. J. B. Wood, 1975: Phosphorus quota and the chlorophyll : carbon ratio in marine plankton. Limnol. Oceanogr., 20, 587-603, doi:10.4319/ lo.1975.20.4.0587.

Warton, D. I., I. J. Wright, D. S. Falster, and M. Westoby, 2006: Bivariate line-fitting methods for allometry. Biol. Rev., 81, 259-291, doi:10.1017/S1464793106007007.

SCOTT J. RICHTER Department of Mathematics and Statistics, University of North Carolina at Greensboro, Greensboro, North Carolina ROBERT H. STAVN Oceanography Division, Naval Research Laboratory, Stennis Space Center, Mississippi (Manuscript received 1 October 2013, in final form 3 April 2014) Corresponding author address: Robert H. Stavn, Biology Department, University of North Carolina at Greensboro, 312 Eberhart Building, 321 McIver Street, Greensboro, NC 27402-6170.

E-mail: [email protected]; [email protected] DOI: 10.1175/JTECH-D-13-00210.1 (ProQuest: Appendix omitted.) (c) 2014 American Meteorological Society

[ Back To TMCnet.com's Homepage ]