(Ebook) Contemporary Statistical Models For The Plant and Soil Sciences by Oliver Schabenberger Francis J. Pierce ISBN 9781584881117, 1584881119
(Ebook) Contemporary Statistical Models For The Plant and Soil Sciences by Oliver Schabenberger Francis J. Pierce ISBN 9781584881117, 1584881119
https://ebooknice.com/product/statistical-methods-for-spatial-data-
analysis-10867866
(Ebook) Basic Statistical Methods and Models for the Sciences by Judah
Rosenblatt (Author) ISBN 9780367396459, 9780367827960, 9781315273754,
9781351990240, 9781351993128, 9781420057706, 9781482285659,
9781584881476
https://ebooknice.com/product/basic-statistical-methods-and-models-
for-the-sciences-12052750
https://ebooknice.com/product/plant-nutrition-and-soil-fertility-
manual-4181698
(Ebook) Laboratory Guide for Conducting Soil Tests and Plant Analysis
by J. Benton Jones Jr. ISBN 9780849302060, 9781420025293, 0849302064,
1420025295
https://ebooknice.com/product/laboratory-guide-for-conducting-soil-
tests-and-plant-analysis-1330772
(Ebook) Agricultural Automation Fundamentals and Practices by Pierce,
Francis J.; Zhang, Qin ISBN 9781439880586, 9781448659814, 1439880581,
1448659817
https://ebooknice.com/product/agricultural-automation-fundamentals-
and-practices-32704576
https://ebooknice.com/product/gadamer-and-ricoeur-critical-horizons-
for-contemporary-hermeneutics-2624782
https://ebooknice.com/product/plant-nutrition-and-soil-fertility-
manual-second-edition-5277674
https://ebooknice.com/product/building-statistical-models-in-python-
develop-useful-models-for-regression-classification-time-series-and-
survival-analysis-55316492
CONTEMPORARY
STATISTICAL MODELS
for the Plant and Soil Sciences
Oliver Schabenberger
Francis J. Pierce
CRC PR E S S
Boca Raton London New York Washington, D.C.
© 2002 by CRC Press LLC
1405/disclaimer Page 1 Tuesday, October 2, 2001 9:43 AM
Schabenberger, Oliver.
Contemporary statistical models for the plant and soil sciences / Oliver Schabenberger
and Francis J. Pierce.
p. cm.
Includes bibliographical references (p. ).
ISBN 1-58488-111-9 (alk. paper)
1. Plants, Cultivated—Statistical methods. 2. Soil science—Statistical methods. I.
Pierce, F. J. (Francis J.) II. Title.
This book contains information obtained from authentic and highly regarded sources. Reprinted material is quoted with
permission, and sources are indicated. A wide variety of references are listed. Reasonable efforts have been made to publish
reliable data and information, but the author and the publisher cannot assume responsibility for the validity of all materials
or for the consequences of their use.
Neither this book nor any part may be reproduced or transmitted in any form or by any means, electronic or mechanical,
including photocopying, microfilming, and recording, or by any information storage or retrieval system, without prior
permission in writing from the publisher.
The consent of CRC Press LLC does not extend to copying for general distribution, for promotion, for creating new works,
or for resale. Specific permission must be obtained in writing from CRC Press LLC for such copying.
Direct all inquiries to CRC Press LLC, 2000 N.W. Corporate Blvd., Boca Raton, Florida 33431.
Trademark Notice: Product or corporate names may be trademarks or registered trademarks, and are used only for
identification and explanation, without intent to infringe.
Preface
About the Authors
1 Statistical Models
1.1 Mathematical and Statistical Models
1.2 Functional Aspects of Models
1.3 The Inferential Steps — Estimation and Testing
1.4 >-Tests in Terms of Statistical Models
1.5 Embedding Hypotheses
1.6 Hypothesis and Significance Testing — Interpretation of the : -Value
1.7 Classes of Statistical Models
1.7.1 The Basic Component Equation
1.7.2 Linear and Nonlinear Models
1.7.3 Regression and Analysis of Variance Models
1.7.4 Univariate and Multivariate Models
1.7.5 Fixed, Random, and Mixed Effects Models
1.7.6 Generalized Linear Models
1.7.7 Errors in Variable Models
2 Data Structures
2.1 Introduction
2.2 Classification by Response Type
2.3 Classification by Study Type
2.4 Clustered Data
2.4.1 Clustering through Hierarchical Random Processes
2.4.2 Clustering through Repeated Measurements
2.5 Autocorrelated Data
2.5.1 The Autocorrelation Function
2.5.2 Consequences of Ignoring Autocorrelation
2.5.3 Autocorrelation in Designed Experiments
2.6 From Independent to Spatial Data — a Progression of Clustering
5 Nonlinear Models
5.1 Introduction
5.2 Models as Laws or Tools
5.3 Linear Polynomials Approximate Nonlinear Models
5.4 Fitting a Nonlinear Model to Data
5.4.1 Estimating the Parameters
5.4.2 Tracking Convergence
5.4.3 Starting Values.
5.4.4 Goodness-of-Fit
5.5 Hypothesis Tests and Confidence Intervals
5.5.1 Testing the Linear Hypothesis
5.5.2 Confidence and Prediction Intervals
5.6 Transformations
5.6.1 Transformation to Linearity
5.6.2 Transformation to Stabilize the Variance
5.7 Parameterization of Nonlinear Models
5.7.1 Intrinsic and Parameter-Effects Curvature
5.7.2 Reparameterization through Defining Relationships
5.8 Applications
5.8.1 Basic Nonlinear Analysis with The SAS® System — Mitscherlich's
Yield Equation
5.8.2 The Sampling Distribution of Nonlinear Estimators —
the Mitscherlich Equation Revisited
5.8.3 Linear-Plateau Models and Their Relatives — a Study of Corn
Yields from Tennessee
5.8.4 Critical R S$ Concentrations as a Function of Sampling Depth —
Comparing Join-Points in Plateau Models
5.8.5 Factorial Treatment Structure with Nonlinear Response
5.8.6 Modeling Hormetic Dose Response through Switching Functions
5.8.7 Modeling a Yield-Density Relationship
5.8.8 Weighted Nonlinear Least Squares Analysis with
Heteroscedastic Errors
To the Reader
Statistics is essentially a discipline of the twentieth century, and for several decades it was
keenly involved with problems of interpreting and analyzing empirical data that originate in
agronomic investigations. The vernacular of experimental design in use today bears evidence
of the agricultural connection and origin of this body of theory. Omnipresent terms, such as
block or split-plot, emanated from descriptions of blocks of land and experimental plots in
agronomic field designs. The theory of randomization in experimental work was developed
by Fisher to neutralize in particular the spatial effects among experimental units he realized
existed among field plots. Despite its many origins in agronomic problems, statistics today is
often unrecognizable in this context. Numerous recent methodological approaches and
advances originated in other subject-matter areas and agronomists frequently find it difficult
to see their immediate relation to questions that their disciplines raise. On the other hand,
statisticians often fail to recognize the riches of challenging data analytical problems
contemporary plant and soil science provides. One could gain the impressions that
• statistical methods of concern to plant and soil scientists are completely developed and
understood;
• the analytical tools of classical statistical analysis learned in a one- or two-semester
course for non-statistics majors are sufficient to cope with data analytical problems;
• recent methodological work in statistics applies to other disciplines such as human
health, sociology, or economics, and has no bearing on the work of the agronomist;
• there is no need to consider contemporary statistical methods and no gain in doing so.
These impressions are incorrect. Data collected in many investigations and the circum-
stances under which they are accrued often bear little resemblance to classically designed ex-
periments. Much of the data analysis in the plant and soil sciences is nevertheless viewed in
the experimental design framework. Ground and remote sensing technology, yield monito-
ring, and geographic information systems are but a few examples where analysis cannot
necessarily be cast, nor should it be coerced, into a standard analysis of variance framework.
As our understanding of the biological/physical/environmental/ecological mechanisms in-
creases, we are more and more interested in what some have termed the space/time dynamics
of the processes we observe or set into motion by experimentation. It is one thing to collect
data in space and/or over time, it is another matter to apply the appropriate statistical tools to
infer what the data are trying to tell us. While many of the advances in statistical
methodologies in past decades have not explicitly focused on agronomic applications, it
would be incorrect to assume that these methods are not fruitfully applied there. Geostatistical
methods, mixed models for repeated measures and longitudinal data, generalized linear
models for non-normal (= non-Gaussian) data, and nonlinear models are cases in point.
To the User
Contemporary statistical models cannot be appreciated to their full potential without a
good understanding of theory. Hence, we place emphasis on that. They also cannot be applied
to their full potential without the aid of statistical software. Hence, we place emphasis on that.
The main chapters are roughly equally divided between coverage of essential theory and
applications. Additional theoretical derivations and mathematical details needed to develop a
deeper understanding of the models can be found on the companion CD-ROM. The choice to
focus on The SAS® System for calculations was simple. It is, in our opinion, the most
powerful statistical computing platform and the most widely available and accepted com-
puting environment for statistical problems in academia, industry, and government. In rare
cases when procedures in SAS® were not available and macros too cumbersome we
employed the S-PLUS® package, in particular the S+SpatialStats® module. The important
portions of the executed computer code are shown in the text along with the output. All data
sets and SAS® or S-PLUS® codes are contained on the CD-ROM.
To the Instructor
This text is both a reference and textbook and was developed with a reader in mind who
has had a first course in statistics, covering simple and multiple linear regression, analysis of
variance, who is familiar with the principles of experimental design and is willing to absorb a
Francis J. Pierce is the director of the center for precision agricultural systems at Washington
State University, located at the WSU Irrigated Agriculture
Research & Extension Center (IRAEC) in Prosser, Washington.
He is also a professor in the departments of crop and soil sciences
and biological systems engineering and directs the WSU Public
Agricultural Weather System. Dr. Pierce received his M.S. and
Ph.D. degrees in soil science from the University of Minnesota in
1980 and 1984. He spent the next 16 years at Michigan State
University, where he has served as professor of soil science in the
department of crop and soil sciences since 1995. His expertise is
in soil management and he has been involved in the development
and evaluation of precision agriculture since 1991. The Center for
Precision Agricultural Systems was funded by the Washington
Legislature as part of the University's Advanced Technology Initiative in 1999. As center
director, Dr. Pierce's mission is to advance the science and practice of precision agriculture in
Washington. The center's efforts will support the competitive production of Washington's
agricultural commodities, stimulate the state's economic development, and protect the region's
environmental and natural resources. Dr. Pierce has edited three other books, Soil
Management for Sustainability, Advances in Soil Conservation, and The State of Site-Specific
Management for Agriculture.
© 2002 by CRC Press LLC
Chapter 1
Statistical Models
“A theory has only the alternative of being right or wrong. A model has a
third possibility: it may be right, but irrelevant.” Manfred Eigen. In the
Physicist's Conception of Nature (Jagdish Mehra, Ed.) 1973.
The ability to represent phenomena and processes of the biological, physical, chemical, and
social world through models is one of the great scientific achievements of humankind. Scien-
tific models isolate and abstract the elementary facts and relationships of interest and provide
the logical structure in which a system is studied and from which inferences are drawn.
Identifying the important components of a system and isolating the facts of primary interest is
necessary to focus on those aspects relevant to a particular inquiry. Abstraction is necessary
to cast the facts in a logical system that is concise, deepens our insight, and is understood by
others to foster communication, critique, and technology transfer. Mathematics is the most
universal and powerful logical system, and it comes as no surprise that most scientific models
in the life sciences or elsewhere are either developed as mathematical abstractions of real phe-
nomena or can be expressed as such. A purely mathematical model is a mechanistic
(= deterministic) device in that for a given set of inputs, it predicts the output with absolute
certainty. It leaves nothing to chance. Beltrami (1998, p. 86), for example, develops the
following mathematical model for the concentration ! of a pollutant in a river at point = and
time >:
!a=ß >b !! a= ->bexpe .>f. [1.1]
where / is a random variable with mean !, variance 5 # , and some probability distribution.
Allowing for the random deviation /, model [1.2] now states explicitly that !a=ß >b is a ran-
dom variable and the expression !! a= ->bexpe .>f is the expected value or average
pollutant concentration,
Ec!a=ß >bd !! a= ->bexpe .>f.
Olkin et al. (1978, p. 4) conclude: “The assumption that chance phenomena exist and can
be described, whether true or not, has proved valuable in almost every discipline.” Of the
many reasons for incorporating stochastic elements in scientific models, an incomplete list
includes the following.
• The model is not correct for a particular observation, but correct on average.
• Omissions and assumptions are typically necessary to abstract a phenomenon.
• Even if the nature of all influences were known, it may be impossible to measure or
even observe all the variables.
• Scientists do not develop models without validation and calibration with real data. The
innate variability (nonconstancy) of empirical data stems from systematic and random
effects. Random measurement errors, observational (sampling) errors due to sampling
a population rather than measuring its entirety, experimental errors due to lack of
homogeneity in the experimental material or the application of treatments, account for
stochastic variation in the data even if all systematic effects are accounted for.
• Randomness is often introduced deliberately because it yields representative samples
from which unbiased inferences can be drawn. A random sample from a population
will represent the population (on average), regardless of the sample size. Treatments
are assigned to experimental units by a random mechanism to neutralize the effects of
unaccounted sources of variation which enables unbiased estimates of treatment
means and their differences (Fisher 1935). Replication of treatments guarantees that
experimental error variation can be estimated. Only in combination with
randomization will this estimate be free of bias.
• Stochastic models are often more parsimonious than deterministic models and easier to
study. A deterministic model for the germination of seeds from a large lot, for
example, would incorporate a plethora of factors, their actions and interactions. The
plant species and variety, storage conditions, the germination environment, amount of
non-seed material in the lot, seed-to-seed differences in nutrient content, plant-to-plant
interactions, competition, soil conditions, etc. must be accounted for. Alternatively, we
can think of the germination of a particular seed from the lot as a Bernoulli random
variable with success (germination) probability 1. That is, if ]3 takes on the value " if
seed 3 germinates and the value ! otherwise, then the probability distribution of ]3 is
simply
1 C3 "
:aC3 b
"1 C3 ! .
Statistical models, in terminology that we adopt for this text, are stochastic models that
contain unknown constants (parameters). In the river pollution example, the model
!a=ß >b !! a= ->bexpe .>f /, Ec/d !ß Varc/d 5 #
is a stochastic model if all parameters a!! ß -ß .ß 5 # b are known. (Note that / is not a constant
but a random variable. Its mean and variance are constants, however.) Otherwise it is a statis-
tical model and those constants that are unknown must be estimated from data. In the seed
germination example, the germination probability 1 is unknown, hence the model
1 C3 "
: a C3 b
"1 C3 !
is a statistical one. The parameter 1 is estimated based on a sample of 8 seeds from the lot.
This usage of the term parameter is consistent with statistical theory but not necessarily with
modeling practice. Any quantity that drives a model is often termed a parameter of the
model. We will refer to parameters only if they are unknown constants. Variables that can be
measured, such as plant density in the model of a yield-density relationship are, not parame-
ters. The rate of change of plant yield as a function of plant density is a parameter.
• Statistical models represent a mechanism from which data with the same
statistical properties as the observed data can be generated.
The model errors /3 are assumed to be independent and identically distributed a33. b accor-
ding to a Gaussian distribution (we use this denomination instead of Normal distribution
throughout) with mean ! and variance 5 # . As a consequence, ]3 is also distributed as a
Gaussian random variable with mean Ec]3 d "! "" B3 and variance 5 # ,
]3 µ K"! "" B3 ß 5 # .
The ]3 are not identically distributed because their means are different, but they remain inde-
pendent (a result of drawing a random sample). Since a Gaussian distribution is completely
specified by its mean and variance, the distribution of the ]3 is completely known, once
values for the parameters "! , "" , and 5 # are known. For many statistical purposes the assump-
tion of Gaussian errors is more than what is required. To derive unbiased estimators of the
intercept "! and slope "" , it is sufficient that the errors have zero mean. A simple linear
regression model with lesser assumptions than [1.3] would be, for example,
]3 "! "" B3 /3 , /3 µ 33. !ß 5 # .
The errors are assumed independent zero mean random variables with equal variance (homo-
scedastic), but their distribution is otherwise not specified. This is sometimes referred to as
the first-two-moments specification of the model. Only the mean and variance of the ]3 can
be inferred:
Ec]3 d "! "" B3
Varc]3 d 5 # .
If the parameters "! , "" , and 5 # were known, this model would be an incomplete description
of the distributional properties of the response ]3 . Implicit in the description of distributional
properties is a separation of variability into known sources, e.g., the dependency of ] on B,
and unknown sources (error) and a description of the form of the dependency. Here, ] is
assumed to depend linearly on the regressor. Expressing which regressors ] depends on indi-
vidually and simultaneously and how this dependency can be crafted mathematically is one
important aspect of statistical modeling.
To conceptualize what constitutes a useful statistical model, we appeal to what we con-
sider the most important functional aspect. A statistical model provides a mechanism to gene-
rate the essence of the data such that the properties of the data generated under the model are
statistically equivalent to the observed data. In other words, the observed data can be con-
sidered as one particular realization of the stochastic process that is implied by the model. If
the relevant features of the data cannot be realized under the assumed model, it is not useful.
The upper left panel in Figure 1.1 shows 8 #" yield observations as a function of the
amount of nitrogen fertilization. Various candidate models exist to model the relationship
between plant yield and fertilizer input. One class of models, the linear-plateau models
Language: English
Compliments of
The San Francisco and North Pacific Railway,
The Picturesque Route of California.
POMO INDIAN IN DANCING COSTUME MADE
FROM THE FEATHERS OF THE EAGLE
F or forty-six miles The San Francisco and North Pacific Railway, the
Picturesque Route of California, skirts the Russian River. A few of
the many lovely views en-route are shown in this little brochure.
This beautiful country was once occupied by the Pomo Indians, a
brave and flourishing tribe, of whom but very few families remain.
Many of the local features are hallowed by the Indian folk-lore. The
legend of Dah-nol-yo—Squaw Rock, so charmingly written by Mrs. H.
M. Carpenter, of Ukiah, is presented to the reader.
DAH-NOL-YO—Squaw Rock.
THE LEGEND OF DAH-NOL-YO—
SQUAW ROCK.
Among the many extant Indian Legends of Squaw Rock, none has
appeared which relates to the face of stone, on the summit of this
noted mountain. The following is substantially as given by a direct
descendant of Con-che Tadi, a Pomo Chief, who lived in the shadow
of Dah-nol-yo, when time was young.
This Chief was particularly noted for preserving the peace and
harmony of his own tribe and others with which he came in contact.
His faithful squaw Ma-cha-ta (Kingfisher) devoted her time to her
two sons, Ca-ba-ba (Brave one) and Bu-tah-so (Little Bear); and as
is the custom at the present day, gathered the buckeye and acorn,
as a means of subsistence, while her lord smoked his pipe or
enjoyed a nap in the sunshine. As the sons approached manhood
they were inseparable companions and partook of the peaceful
disposition of their illustrious father.
TWIN ROCKS, ON THE RUSSIAN RIVER—NEAR DAH-NOL-
YO.
One day, after vainly endeavoring to catch enough fish for dinner, a
very beautiful Indian woman suddenly appeared before them. When
she merely looked into the water, fish and turtles came out upon the
sand and lay at her feet. She pointed to the east and west, and all
kinds of birds and beasts were at once before her. Ca-ba-ba, the
elder son, who was to be Chief of the tribe when the Tadi went to
the “Happy Hunting Ground,” conceived the idea of making this
young woman his wife, feeling assured that at least a good living
was in store for him through this arrangement. Divining his thoughts
(which did not meet with her approval, as it is supposed she was
already enamored of Bu-tah-so,) she declared, in consequence of his
selfish motives, he should never be the Chief of his tribe, which so
enraged Ca-ba-ba that he threw a fishing spear at her. This missed
its fair mark and was buried in the bosom of Bu-tah-so. All was
immediately in darkness—thunder rolled, lightning flashed and the
whole earth was convulsed. From out the storm the woman’s voice
was heard pronouncing maledictions on the head of Ca-ba-ba and
commanding him to hide himself in Dah-nol-yo, and do penance for
all time. While he suffered in darkness, she said the face of Bu-tah-
so should stand upon his sepulchre as a warning to all evil doers.
The disappearance of the Chief’s sons and the face of one of them,
engraved in stone, upon the mountain overlooking their home, so
frightened the tribe that they fled to the north, and no tribe has
since had the temerity to live in sight of the face on Dah-nol-yo.
CAMP SCENE ON RUSSIAN RIVER ON LINE SAN
FRANCISCO AND NORTH PACIFIC RY.
Here the legend ends, and we take up the threads and weave a little
net of well authenticated facts. When the San Francisco and North
Pacific Railway wound through the picturesque canyon of Che-hool-
be-da-dah (Russian River) a tunnel pierced the base of Dah-nol-yo,
thus opening the door of Ca-ba-ba’s prison and liberating him. In
fear of detection, and a return to his solitary quarters, he cautiously
crept to his childhood’s home only to find desolation. Turning his
eyes to the summit of Dah-nol-yo he was transfixed with terror at
beholding the face of Bu-tah-so looking grimly down upon him. Held
by an invisible power he gazes on, unable to turn his eyes even to
the bear, deer, quail and squirrel, that venture so near, peering in
wonder at the lone man who is to suffer on through all eternity as
he keeps his silent vigil of Dah-nol-yo.
POMO HUT.
RUSSIAN RIVER, GUERNEVILLE IN THE DISTANCE.
BOATING ON RUSSIAN RIVER.
BOHEMIAN GROVE—NEAR GUERNEVILLE. These trees
range from 200 to 300 feet in height.
T he way to reach the Russian River and its scenic surroundings is
by the San Francisco and North Pacific Railway, the Picturesque
Route of California.
Continuous riding over this road does not become monotonous,
owing to the variety and constant change of scenery, traversing, as it
does, Marin, Sonoma, Russian River, Sanel and Ukiah Valleys.
The hand of man, in dotting the country with orchards, vineyards,
grain fields, homes, towns and villages, has most beautifully blended
the domestic with nature’s wildness.
From San Francisco to Ukiah, the terminus of the road, is only 113
miles.
The Counties tributary, Marin, Sonoma, Mendocino and Lake, are
known as the most fertile on the Pacific Coast. The winds from the
Pacific Ocean, tempered by the bordering coast range and redwood
forests, produce a mild and salubrious climate so enjoyable to the
healthy and refreshing to the weak.
The Hotels and Resorts, and the numerous and various mineral
springs, afford every opportunity for health and recreation.
This section is specially desirable to the home-seeker, owing to the
variety and profusion of products which can be raised, and without
irrigation; for in this section, renowned for its richness and fertility
and wonderful soil products, there is not one acre under
irrigation.
Ticket Office, 650 Market Street, Chronicle Building.
A. W. Foster, H. C. Whiting,
Pres. and Gen’l Manager. General Superintendent.
R. X. Ryan,
Gen’l Pass’r and Frt. Agt.
Transcriber’s Notes:
The author is Helen McCowen Carpenter (1839-1917).
This pamphlet is not dated, but a review of it appeared in The San Francisco Call, Volume
81, Number 163, May 12, 1897, p. 7.
Punctuation has been made consistent.
The paragraphs beginning “One day, after ...” and “Here the legend ends, ...” have been
transposed.
*** END OF THE PROJECT GUTENBERG EBOOK THE LEGEND OF
DAH-NOL-YO, SQUAW ROCK ***
Updated editions will replace the previous one—the old editions will
be renamed.
Our website is not just a platform for buying books, but a bridge
connecting readers to the timeless values of culture and wisdom. With
an elegant, user-friendly interface and an intelligent search system,
we are committed to providing a quick and convenient shopping
experience. Additionally, our special promotions and home delivery
services ensure that you save time and fully enjoy the joy of reading.
ebooknice.com