3. what hypothesis states no equality or existence of differences, relationship, or effect? *

The null hypothesis is a typical statistical theory which suggests that no statistical relationship and significance exists in a set of given single observed variable, between two sets of observed data and measured phenomena.

From: Mineral Exploration, 2013

Statistical and Geostatistical Applications in Geology

Swapan Kumar Haldar, in Mineral Exploration (Second Edition), 2018

9.2.14 The Null Hypothesis

The null hypothesis is a characteristic arithmetic theory suggesting that no statistical relationship and significance exists in a set of given, single, observed variables between two sets of observed data and measured phenomena. The hypotheses play an important role in testing the significance of differences in experiments and between observations. H0 symbolizes the null hypothesis of no difference. It presumes to be true until evidence indicates otherwise. Let us take two sets of mill feed silver samples from Table 9.5 and compare the mean grade between set and population and between two sets. The null hypothesis presumes and states that:

Table 9.5. Average Monthly Mill Feed Silver Grade (g/t) of a Zinc-Lead-Silver Mine

SampleSet ISet IIDifference (d) Set (I–II)(Difference)2
April 2010 49 51 −2 4
May 38 31 7 49
June 33 33 0 0
July 41 41 0 0
August 43 40 3 9
September 52 43 9 81
October 45 51 −6 36
November 41 32 9 81
December 36 35 1 1
January 2011 33 30 3 9
N 10 10 10 10
SUM (Ʃ) 411 387 24 270
AVG(X¯) 41.1 38.7 2.4 27.0
VAR (S2) 40.77 61.57 23.60 1078.67
STD (S) 6.38 7.85 4.86 32.84

H0:μ1=μ0

where H0 =  null hypothesis of no difference, μ1 =  mean of population 1, and μ0 =  mean of population 2.

The null hypothesis states that the mean μ1 of the parent population from which the samples are drawn is equal to or not different from the mean of the other population μ0. The samples are drawn from the same population such that the variance and shape of the distributions are also equal. Alternative statistical applications such as t, F, and chi-square can only reject a null hypothesis or fail to reject it. The evidence can state that the mean of the population from which the samples are drawn does not equal the specified population mean and is expressed as:

H0:μ1≠μ0

Read full chapter

URL: https://www.sciencedirect.com/science/article/pii/B9780128140222000095

Spatial Distribution

M.K. Borregaard, ... G. Nachman, in Encyclopedia of Ecology (Second Edition), 2008

As the null hypothesis, it is assumed that individuals in a population are randomly distributed among the n sampling units of a sample. If this is the case, it is expected that the variance should equal the average so that the ‘index of dispersion’, s2/x̄, is approximately equal to 1. If the ratio exceeds 1, it indicates that the population has a patchy (or clumped) distribution whereas a value less than unity indicates an even (or regular) distribution. However, since data originate from sampling, they will always be associated with some variation, so it is likely that some deviation in s2/x̄ from unity will occur even if the underlying distribution is random. Especially if the sample size n is small, s2/x̄ will exhibit large variation due to sampling noise. A χ2-test can be used for testing whether s2/x̄ deviates significantly from 1 since χ2= (n−1)s2/x̄ with n−1 degrees of freedom. It should be noted that the test is two-tailed (in contrast to the majority of cases where χ2-tests are used) since values significantly smaller or larger than n−1 can lead to rejection of the null hypothesis.

Read full chapter

URL: https://www.sciencedirect.com/science/article/pii/B9780444637680006594

Frequentist Statistical Inference

Daniel S. Wilks, in Statistical Methods in the Atmospheric Sciences (Fourth Edition), 2019

Chi-Square Test

The chi-square (χ2) test is a simple and common goodness-of-fit test. It essentially compares a data histogram with the probability distribution (for discrete variables) or probability density (for continuous variables) function. The χ2 test actually operates more naturally for discrete random variables, since to implement it the range of the data must be divided into discrete classes, or bins. When alternative tests are available for continuous data they are usually more powerful, presumably at least in part because the rounding of data into bins, which may be severe, discards information. However, the χ2 test is easy to implement and quite flexible, being for example, very straightforward to implement for multivariate data.

For continuous random variables, the probability density function is integrated over each of some number of MECE classes to obtain probabilities for data values in each class. Regardless of whether the data are discrete or continuous, the test statistic involves the counts of data values falling into each class in relation to the computed theoretical probabilities,

(5.14)χ2=∑classes#Observed−#Expected2#Expected=∑classes#Observed−nPrdata in class2n Prdata in class.

In each class, the number (#) of data values expected to occur, according to the fitted distribution, is simply the probability of occurrence in that class multiplied by the sample size, n. This number of expected occurrences need not be an integer value. If the fitted distribution is very close to the data distribution, the expected and observed counts will be very close for each class, and the squared differences in the numerator of Equation 5.14 will all be very small, yielding a small χ2. If the fit is not good, at least a few of the classes will exhibit large discrepancies. These will be squared in the numerator of Equation 5.14 and lead to large values of χ2. It is not necessary for the classes to be of equal width or equal probability, but classes with small numbers of expected counts should be avoided. Sometimes a minimum of five expected events per class is imposed.

Under the null hypothesis that the data were drawn from the fitted distribution, the sampling distribution for the test statistic is the χ2 distribution with parameter ν = (# of classes − # of parameters fit − 1) degrees of freedom. The test will be one-sided, because the test statistic is confined to positive values by the squaring process in the numerator of Equation 5.14, and small values of the test statistic support H0. Right-tail quantiles for the χ2 distribution are given in Table B.3.

Example 5.4

Comparing Gaussian and Gamma Distribution Fits Using the χ2 Test

Consider the gamma and Gaussian distributions as candidates for representing the 1933–82 Ithaca January precipitation data in Table A.2. The approximate maximum likelihood estimators for the gamma distribution parameters (Equations 4.48 or 4.50a, and Equation 4.49) are α = 3.76 and β = 0.52 in. The sample mean and standard deviation (i.e., the Gaussian parameter estimates) for these data are 1.96 in. and 1.12 in., respectively. The two fitted distributions are illustrated in relation to the data in Figure 4.16. Table 5.3 contains the information necessary to conduct the χ2 tests for these two distributions. The precipitation amounts have been divided into six classes, or bins, the limits of which are indicated in the first row of the table. The second row indicates the number of years in which the January precipitation total was within each class. Both distributions have been integrated over these classes to obtain probabilities for precipitation in each class. These probabilities were then multiplied by n = 50 to obtain the expected number of counts.

Table 5.3. The χ2 Goodness-of-Fit Test Applied to Gamma and Gaussian Distributions for the 1933–82 Ithaca January Precipitation Data

Class< 1″1–1.5″1.5–2″2–2.5″2.5–3″> 3″
Observed #5 16 10 7 7 5
Gamma:
 Probability0.161 0.215 0.210 0.161 0.108 0.145
 Expected #8.05 10.75 10.50 8.05 5.40 7.25
Gaussian:
 Probability0.195 0.146 0.173 0.178 0.132 0.176
 Expected #9.75 7.30 8.65 8.90 6.60 8.80

Expected numbers of occurrences in each bin are obtained by multiplying the respective probabilities by n = 50.

Applying Equation 5.14 yields χ2 = 5.05 for the gamma distribution and χ2 = 14.96 for the Gaussian distribution. As was also evident from the graphical comparison in Figure 4.16, these test statistics indicate that the Gaussian distribution fits these precipitation data substantially less well. Under the respective null hypotheses, these two test statistics are drawn from a χ2 distribution with degrees of freedom ν = 6 − 2 − 1 = 3; because Table 5.3 contains six classes, and two parameters (α and β, or μ and σ, for the gamma or Gaussian, respectively) were fit for each distribution.

Referring to the ν = 3 row of Table B.3, χ2 = 5.05 is smaller than the 90th percentile value of 6.251, so the null hypothesis that the data have been drawn from the fitted gamma distribution would not be rejected even at the 10% level. For the Gaussian fit, χ2 = 14.96 is between the tabulated values of 11.345 for the 99th percentile and 16.266 for the 99.9th percentile, so this null hypothesis would be rejected at the 1% level, but not at the 0.1% level. ⋄

Read full chapter

URL: https://www.sciencedirect.com/science/article/pii/B9780128158234000055

Cardiovascular Toxicology

P.J. Boor, in Comprehensive Toxicology, 2010

6.10.1 Introduction

Morphologic studies of the heart and blood vessels are a critical part of toxicologic investigations. This is true at all stages of such investigations, from the most basic screening studies to detailed investigations of the mechanism of actions of toxins, xenobiotics, and drugs. It is frequently the gross or microscopic observation of a chemical effect on the heart or blood vessels that is the first indication of a chemical’s adverse effects. Morphologic study as a screening tool is only one of its functions, however. Detailed morphologic studies are also employed as accurate measurements of cellular injury in dose–response studies, especially when semiquantitative, quantitative, or morphometric methods are combined with morphologic observation. Furthermore, subcellular studies of organelles and fine structural details add greatly to our understanding of the pathogenetic mechanisms of toxin-induced injury of the cardiovascular system. Thus, at all levels of study, from simple screening to in-depth characterization of pathogenetic mechanisms, morphologic studies play an important role in toxicologic investigations.

Morphologic methods may well be some of the least well understood of those methods applied in toxicology or other disciplines. Frequently, morphologic methods are viewed as confirmatory, or mere add-ons to biochemical, functional, or other experimental protocols and methods. Also, morphologic methods may be easily and unknowingly misapplied, without proper controls or optimized conditions. Often, inappropriate or less than adequate morphologic methods are chosen in toxicologic studies. These issues can be easily avoided if the necessary elements of morphologic studies are adhered to. Therefore, a brief review of the important aspects of applying morphologic methods is warranted at the outset of this chapter.

6.10.1.1 Elements of a Proper Morphologic Study

It is too often incorrectly assumed that morphologic examination of tissues is somehow different from other scientific methodologies, such as biochemical, biophysical, genetic, or epigenetic techniques. Morphologic data collected during a toxicologic investigation is just like any other form of data and should be treated accordingly. The elements of sound scientific investigation hold for morphologic studies, and these basic elements will be restated with specific reference to morphologic studies.

6.10.1.1.1 Hypothesis

The essential null hypothesis of most morphologic studies is that the morphology of cells and tissues will be unaltered by an experimental manipulation. Experiments must be planned and specific methods set out before beginning to examine whether statistically significant change has occurred, that is, to test the underlying hypothesis. When a structural alteration in cells and/or tissues is recognized and proven, this change is termed a lesion (or lesions), which potentially can be correlated to toxin exposure through dose–response and/or time-course studies.

6.10.1.2 Controls

The inclusion of appropriate controls is essential to determine whether a lesion(s) is present, and a lesion is caused by or associated with an experimental manipulation. Too often, investigators approach the experimental situation with the sole purpose of characterizing a lesion, without adequately examining control specimens for comparison. All appropriate controls must be included, and in adequate numbers, to make reliable, significant comparisons to the experimental. Controls must include groups administered vehicle alone, or any treatment/experimental condition that is not the dependent variable under study. For example, if a cotreatment is given to protect against (or exacerbate) a toxin-induced lesion, an essential control group given that cotreatment alone must be analyzed. Otherwise, it would not be known whether the cotreatment itself is toxic.

The normal morphologic structure of the heart and various blood vessels has been well described, and many standard references are available (Ferrans 1980; Leak 1980; Rhodin 1974). Nevertheless, reliance on even an expert’s experience to remember all criteria for a normal, unaltered control tissue is not adequate, since fine details of normal control morphology are not always easily recollected, and subtle differences will arise that require comparison to the control.

6.10.1.3 Appropriate Choice and Equivalent Handling of Control and Experimental Groups

Control specimens must be identical in all respects to the experimental with regard to animal species, handling and animal husbandry, diet, time, and method of sacrifice, and especially fixation and other tissue methods (tissue sampling, processing, sectioning, and staining). It is not proper, for instance, to perform special studies on the experimental tissue without including the control for comparison. Similarly, numbers of controls must be comparable to the experimental.

6.10.1.4 Blinded Observation

Observers who are responsible for ascertaining whether a lesion is present should be blinded to the specimen at the time of examination. This is best done in a double-blinded fashion, that is, where a third party encodes specimens and the code is not broken until all observations have been made and recorded. Criteria for determining or defining the characteristics of a lesion should be clearly agreed upon at the outset of the study, and the conclusions are strengthened if more than one observer is used.

6.10.1.5 Grading of Lesions – Semiquantitative and Quantitative Methods

Sometimes, it is only possible to determine whether lesions are present or absent, without a more qualitative or quantitative assessment possible. This may be adequate for addressing the hypothesis as long as it is determined that such lesions do not occur in controls. An accurate means of determining the severity of lesions is preferable, however. A common means of accomplishing this is for all observers to agree on a grading system for the lesion; this system should be recorded and adhered to. Often, grading systems for lesions can only be devised after a lesion is found in preliminary studies. Alternatively, once a lesion is determined to be present in experimental animals, observers may repeat either the whole experiment or the observations on available samples (again, in a blinded fashion).

Typical grading systems might be devised on a numerical scale such as 1–4 or 1–5, as long as differences in grades and criteria for each grade are evident and are agreed upon by all observers. Such systems may lend themselves to statistical analysis by Student’s t-test or ANOVA, given adequate numbers within groups, but only within reason (means should not be taken out to multiple decimal points).

Whenever possible, quantitative rather than qualitative methods should be applied to assess lesions. These methods include counting structures/cell or unit area of the specimen, point-counting techniques, or any of a variety of available computerized morphometric analytic schemes (Chung et al. 2007; Gong et al. 2008; Ivnitski-Steele et al. 2004). Again, such methods must be applied equally to control and experimental situations in a blinded fashion in order to draw significant conclusions.

6.10.1.6 Conducting a Morphologic Evaluation

Morphologic evaluations in toxicologic studies many times may consist of stand-alone experiments whose objective is to ascertain specific pathologic lesions associated with a chemical exposure. Often, however, morphologic evaluations serve as adjuncts to chemical, genetic, or pathophysiologic experiments, or are performed simultaneously with other studies in preliminary testing when a toxic effect has not yet been defined. In addition, morphologic evaluation may be done in order to simply confirm that a cardiovascular specimen is appropriate or adequate for other studies, as should be done in studies of isolated vessels (Conklin et al. 2006); in such cases, the confirmatory morphologic data may not necessarily be illustrated in the final publication of the study. No matter what the objectives or specific experimental goals are, the basic principles of sound scientific inquiry in the application of morphologic methods remain unchanged.

Read full chapter

URL: https://www.sciencedirect.com/science/article/pii/B9780080468846007089

Hypothesis Testing

R. Haines-Young, R. Fish, in International Encyclopedia of Human Geography, 2009

In this case the null hypothesis is that there is no difference between dairy farmers and other farmers in the extent to which they recognize nitrates as a potential pollutant arising from their land. For these data, the differences between the observed and expected counts are much smaller. The Pearson Chi-Square statistic (χ2) is equal to 1.131, and with one degree of freedom, we would exceed this value to be exceeded by chance with a probability of about 0.29 (i.e., 29 times out of 100). Thus, we cannot reject the null hypothesis. Dairy farmers appear to be no different to other farmers with respect to nitrates.

Read full chapter

URL: https://www.sciencedirect.com/science/article/pii/B9780080449104004545

Fundamental Principals of Statistical Inference

Darryl I. MacKenzie, ... James E. Hines, in Occupancy Estimation and Modeling (Second Edition), 2018

3.5.1 Background and Definitions

In hypothesis testing there is a null hypothesis and an alternative hypothesis. The idea is to develop a test statistic that has a known distribution under the null hypothesis and see if the observed value of the test statistic based on the data is unusual when compared against this known distribution.

Null Hypothesis (H0): θ=θ0

Alternative Hypothesis (HA): θ≠θ 0

The alternative hypothesis may be one-sided (e.g., θ>θ0) or two-sided, but often takes the two-sided form presented above. The first step after one defines the null and alternative hypotheses is to define a test statistic that has a known distribution under the null hypothesis. The next step is to define a critical region of values for the test statistic where the probability of obtaining values in this region is equal to α (the size of the test).

Hypothesis testing leads to a dichotomous decision; one either rejects the null hypothesis or not. The investigator's decision (to reject or not) can be either right or wrong with respect to the truth. Therefore, statisticians provide a nomenclature for the errors that can be made, depending on the decision of whether or not to reject the null hypothesis.

Type 1 Error: The probability that the null hypothesis is rejected when it is true (α).

Type 2 Error: The probability that the null hypothesis is not rejected when it is false (β).

Power of a Test: The probability that the null hypothesis is rejected when it is false is called the power of the test and is (1−β). A good test will have high power even for values of θ close to, but distinct from, θ0.

Read full chapter

URL: https://www.sciencedirect.com/science/article/pii/B9780124071971000041

The Role of Information Provision on Public GAP Standard Adoption

D. Jourdain, ... G. Shivakoti, in Redefining Diversity & Dynamics of Natural Resources Management in Asia, Volume 1, 2017

18.4.2 First Time Adoption

When testing the null hypothesisH0 := 0, the Wald test statistics of 2.59 (Prob(> χ2) = 0.107), and the LR statistic of 0.437 (Prob(> χ2) = 0.51) do not allow us to reject the null hypothesis of no correlation. However, since the Wald test is close to a 10% threshold, and that outright rejection would be more convincing than inability to reject, estimation results for the individual probit models and for the RBPM are presented in Table 18.5. The two formulations are giving similar results in terms of signs of the relationship between adoption and potential explanatory variables. However, the RBPM formulation is showing higher significance of the relationship with the variables groups and training. Variables included in both equations have the same signs in the two equations, meaning the potentially indirect effects are reinforcing effects on adoption.

Table 18.5. Maximum Likelihood Estimates of Separate and Recursive Probit Models

VariablesSep. Probits Coefficients (Robust SD)RBPM Coefficients (Robust SD)
(intercept)− 2.35 (0.55)*** − 1.53 (0.61)**
Education (years)0.09 (0.04)** 0.08 (0.03)**
Experience—medium0.42 (0.36) 0.38 (0.31)
Experience—high0.39 (0.36) 0.34 (0.31)
Groups0.38 (0.29) 0.64 (0.26)**
Gov. contacts—frequent1.36 (0.27)*** 1.21 (0.30)***
Know GAP farmers—high0.22 (0.31) 0.19 (0.26)
GAP other channels—yes1.59 (0.39)*** 1.43 (0.37)***
Labor per ha1.85 (0.53)*** 1.75 (0.48)***
O-farm0.34 (0.21) 0.29 (0.19)
Full ownership0.14 (0.25) 0.15 (0.22)
Exp. cost reductions—yes1.18 (0.24)*** 1.01 (0.30)***
Rice training—yes− 0.26 (0.22) − 1.23 (0.39)***
Log likelihood− 75.54
Rice training
Labor per ha0.34 (0.17)* 0.35 (0.17)**
Perception impact—yes0.27 (0.14)* 0.27 (0.14)*
Groups—yes0.92 (0.19)*** 0.93 (0.19)***
0.74 (0.43)
Log likelihood− 140.6 − 215.9
N244 244

Wald test of ρ = 0; χ2(1) = 2.59; Prob > χ2 = 0.107.

Likelihood ratio test of ρ = 0; χ2(1) = 0.59; Prob > χ2 = 0.441.

Standard errors in parentheses; *p < 0.1, **p < 0.05, ***p < 0.01.

The AME of each explanatory variable on adoption and training are presented in Table 18.6. These AME give more meaningful information as they can be interpreted in terms of impact on the probability of adoption, and also integrate the potentially indirect effects captured in the recursive system of equation. Results presented in Table 18.6 were calculated with the hypothesis of no correlations between the two errors. The variables with the strongest positive impact on Q-GAP adoption are related to farm labor available (labha), farmers' affiliations in farmer groups (groups), and farmers connections to sources of information (with an equal strength of the effects of extension contacts Govcont and other sources of information GAPcha).

Table 18.6. Average Marginal Effects of the Dependent Variables on Q-GAP Adoption

VariableDirectIndirectTotalSt. Err.Sig.a
QGAP equation
Education (years)0.016 0.02 0.008 0.047
Experience—medium0.074 0.07 0.069 0.286
Experience—high0.068 0.07 0.066 0.307
Groups—yes0.067 0.41 0.48 0.091 0.000
Gov. contacts—frequent0.290 0.29 0.061 0.000
GAP other channels—yes0.254 0.25 0.050 0.000
Know GAP farmers—high0.039 0.04 0.060 0.518
Labor per ha0.327 0.15 0.48 0.121 0.000
O-farm—yes0.060 0.06 0.041 0.145
Full ownership—yes0.025 0.03 0.046 0.587
Exp. cost reductions—yes0.245 0.25 0.053 0.000
Perception impact—yes − 0.004 0.005 0.38
Rice training—yes− 0.046 − 0.05 0.002 0.000
Training equation
Labor per ha0.111 0.11 0.060 0.063
Groups—yes0.278 0.28 0.046 0.000
Perception impact—yes0.089 0.09 0.046 0.050

ap value of the Wald test.

Education, measured by the number of schooling years of the household head, has a positive statistically significant relationship. This result extends to standard adoption the findings of literature on agricultural technologies adoption that the longer the farmers' schooling experience, the higher the tendency to adopt new technologies (Feder et al., 1985; Chouichom and Yamao, 2010; Liu et al., 2011). Farmers who have been through school are probably more equipped to understand the reason behind Q-GAP efforts and can follow the instructions of the program.

Besides, as the registration also requires participant to record their practices, more educated farmers are probably less impressed by this administrative exercise. However, the magnitude of the relationship is relatively limited (around 2% increase in adoption for an additional schooling year).

In the same way, farmers' experience has a positive (but not significant) relationship with Q-GAP adoption. This is in line with Knowler and Bradshaw (2007) who did not find consistent and clear impacts of experience on adoption of conservation agriculture across the studies they reviewed. If we retain the positive correlation, this indicates that experienced farmers evaluate more positively the potential of the Q-GAP program than inexperienced ones. It concurs with Chouichom and Yamao (2010) who showed that longer experience in farming and more years of education were related to conversion to organic rice farming in Surin Province in Thailand. However, this link is tenuous in our case. Farmers' participations in associations, cooperatives, and groups have positive and highly significant effects on both training and Q-GAP adoption. This confirms results found for adoption of conservation agriculture (Adesina et al., 2000). In our case, the positive effect for Q-GAP adoption is only significant under the RBPM formulation, probably as a result of the endogeneity of the rice-training variable. Common unobservable variables, such as dynamism and dedication to rice agriculture, is likely to explain both training attendance and Q-GAP adoption. Besides, AME results are showing that most of the effect of the variable group on Q-GAP adoption is indirect (via the training variable) reinforcing the possibility of a selection bias.

Frequent contacts with government and extension officers have a positive and statistically significant impact on Q-GAP adoption. This is consistent with the technology adoption literature. Feder et al. (1985), summarizing a large spectrum of adoption studies, concluded that education and extension services contacts improves farmers' ability to adjust to changes. Similarly, Moser and Barrett (2006) found that learning from extension agents influenced the decision to adopt low-input rice production methods. More recently and in a reverse relationship, low rate of adoption of sustainable agriculture in China was linked to inadequate agricultural extension efforts (Liu et al., 2011). A dual relationship may be at work: (a) more contacts are improving farmers' skills as extension officers are transmitting knowledge, but on the other hand, the farmers that maintain close contact with extension offices are probably more dedicated to agriculture. Other channels of information (variable GAPChannel) have also some positive and significant impact on Q-GAP adoption. Other channels in this case included family members, friends, village chief, community leader, experience with GAP for vegetable crops, and local soil doctors (ie, trained volunteers providing soil recommendation services to other farmers in the community).

It was expected that farmers having observed many neighbors adopting Q-GAP would be more likely to adopt: it is very common that community members decide to follow similar management patterns as each may not want to be left out (social cohesion). For example, the social cohesion factor was found to be one of the key variables of local community adoption of conservation agriculture in Laos (Lestrelin et al., 2012). However, contrary to our expectations, the number of neighbors known to be adopting Q-GAP did not have a significant relationship with adoption. Several hypotheses can be made about this counterintuitive result. First, social cohesion might be low in the agricultural zone we chose. Central plains are now cultivated by relatively larger farms and a substantial number of farms are managed by farmers that do not live permanently in the area and/or are passing orders to contracted labor. As a result, the farm-to-farm transmission is likely to be slower than expected. A second and more worrying interpretation for the Q-GAP program would be that farmers that did observe earlier adopters were not really convinced that it would fit their needs and constraints. Under such an assumption, farmer-to-farmer connections might not be efficient in spreading the program.

Labor availability is often affecting farmers' adoption decisions, especially for smallholders (White et al., 2005; Lee, 2005). Farmers adopting Q-GAP have to dedicate more time for rice cultivation. First, it requires recording all activities conducted on the farm and encourages some practices that are likely to substitute time-saving but potentially polluting practices with more knowledge and time-intensive practices. For example, using less pesticides requires more pest monitoring of the rice fields. Not surprisingly the variable labha (ie, the amount of family labor available for rice farming per ha) has a positive and highly significant effect on both adoption and training attendance. Contrary to the variable group, the influence of labha is mainly a direct effect, as the time constraints are more likely to be important once Q-GAP has been adopted. Contrary to our expectations, off-farm opportunities have a positive effect on adoption. However, this relation is not significant and cannot really be commented on.

Not all perception variables could be included in the analysis because of collinearity issues: for example, farmers who anticipated some cost-reduction potential before adopting Q-GAP were also anticipating better market access for their certified products. Among the different perceptions elicited during the interviews, we retained only the anticipations farmers had about the cost-reduction potential of Q-GAP. The relationship between expected cost reduction and adoption was both strong (25% probability increase) and significant, meaning farmers who adopted were really convinced that adopting Q-GAP would reduce their expenditures, possibly through a more rational use of chemical inputs. A nontested hypothesis here is that participation in Q-GAP could be associated by farmers with dedicated external advice leading to their more efficient use of inputs. Although not included in the model, adopters' expectations were probably high in terms of access to new markets, and price mark-up (because these variables are positively correlated to the variable on cost-reduction perceptions). We found a positive relationship between farmers attending training sessions and their perceived negative impact on the environment, but it is difficult to decide on the “direction” of the relationship. However, farmers' perception of negative impact on the environment did not translate into a significant effect on adoption.

The coefficient for training is giving unexpected results as we were expecting that farmers having attended a dedicated presentation about Q-GAP would be more likely to adopt. In fact, both individual probit and RBPM models are showing a negative relationship between training attendance and Q-GAP adoption. This should not be immediately interpreted as a sign of poorly conducted training (although this cannot be ruled out). An equivalently possible interpretation is that farmers are forming some positive expectations from the different contacts they had (justifying their training attendance) but are actually disappointed once they understand clearly the costs and benefits associated with Q-GAP adoption. On the one hand, farmers may be expecting higher “costs” in terms of labor requirements, which may prevent larger landholders from adopting (identified by the variable labha). On the other hand, some farmers may not be convinced about the potential benefits presented to them; as the government agencies are not responsible for the marketing of the Q-GAP rice, they can only suggest that the rice produced under Q-GAP will be more attractive, but cannot guarantee it. In the same way, farmers may not be confident in the capacity of the new practices in reducing production costs (for example, by using less pesticides or different types of fertilizers). In other words, farmers may well be interested by the general concept of Q-GAP but may ultimately make a rational decision related to labor issues as we showed earlier. Finally, one should also note that the negative impact is relatively small (− 5% for farmers attending the training).

Read full chapter

URL: https://www.sciencedirect.com/science/article/pii/B9780128054543000189

Statistical Methods and Error Handling

Richard E. Thomson, William J. Emery, in Data Analysis Methods in Physical Oceanography (Third Edition), 2014

3.14.1 Significance Levels and Confidence Intervals for Correlation

One useful application of null hypothesis testing is the development of significance levels for the correlation coefficient, r. If we take the null hypothesis as r = ro, where ro is some estimate of the correlation coefficient, we can determine the rejection region in terms of r at a chosen significance level α for different degrees of freedom (N − 2). A list of such values is given in Appendix E. In that table, the correlation coefficient r for the 95% and 99% significance levels (also called the 5% and 1% levels depending on whether or not one is judging a population parameter or testing a hypothesis) are presented as functions of the number of degrees of freedom.

For example, a sample of 20 pairs of (x, y) values with a correlation coefficient less than 0.444 and N − 2 = 18 degrees of freedom would not be significantly different from zero at the 95% confidence level. It is interesting to note that, because of the close relationship between r and the regression coefficient b1 of these pairs of values, we could have developed the table for r values using a test of the null hypothesis for b1.

The procedure for finding confidence intervals for the correlation coefficient r is to first transform it into the standard normal variable Zr as

(3.113)Zr=12[1n (1+r)−1n(1−r)]

which has the standard error

(3.114)σz=1(N−3)1/2

independent of the value of the correlation. The appropriate confidence interval is then

(3.115)Zr−Zα/2σz<Z<Zr+Zα/2σz

which can be transformed back into values of r using Eqn (3.113).

Before leaving the subject of correlations we want to stress that correlations are merely statistical constructs and, while we have some mathematical guidelines as to the statistical reliability of these values, we cannot replace common sense and physical insight with our statistical calculations. It is entirely possible that our statistics will deceive us if we do not apply them carefully. We again emphasize that a high correlation can reveal either a close relationship between two variables or their simultaneous dependence on a third variable. It is also possible that a high correlation may be due to complete coincidence and have no causal relationship behind it. The basic question that needs to be asked is “does it make sense?” A classic example (Snedecor and Cochran, 1967) is the high negative correlation (−0.98) between the annual birthrate in Great Britain and the annual production of pig iron in the United States for the years 1875–1920. This high correlation is statistically significant for the available N − 2 = 43 degrees of freedom, but the likelihood of a direct relationship between these two variables is very low.

Read full chapter

URL: https://www.sciencedirect.com/science/article/pii/B978012387782600003X

Numerical Ecology

Pierre Legendre, Louis Legendre, in Developments in Environmental Modelling, 2012

3 — Tests of statistical significance

In correlation analysis, the null hypothesis H0 is usually that the correlation coefficient is equal to zero (i.e. independence of the descriptors). One can also test the hypothesis that ρ has some particular value other than zero. The general formula for testing correlation coefficients (for H0: ρ = 0) is:

(4.39)F=rjk2/v1(1-rjk2)/v2

with v1 = m and v2 = n – m – 1, where m is the number of variables correlated to j. This F-statistic is compared to the critical value Fa[v1,v2]. In the case of the simple correlation coefficient, where m = 1 (there is a single variable correlated to j), eq. 4.39 becomes eq. 4.12.

In regression analysis, the null hypothesis is that the coefficient of multiple determination (R2) is zero. To test the coefficient of multiple determination R2 and the multiple correlation coefficient R, the F-statistic is:

(4.40)F= R1.2…p2/v1(1 -R1.2…p2)/v2

with v1 = m and v2 = n – m – 1, where m is the number of explanatory variables; m = p – 1 in the notation of eq. 4.40.

Partial correlation coefficients are tested in the same way as coefficients of simple correlation (eq. 4.12 for the F-test and eq. 4.13 for the t-test, where v = n – 2), except that one additional degree of freedom is lost for each successive order of the coefficient, or each covariable in the model. For example, the number of degrees of freedom for rjk.123 (third-order partial correlation coefficient) is v = (n – 2) – 3 = n – 5.

This is the same as counting v = n – m – 1, where m is the number of variables in the model besides j. For partial correlations, eqs. 4.12 and 4.13 become respectively:

(4.12)F=vrjk.1…p21- rjk.1…p2

and

(4.13)t=vrjk.1…p1-rjk.1…p2

The number of covariables will be called q in Subsections 10.3.5 and 11.1.7 which describe, respectively, the tests of significance in partial regression and partial canonical analysis. Semipartial correlation coefficients are tested using the same Fstatistic as for partial correlations, as shown in Box 4.1. As usual (Sections 1.2 and 4.2), H0 is tested either by comparing the computed statistic (F or t) to a critical value found in a table for a predetermined significance level α, or by computing the probability associated with the computed statistic.

Read full chapter

URL: https://www.sciencedirect.com/science/article/pii/B9780444538680500046

Postanalysis in Adjustment Computations

Bashar Alsadik, in Adjustment Models in 3D Geomatics and Computational Geophysics, 2019

12.2 Goodness of Fit Test

Goodness of fit test is an important statistical test that uses chi-squared χ2 distribution in hypothesis testing of adjusted observations and postanalysis.

Every hypothesis test requires preparation of a null hypothesis (H0) and an alternative hypothesis (Ha). If one hypothesis is true, the other must be false and vice versa. The null hypothesis H0 means [57,58]:

The observations are similar.

There is no significant correlation between the observed variables.

The observations are normally distributed.

On the other hand, the alternative hypothesis Ha means:

The observations are different.

The correlation is significant between the observations.

The data are not normally distributed.

Two main hypothesis tests are applied with χ2:

When goodness of fit is successfully passed, this indicates that the adjustment of observations problem is well applied, and no errors exist. Statistically, we accept the null hypothesis and reject the alternative hypothesis:

(12.1)H0:σˆo2=σo2

where

σo2: the prior value of the variance (before adjustment) and usually assumed 1.

σˆo2: the posterior value of the variance (after adjustment).

When goodness of fit test fails, this indicates an error in the adjustment procedure such as blunder existence or improper weights assigned to observations. Statistically, we reject the null hypothesis and accept the alternative hypothesis as:

(12.2)Ha:σˆo2≠σo2

The χ2 value can be computed as follows:

(12.3)χ2=rσˆo2σo2

where r is the redundancy in observations.

The idea of the test is to statistically compare the distribution (quality) of observations before and after adjustment and determine whether the sample data are consistent with a hypothesized distribution. To apply the goodness of fit test, we should specify the significance level α. Frequently, researchers choose significance levels equal to 0.01 (confidence of 99%) or 0.05 (confidence of 95%).

After computing the value in Eq. (12.3), we compare the χ2 computed value to the χ2 tabular value at a specified significance level as shown in Eq. (12.4):

(12.4)χ2r1−α2<rσˆo2σo2<χ2rα2

Then the test indicates a passed value, and the null hypothesis H0 is accepted.

On the other hand, if:

(12.5)rσˆo2 σo2<χ2r1−α2rσˆo2σo2>χ2rα2

Then the goodness of fit test fails, and the alternative hypothesis Ho is rejected. In statistics, this kind of test is also called the two tails test (Fig. 12.2). It should be mentioned that, in geomatics, the adjustment tests are only involved with the right tail of the distribution. So, when rσˆo2σo2>χ2r α2, Ho is rejected and Ha is accepted.

3. what hypothesis states no equality or existence of differences, relationship, or effect? *

Fig. 12.2. Goodness of fit rejection and acceptance levels at 95% confidence.

Accordingly, in this section we briefly presented the goodness of fit test using χ2 distribution to indicate a generic impression about the adjustment process. However, this test does not specify which observation has failed or caused the test to fail, and should be adapted by removal, remeasuring, or reweighing. The following sections will discuss the detection and removal of blundered observations.

Read full chapter

URL: https://www.sciencedirect.com/science/article/pii/B978012817588000012X

What is the hypothesis of states of equality?

For two samples, the null hypothesis states that the difference between the mean/medians of the populations is equal to a hypothesized value (0 indicating no difference), against the alternative hypothesis that it is not equal to (or less than, or greater than) the hypothesized value.

What is the hypothesis that shows equality or no significant difference?

A null hypothesis is a type of conjecture in statistics that proposes that there is no difference between certain characteristics of a population or data-generating process.

What type of hypothesis that states no relationship between variables?

The null hypothesis is a typical statistical theory which suggests that no statistical relationship and significance exists in a set of given single observed variable, between two sets of observed data and measured phenomena.

Which type of hypothesis states that there is a relationship a change or a difference between the two variables of interest?

The alternative hypothesis states that there is a relationship between the two variables being studied (one variable has an effect on the other).