Next: `variogram'
Up: Command file syntax
Previous: Commands
  Contents
  Index
Subsections
`data'
The general form of the data command is
data( identifier): 'file', options ;
The file name should refer to an existing file in ascii table form,
simplified GeoEAS format or one of the supported grid map formats.
Options can be single keywords like log or expressions like
x=2. Column 0 means not defined (non-existent). The full
list of options is [default values between square brackets]:
- v=5
- column 5 contains the data (measurement) variable [0, or obtained from
grid map]
- x=1
-
column 1 contains the x-coordinate [0, or obtained from grid map]
- y=2
-
column 2 contains the y-coordinate [0, or obtained from grid map]
- z=3
-
column 3 contains the z-coordinate [0, or obtained from grid map]
- d=1
-
use a first order (polynomial) linear model in the coordinates as the
trend; allowed order values are 0, 1, 2 and 3; see also X,
sk_mean and b [0: only an intercept
as trend]
- mv=-1
-
define missing value as the value -1 [the string NA, see also
set mv]
- average
-
average values with identical locations (i.e., their separation distance
is less than zero) [noaverage]
- log
-
log transform the variable (natural logarithm) [no transform]
- I=5
-
transform the observation variable
to
[no transform]
- v=6, Category='sand'
-
transform the observation variable
to 1 if the string in column
6 equals the Category string sand, and to 0 in any other case.
[no transform]
- standard
-
standardise variable (to mean 0, variance 1) [do not standardise]
- X=8&9&x&y
-
apart from a default intercept, the values of the base functions at
the data locations are the variables that are in columns 8, 9 and the
x- and y-coordinate.
(Polynomial coordinate base functions allowed are:
x3 for
,
y3 for
,
z3 for
,
x2 for
,
y2 for
,
z2 for
,
x for
,
y for
,
z for
,
x2y for
,
xy2 for
,
x2z for
,
xz2 for
,
y2z for
,
yz2 for
,
xy for
,
xz for
and
yz for
,
provided that the corresponding coordinate is defined)
[use only an intercept (mean) as trend]
- X=-1&8&9
-
the values of the base functions at the data locations are on columns
8, 9 and 10, with no intercept [use only an intercept (mean) as trend]
- b=[2.4, 1.7, -3.9]
-
define the (known) regression coefficients, corresponding to the
X entries given. See also sk_mean; b
generalises the concept of a known constant mean to a known mean function.
[undefined; regression coefficients are unkonwn]
- V=6
-
column 6 contains the proportionality factor
to the residual
variance
of the v-variable (i.e. the diagonal
entries of matrix
,
see Ordinary and weighted least squares trend
prediction in section 2.7, and Appendix A.2). This
will have an effect on the least squares residuals (thus affecting the
sample covariogram and pseudo cross variogram), as well as on uncorrelated
least squares prediction, kriging prediction, and trend prediction. [0:
assuming identical variances or variances strictly derived from the
variograms]
- noresidual
-
do not calculate OLS (or GLS, see gls) residuals for sample
variogram or covariogram estimation (Appendix A.1). For
sample variogram estimation in absence of base functions, setting
noresidual will yield identical results, but will result in a modest
gain in speed and memory saving. In other cases, it will result in
the estimation of non-centred covariograms or pseudo-cross variograms
[calculate residuals before variogram or covariogram estimation]
- dX=0.1
-
include a pair of data points
,
for sample variogram
calculation only when
with
and
.
This allows
pooled estimation of within-strata variograms, or variograms of
(near-)replicates in a linear model (for point pairs having similar values
for regressors like depth, time, or a category variable)
[do not evaluate]
- radius=4.5
-
select observations in a local neighbourhood when they are within a
distance of 4.5 [large: select all] (see section 2.5)
- max=30
-
maximum number of observations in a local neighbourhood selection is 30
[large: no maximum] (see section 2.5)
- min=10
-
minimum number of observations in a local neighbourhood selection is 10
[0] (see section 2.5)
- omax=2
-
maximum number of observations per octant (3D), quadrant (2D) or secant
(1D) is 2 (this only works in addition to a radius set) [0: don't
evaluate] (see section 2.5, and also method: nr;,
section 4.5)
- square
-
select points in a square (or block) neighbourhood, with square (block)
sizes equal to 2
radius [circular (spherical) neighbourhood]
- vdist
-
use variogram value as the distance criterium for
min/max/omax neighbourhood selection (but
define radius as euclidian distance) [use Euclidean distance]
(see section 2.5)
- force
-
force neighbourhood selection to the minimum number of observations,
disregarding the distance [unless simple kriging is used, generate a
missing value if less than the required minimum number of observations
are found within a distance defined by radius] (see section
2.5)
- s=7
-
define variable with the strata for data() locations [0, no
strata]
- sk_mean=2.4
-
define the simple kriging mean to be 2.4 [not defined: an unknown
mean (intercept) is assumed for each variable]. NOTE: the code
sk_mean=2.4 is equivalent to b=[2.4]
- dummy
-
define a dummy variable [require valid data to be read]
Next: `variogram'
Up: Command file syntax
Previous: Commands
  Contents
  Index
Edzer Pebesma
1999-08-31