Interpolation is a large topic and there are many techniques. I agree that the GIS or math forum might be better versed in this, but for completeness, I'll post some thoughts here.
The choice of method depends on the type of data you are using, and the implementation depends on your environment. GIS software usually offers a number of methods and parameters to adjust. It's also rather easy to write a script in e.g. Python or MATLAB to test and evaluate interpolation methods (e.g. this).
There are a few questions to answer before you choose your method.
Wikipedia has an overview, and Caruso, C., and F. Quarta, "Interpolation methods comparison" gives a rather good introduction to the techniques; there are also some good web pages on GIS and interpolation.
The most important step is to test the method you choose. Again, there are various ways to do so, but the easiest is to simply leave points out from your data set and interpolate the values without them. Try to make the interpolated values as close to the data values as possible.
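A minimal sketch of that leave-out test in Python (the scattered 2D test data and the use of scipy.interpolate.griddata are my own assumptions, just to show the idea):

```python
import numpy as np
from scipy.interpolate import griddata

# Made-up scattered 2D samples of some field z = f(x, y)
rng = np.random.default_rng(0)
pts = rng.uniform(0.0, 10.0, size=(200, 2))
vals = np.sin(pts[:, 0]) * np.cos(pts[:, 1])

def loo_rms(method):
    """Leave-one-out test: drop each point, re-interpolate it from the rest."""
    errs = []
    for i in range(len(pts)):
        mask = np.arange(len(pts)) != i
        est = griddata(pts[mask], vals[mask], pts[i:i + 1], method=method)[0]
        if not np.isnan(est):               # skip points outside the convex hull
            errs.append(est - vals[i])
    return np.sqrt(np.mean(np.square(errs)))

for m in ("nearest", "linear", "cubic"):
    print(m, loo_rms(m))
```

The method with the lowest leave-one-out error on your own data is usually the one to prefer.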
Kriging methods are based on the statistics of the actual data rather than on a fitted curve. Kriging is often a good first try for spatial data, but also in other applications like this. Spline methods produce smoother curves. People doing computer graphics are experts on this, but there are also applications where you'd expect smooth changes in a model.
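To illustrate the smoothness point, here is a toy 1D comparison of a cubic spline against plain linear interpolation (the sample values are made up):

```python
import numpy as np
from scipy.interpolate import CubicSpline, interp1d

x = np.arange(6.0)                               # sample positions
y = np.array([0.0, 0.8, 0.9, 0.1, -0.8, -1.0])   # toy sample values

xs = np.linspace(0.0, 5.0, 201)
linear = interp1d(x, y)(xs)      # piecewise linear: continuous, but kinked at the samples
smooth = CubicSpline(x, y)(xs)   # cubic spline: continuous first and second derivatives
```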
Naturally, more complex methods (e.g. kriging) take longer to process, while simpler algorithms (e.g. bilinear or nearest-neighbor) are faster. A lower resolution of your output raster (array) would also speed up the process. You can let the resolution depend on the variance of the data, so that you interpolate fewer points over a flat surface and more points over a surface with higher relief. These techniques are common practice in signal processing, and the experts are here.
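As a rough sketch of that variance-dependent resolution idea (the block size and thresholds here are arbitrary choices of mine, not an established recipe):

```python
import numpy as np

def step_per_block(z, block=10, coarse=4, fine=1):
    """Pick an output sampling step for each block of a 1D profile:
    blocks with below-average variance get the coarse step,
    rougher blocks get the fine step."""
    overall = np.var(z)
    steps = []
    for i in range(0, len(z), block):
        local = np.var(z[i:i + block])
        steps.append(fine if local > overall else coarse)
    return steps

# Flat start, bumpy end: early blocks get the coarse step, later blocks the fine one.
z = np.concatenate([np.zeros(50), np.sin(np.linspace(0.0, 20.0, 50))])
print(step_per_block(z))
```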
Geostatistics was initially developed by Georges Matheron, a French mathematician and geologist - though when I first heard about him in the 1980s he was described as a mining engineer.
Matheron developed geostatistics to find a way to determine ore reserves more accurately. His other aim was to attach a level of accuracy to ore reserve estimates (i.e. x ounces of gold at a confidence of y%). Since the initial development of geostatistics in the 1960s for ore reserves in mining, the technique has been used in other fields where there is a spatial correlation between data samples, such as forestry, pollution contamination around metal smelters, soil science, geochemistry, etc.
One of the key tools of geostatistics is the variogram. The variogram is used to determine the distance at which samples can be regarded as being independent of each other.
The mathematics for variograms can look bewildering, but variograms can be explained simply. Consider a line of 100 equally spaced sample points, say 1 metre apart.
For samples 1 m apart, take every pair of samples with that separation and calculate the variance of the differences between them; then repeat the calculation for samples 2 m apart, 3 m apart, and so on.
The distance between samples is known as the lag distance. On a plot of variance versus lag distance, for low lag values the curve will rise. Where the curve flattens out is the distance at which samples can be regarded as being independent of each other. The variogram has its roots in correlation diagrams.
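A bare-bones sketch of that calculation in Python (a toy correlated profile, not a real data set; the experimental semivariance at each lag is half the mean squared difference between all sample pairs that distance apart):

```python
import numpy as np

# Toy profile: 100 equally spaced samples, 1 m apart, with some spatial correlation
rng = np.random.default_rng(1)
z = np.cumsum(rng.normal(size=100)) * 0.1

def semivariance(z, lag):
    """gamma(h) = 0.5 * mean of (z[i + h] - z[i])**2 over all pairs h apart."""
    d = z[lag:] - z[:-lag]
    return 0.5 * np.mean(d ** 2)

for h in range(1, 21):                  # lag distances 1 m .. 20 m
    print(h, round(semivariance(z, h), 4))
# Plotting these values against lag gives the rising-then-flattening curve described above.
```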
For data on a 2D grid, it is prudent to compute a suite of variograms along the directions north-south, east-west, northwest-southeast and northeast-southwest to obtain an initial variance rosette. Any trend in the data will show up in this, and a better variogram orientation can then be tried.
Where the distance between samples is not strictly uniform, but within accepted tolerances, adjustments will need to be made to the data. Also, if the data is clustered within areas, it will need to be declustered first.
When data points are being considered, it is common for a search ellipse to be used and for that ellipse to be oriented. For example, if the data set is soil contamination samples around a metal smelter, or seed dispersal from trees, the search ellipse would be oriented so that its major axis is aligned with the direction of the prevailing winds.
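A minimal way to express an oriented search ellipse in code (the angle, axis lengths and point data here are purely illustrative assumptions):

```python
import numpy as np

def in_search_ellipse(points, centre, major, minor, angle_deg):
    """Mask of points inside an ellipse centred at `centre`, with semi-axes
    `major` and `minor`, and the major axis rotated `angle_deg` from the
    x-axis (e.g. aligned with the prevailing wind direction)."""
    theta = np.radians(angle_deg)
    d = points - centre
    u = d[:, 0] * np.cos(theta) + d[:, 1] * np.sin(theta)    # along major axis
    v = -d[:, 0] * np.sin(theta) + d[:, 1] * np.cos(theta)   # along minor axis
    return (u / major) ** 2 + (v / minor) ** 2 <= 1.0

pts = np.random.default_rng(2).uniform(-10.0, 10.0, size=(500, 2))
mask = in_search_ellipse(pts, centre=np.array([0.0, 0.0]),
                         major=8.0, minor=3.0, angle_deg=30.0)
neighbours = pts[mask]    # only these samples would enter the estimate at the centre
```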
Because of the number of calculations involved and the size of some data sets, particularly metal grades in geological ore bodies, computers are heavily involved in geostatistical analyses.
An initial reference on geostatistics is Isobel Clark, 1979, Practical Geostatistics, Applied Science Publishers.
If you want to estimate a value between sample points, some form of weighting of the samples will need to be used. Within geostatistics, kriging is usually used, and there are many forms of kriging, to name just a few: ordinary, disjunctive, log-normal, indicator. What some people also do is use other forms of weighting such as inverse distance squared, or inverse distance to another power, usually between 1.5 and 3, though inverse distance squared is the most common of these. The results from kriging and inverse distance squared are then compared. The two techniques will rarely give the same results, but the results should be similar.
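For the inverse distance part, a bare-bones sketch (the power and the toy data are my own choices, not anything canonical):

```python
import numpy as np

def idw(sample_xy, sample_z, query_xy, power=2.0, eps=1e-12):
    """Inverse distance weighting: each sample is weighted by 1 / distance**power."""
    d = np.linalg.norm(sample_xy[None, :, :] - query_xy[:, None, :], axis=2)
    w = 1.0 / (d ** power + eps)          # eps avoids division by zero on a sample point
    return (w @ sample_z) / w.sum(axis=1)

rng = np.random.default_rng(3)
sx = rng.uniform(0.0, 10.0, size=(50, 2))
sz = np.sin(sx[:, 0]) + np.cos(sx[:, 1])
queries = np.array([[5.0, 5.0], [2.5, 7.5]])
print(idw(sx, sz, queries, power=2.0))    # try power=1.5 or 3 as mentioned above
```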