Searching data by specified parameters
Database exploration tools let the user quickly find the data of interest among the hundreds of thousands of items available in the integrated databases. Search parameters, such as name or time period, can be combined flexibly depending on the user's needs and the complexity of the search.
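As an illustration only, a search by name and time period could look like the sketch below. The SeriesInfo record, its fields and the search function are hypothetical names chosen for this example, not part of the tools described here.

```python
from dataclasses import dataclass
from datetime import date
from typing import Iterable, List, Optional


@dataclass(frozen=True)
class SeriesInfo:
    """Hypothetical catalogue entry; the field names are assumptions."""
    name: str
    start: date  # first observation available
    end: date    # last observation available


def search(catalogue: Iterable[SeriesInfo],
           name_contains: Optional[str] = None,
           period_start: Optional[date] = None,
           period_end: Optional[date] = None) -> List[SeriesInfo]:
    """Return entries matching every parameter that was supplied."""
    results = []
    for entry in catalogue:
        # Case-insensitive substring match on the name.
        if name_contains is not None and name_contains.lower() not in entry.name.lower():
            continue
        # Keep only entries whose data overlaps the requested period.
        if period_start is not None and entry.end < period_start:
            continue
        if period_end is not None and entry.start > period_end:
            continue
        results.append(entry)
    return results


catalogue = [
    SeriesInfo("Electricity demand", date(2010, 1, 1), date(2020, 12, 31)),
    SeriesInfo("Gas price", date(2015, 1, 1), date(2023, 6, 30)),
    SeriesInfo("Electricity price", date(2021, 1, 1), date(2023, 6, 30)),
]
hits = search(catalogue, name_contains="electricity", period_end=date(2020, 12, 31))
print([h.name for h in hits])  # ['Electricity demand']
```

Parameters left as None are ignored, so the same function covers both simple and more complex searches.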
Data reshaping and enhancement
Data reshaping and enhancement aims to bring the data into the form best suited to building and optimizing models of the analysed phenomena.
Features available as part of data reshaping and enhancement:
- Automatic conversion of time intervals – applied depending on whether the user builds models and analyses on daily, monthly, quarterly or other data.
- Completing missing data
- Data verification
- Functional transformation – so that models can use the data as easily as possible
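A minimal sketch of three of these steps follows, assuming linear interpolation for missing values, a monthly mean for interval conversion and a logarithm as the functional transformation. These particular methods and all function names are assumptions made for the example; the text above does not say which methods are used, and data verification is not shown.

```python
import math
from collections import defaultdict
from datetime import date


def fill_missing(values):
    """Fill None gaps by linear interpolation; gaps at the edges copy the nearest known value."""
    known = [i for i, v in enumerate(values) if v is not None]
    if not known:
        raise ValueError("series has no known values")
    filled = list(values)
    for i in range(len(filled)):
        if filled[i] is not None:
            continue
        left = max((k for k in known if k < i), default=None)
        right = min((k for k in known if k > i), default=None)
        if left is None:
            filled[i] = values[right]
        elif right is None:
            filled[i] = values[left]
        else:
            weight = (i - left) / (right - left)
            filled[i] = values[left] + weight * (values[right] - values[left])
    return filled


def to_monthly_mean(dates, values):
    """Convert a daily series to monthly means keyed by (year, month)."""
    buckets = defaultdict(list)
    for d, v in zip(dates, values):
        buckets[(d.year, d.month)].append(v)
    return {key: sum(vs) / len(vs) for key, vs in sorted(buckets.items())}


def log_transform(values):
    """Natural logarithm; only valid for strictly positive data."""
    return [math.log(v) for v in values]


dates = [date(2024, 1, 30), date(2024, 1, 31), date(2024, 2, 1), date(2024, 2, 2)]
raw = [10.0, None, 14.0, 16.0]

filled = fill_missing(raw)                # [10.0, 12.0, 14.0, 16.0]
monthly = to_monthly_mean(dates, filled)  # {(2024, 1): 11.0, (2024, 2): 15.0}
logged = log_transform(list(monthly.values()))
print(filled, monthly, logged)
```

Filling gaps before aggregating keeps a month from being averaged over fewer observations than it actually contains.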
Analysing a single time series
Features available as part of analysing a single time series:
- Calculating statistics, histograms and frequency distributions of time series
- Spectral analysis of time series – reveals the cycles present in a time series
- Decomposition of time series into:
  - Trend component – responsible for long-term changes of the time series
  - Seasonal component – responsible for short- or medium-term cyclical changes
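The spectral analysis and decomposition steps can be sketched as follows. This is an illustrative example under stated assumptions: it uses a plain discrete Fourier transform to find the strongest cycle and a classical additive decomposition (centred moving average for the trend, per-phase means for the seasonal component). The text above does not specify which methods are used, and the function names are hypothetical. The spectral step assumes the series has no strong trend; otherwise the trend should be removed first.

```python
import cmath
import math


def dominant_period(values):
    """Return the period (in samples) of the strongest non-constant DFT component."""
    n = len(values)
    mean = sum(values) / n
    centered = [v - mean for v in values]
    best_k, best_power = 1, -1.0
    for k in range(1, n // 2 + 1):
        coeff = sum(x * cmath.exp(-2j * math.pi * k * t / n)
                    for t, x in enumerate(centered))
        power = abs(coeff) ** 2
        if power > best_power:
            best_k, best_power = k, power
    return n / best_k


def decompose(values, period):
    """Additive split into a trend and a zero-mean seasonal component."""
    n = len(values)
    half = period // 2
    trend = [None] * n  # undefined near the edges of the series
    for t in range(half, n - half):
        window = values[t - half:t + half + 1]
        total = sum(window)
        if period % 2 == 0:
            # Even period: the window has period + 1 points, so half-weight both ends.
            total -= 0.5 * (window[0] + window[-1])
        trend[t] = total / period
    sums, counts = [0.0] * period, [0] * period
    for t in range(n):
        if trend[t] is not None:
            sums[t % period] += values[t] - trend[t]
            counts[t % period] += 1
    if 0 in counts:
        raise ValueError("series too short for this period")
    phase_means = [s / c for s, c in zip(sums, counts)]
    offset = sum(phase_means) / period
    seasonal = [phase_means[t % period] - offset for t in range(n)]
    return trend, seasonal


# Synthetic data: a 12-sample cycle, with and without a linear trend.
cycle = [3 * math.sin(2 * math.pi * t / 12) for t in range(48)]
series = [0.5 * t + c for t, c in enumerate(cycle)]

period = dominant_period(cycle)
trend, seasonal = decompose(series, round(period))
print(period, trend[10], seasonal[3])
```

On this synthetic series the detected period is 12 samples, and the recovered trend and seasonal values match the line and the sine wave used to build it, up to rounding error.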
Analysis of correlations between pieces of data
Correlation analysis usually aims to select a group of data that influences the analysed process or phenomenon. A mathematical model is then built from the selected data.
If the model is used for forecasting, the forecast is in effect based on the data selected during correlation analysis.
Features available as part of analysis of correlations between pieces of data:
- Searching for a data set related to the analysed phenomenon
- Searching for parallels at the level of spectral components: sifting through the database to find pieces of data that behave similarly over specified time frames, for example pieces of data correlated with each other at the level of daily or monthly changes
- Optimizing the information capacity of the selected data set. After optimization, the data set should:
  - Describe the analysed phenomenon as well as possible
  - Not include redundant (too similar to each other) pieces of data – replicating nearly identical pieces of data adds nothing to the data set, and needlessly enlarges the mathematical model and lengthens the time needed for optimization
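One simple way to picture the selection described above is a greedy filter based on the Pearson correlation coefficient: keep candidates that correlate well with the analysed phenomenon and drop those that nearly duplicate an already selected candidate. The sketch below is an assumption about how such a step could work, not the method used by the tools. The function names and both thresholds are hypothetical.

```python
import math


def pearson(x, y):
    """Pearson correlation coefficient; returns 0.0 if either series is constant."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    if sx == 0 or sy == 0:
        return 0.0
    return cov / (sx * sy)


def select_inputs(target, candidates, min_relevance=0.5, max_similarity=0.95):
    """Greedy selection: most relevant first, skipping near-duplicates.

    The thresholds are illustrative values, not recommendations.
    """
    relevance = {name: abs(pearson(series, target)) for name, series in candidates.items()}
    ranked = sorted(candidates, key=lambda name: relevance[name], reverse=True)
    selected = []
    for name in ranked:
        if relevance[name] < min_relevance:
            break  # every remaining candidate is even less relevant
        is_redundant = any(
            abs(pearson(candidates[name], candidates[kept])) > max_similarity
            for kept in selected
        )
        if not is_redundant:
            selected.append(name)
    return selected


target = [1, 2, 3, 4, 5, 6]
candidates = {
    "a": [2, 4, 6, 8, 10, 12],                  # strongly related to the target
    "a_copy": [2.1, 4.0, 6.1, 8.0, 10.1, 12.0], # nearly the same as "a"
    "b": [1, 3, 2, 5, 4, 6],                    # related, but not a duplicate
    "noise": [3, 1, 4, 1, 5, 2],                # weakly related
}
print(select_inputs(target, candidates))  # ['a', 'b']
```

The near-duplicate "a_copy" is dropped because it adds almost no information beyond "a", and "noise" is dropped because it is only weakly related to the target.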
Building and optimizing mathematical models
- Data sets obtained through correlation analysis are used to build and optimize models.
- Models may be purely descriptive, or may produce a forecast of the analysed phenomenon.
- Different objective functions can be defined for the optimization process. For example, the user may want the model with the minimum forecast error or the maximum correlation coefficient.
- Once current data is streamed into the optimized models, they serve as a source of knowledge or forecasts.
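To show how the choice of objective function can change the result, the sketch below optimizes a deliberately simple toy model that predicts the target from an input series shifted by a lag. The model, the data and all names are assumptions made for illustration and say nothing about the models actually built by the tools.

```python
import math


def mean_abs_error(actual, predicted):
    """Forecast error objective: lower is better."""
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)


def correlation(actual, predicted):
    """Correlation objective: higher is better; 0.0 if either series is constant."""
    n = len(actual)
    ma, mp = sum(actual) / n, sum(predicted) / n
    cov = sum((a - ma) * (p - mp) for a, p in zip(actual, predicted))
    sa = math.sqrt(sum((a - ma) ** 2 for a in actual))
    sp = math.sqrt(sum((p - mp) ** 2 for p in predicted))
    if sa == 0 or sp == 0:
        return 0.0
    return cov / (sa * sp)


def optimize_lag(x, y, lags, objective, maximize=False):
    """Try each lag of the toy model y[t] ~ x[t - lag] and keep the best score."""
    best_lag, best_score = None, None
    for lag in lags:
        predicted = x[:len(x) - lag]  # x[t - lag] for t = lag .. n - 1
        actual = y[lag:]
        score = objective(actual, predicted)
        if best_score is None or (score > best_score if maximize else score < best_score):
            best_lag, best_score = lag, score
    return best_lag, best_score


# Toy data: from t = 2 onward, y follows x with a delay of 2 steps plus a constant offset of 10.
x = [1, 3, 2, 5, 4, 6, 8, 7]
y = [10, 11, 11, 13, 12, 15, 14, 16]

by_error = optimize_lag(x, y, range(4), mean_abs_error)
by_corr = optimize_lag(x, y, range(4), correlation, maximize=True)
print(by_error, by_corr)
```

On this toy data the two objectives pick different lags: the correlation objective finds the true delay of 2, while the constant offset inflates the absolute error enough that the error objective prefers lag 0. This is why the objective function should match what the model will be used for.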