In-Database Analytics
Why move the data to the analytics when you can move the analytics to the data?
Analyzing large volume of data presents numerous challenges it is time consuming, very expensive and requires management of complex technology infrastructure. And yet, analyzing large volume of data is what businesses need most desperately today! In traditional approaches for analyzing data, end-users must move data into memory for processing. This activity often accounts for up to 75% of the cycle time and imposes severe constraints on delivery of results. In addition, the client or server where the processing is done must have enough memory to store the data and intermediate results. Large unwieldy and expensive solutions have been devised and implemented using this approach, but there is a more elegant solution by taking the analytics to the data instead.
Introducing DB Lytix™
DB Lytix is an in-database computation engine that allows you to perform analytics within your database, which means you don't have to move the data from storage to memory and therefore, you don't have to build large computational infrastructure with servers, blades and load balancing software. Just deploy DB Lytix™ on your database, and you instantly have the ability to perform advanced analytics through simple SELECT and EXECUTE statements.
Using DB Lytix™ has multiple advantages:
- Rich repository of functions: DB Lytix™ contains a very rich collection of functions. From simple statistical analysis to Monte Carlo simulations to pattern recognition, DB Lytix™ provides the required functions to do it all. The DB Lytix™ library consists of the following:
- Mathematical and Statistical functions
- Monte Carlo Simulations- univariate and multivariate
- Data Mining & Pattern Recognition Principal Component Analysis, Linear Regression, Logistic Regression, Other supervised learning methods, and Clustering
- Ease of Use: End-users and programmers can perform complex data mining and business analytics by using conventional SQL statements (SELECT and EXECUTE stored procedures). The functions can be embedded in other functions and stored procedures, and can be invoked using ODBC calls. The simplicity of use makes DB Lytix™ very attractive for the end-users.
- Time-to-Action: The very paradigm of DB Lytix™ helps you gain efficiency in analysis, saves time and cost, and enables you to provide answers to the decision makers in minutes. Just imagine being able to perform pair-wise correlation and Eigen Value decomposition of a 1000 x 1000 matrix in less than 3 minutes!
- Affordable: You only need an enterprise license for DB Lytix™ instead of seat licenses as with other analytical software products. Also, since DB Lytix™ is deployable through simple SELECT statements, you do not need additional software to leverage the full potential of the library.
Who can Benefit from DB Lytix™?
- Quantitative Analysts
If you are already building models, you have felt the pain of transporting data from storage to the analytical units and must have always wished that things could be better. DB Lytix™ is the ideal solution for you. Our library can be used to replicate your existing solutions and help build new ones with dramatically improved performance. In essence, if you want to compete on analytics, DB Lytix™ is your tool of choice.
- Application Developers
If you are an application developer or systems integrator who would like to embed analytical functions in your product, we can assist you in creating specific solutions desired by your clients. By embedding analytical functions such as forecasting and behavioral segmentation of customers, you can differentiate your product from your competitors and create value for your customers by giving them the quantitatively driven edge.
- Database Vendors
If you are a database vendor, we can port our library efficiently into your platform and help your customers and partners greatly improve processing and analytic performance. By providing a rich set of functions, you can help your customers and partners accelerate development and improve their analytical capabilities manyfold.
For all customers, Fuzzy Logix will provide education, support and expert consultation.
Netezza
DB LYTIX™ and Netezza
Massively parallel architecture and access to a bit of "secret sauce" allow DB Lytix™ to run amazingly fast on the Netezza platform. The reason for this incredible performance can be understood after exploring a bit of the Netezza lexicon.
AMPP™(SMP and MPP) and SPUs
Netezza's AMPP™(Asymmetric Massively Parallel Processing) architecture marries a SMP (Symmetric Multiprocessing) host with MPP (Massive Parallel Processing). As programs are executed, the host compiles the code and distributes queries across individual SPUs to run in parallel. Once distributed, the SPUs (Snippet Processing Units) perform the processing. When the processing is complete, the SMP host aggregates the results.
The SPU is a complete computer with a processor, disk storage, memory and a key piece of the "secret sauce" of Netezza. Depending on the system, each Netezza appliance will have between 112 and 896 SPUs operational at any one time and allow you to store and process between 12.5TB and 100TB of data.
By using the combination of SMP Host and SPUs, parallelized SQL queries execute with blazing speed.
FPGA
Another acronym introduced by Netezza to the data warehouse world is FPGA (Field Programmable Gate Array). FPGAs are semiconductor chips that can be programmed and have been used for a wide variety of applications for many years. Netezza is the first and only company to leverage their power to process streaming data in a data warehouse appliance. This is the "Secret Sauce."
In traditional systems, all the data for a query is moved and then the "where" clause is processed. With Netezza, instead of moving a huge set of data, the FPGA processes the "where" clause as data streams off of the disk, so only the data needed for processing is moved to the next step.
UDx (UDF and UDA)
To access the power of the architecture, Netezza allows users to write UDFs (User Defined Functions) and UDAs (User Defined Aggregates) both known as UDxs. UDA's return one value for multiple rows, whereas UDF's return one value per row. UDA's and UDF's can be accessed through simple SELECT statement and UDF's can be embedded within UDA's and UDF's. Also, it is possible to embed UDA's within UDF's as arguments and vice versa.
With UDxs, Netezza allows you to write processing logic, such as data mining algorithms, to the SPUs. Netezza calls this OnStream™processing. By implementing this and other technologies, Netezza achieves speeds which are 10-100 times faster than traditional solutions
(AMPP™+ SPUs + FPGA + UDx) + DB Lytix™ = Stunning performance
By combining the Asymmetric MPP™architecture with SPUs and FPGAs, and then giving access to the SPUs via UDAs and UDFs, Netezza gives users a strong platform to achieve very high processing performance.
Pairing the platform with the DB Lytix™ library leverages the architecture of Netezza to its fullest extent and delivers results with stunning speed and accuracy.
Sybase
DB Lytix on Sybase IQ 15.1
Take an intelligent, innovative and industry leading column-oriented database on a grid based platform and embed a library of quantitative and analytical functions in the database kernel and you create an affordable, eco-friendly and manageable data mining solution that produces results with incredible speed and accuracy.
In-database Analytics on Sybase IQ - The Intelligent Columnar Database
Sybase IQ 15.1 is the only high performance column-based analytics server that supports hundreds of statistical and data mining functions, executed completely within the database while supporting a large number of concurrent users at high performance levels. With our library of quantitative functions, DB Lytix, running on Sybase IQ:
- Data is processed and analyzed completely within the database
- Analytical models can be developed using standard SQL
- Analytics can be applied to the latest and large data sets
- In-database infrastructure is extensible for adding new algorithms
Sybase IQ is called the intelligent database because of the smart technology that is applied to the columnar structure. Over the years, Sybase has integrated intelligent features that further enhance performance and reduce maintenance overhead. These features include: use of symmetric multiprocessing and parallelism, giant page sizes, efficient optimization and indices, a unique tokenization approach, push down joins, multiple writer nodes and ability to support high concurrency. For a detailed overview of these features, download
'DB Lytix on Sybase IQ 15.1'.
Traditional approaches to data analysis require users to transport data out of the database for processing. This data movement can account for up to seventy-five percent of the cycle time and can impose severe constraints on delivery of results. Balancing the need to process large volumes of data, with high throughput and accuracy has always been a challenge. When data volume, accuracy, and processing time are all critical success factors, the conventional wisdom driven by traditional analytics approaches has been "pick any two" (or sometimes just one).
By working closely with engineering teams at Sybase we have moved the analytics to the data and have embedded various quantitative algorithms in the kernel of the IQ platform. With this new in-database analytics capability, Sybase IQ 15.1 provides a platform that can achieve all three objectives simultaneously. What this means is that when you perform analysis, our algorithms fully exploit all of the intelligent features of Sybase IQ in the most expedient way possible.
To demonstrate the awesome power available, we've developed a number of uses cases including:
- Telecom churn analysis and price optimization
- Banking portfolio analysis, risk management and customer retention
- Life Sciences preventative care and drug trail analysis
- Marketing campaign analysis and program ratings
Each of the use cases represents a small sample of the types of solutions we can provide using our library of analytical functions on Sybase IQ 15.1.
Sybase IQ's smart technology combined with in-database analytics ensures extremely fast answers when running analytics and predictive models. Moreover, the system can economically scale and support many concurrent users. Leveraging our certified library of in-database analytics ensures a consistent and manageable analytics environment. Sybase IQ truly enables a new generation of analytics capability for a new generation of analytics users.
Please contact us to discuss how our solutions improve business performance.