Product Overview of OmniSci
OmniSci, since rebranded as HEAVY.AI, is an analytics platform built to query and visualize large, high-velocity datasets using GPU acceleration. This overview covers what OmniSci does and its key features:
What OmniSci Does
OmniSci is marketed as the world’s fastest SQL engine, letting users query and visually explore large datasets interactively. It is optimized to process billions of rows in milliseconds, which suits workloads involving massive, high-velocity data. The platform integrates with existing CPU-based analytics systems and uses the parallel computing power of both CPUs and GPUs, so data can be queried at full resolution without pre-aggregation or downsampling.
Key Features and Functionality
Core Analytics Engine
- OmniSci Core: An open-source, SQL-based query engine that can process billions of rows in milliseconds. Its fast ingestion makes it well suited to high-velocity data environments.
Geospatial Analysis
- Native Geo Data Types: OmniSci supports native geospatial data types such as POINT, LINESTRING, POLYGON, and MULTIPOLYGON. This allows geospatial analysts to query and visualize millions of shapes on geo charts with millisecond response times.
- Geospatial Queries and Visualization: Analysts can perform complex geospatial queries, including distance calculations and shape containment checks, and visualize results interactively using backend polygon rendering.
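To make the two query types named above concrete, here is a minimal pure-Python sketch of a distance calculation and a shape containment check. It is illustrative only and is not OmniSci's implementation: the function names `haversine_km` and `point_in_polygon` are hypothetical, the distance uses the haversine formula on a spherical Earth, and containment uses the standard ray-casting test on a single polygon ring.

```python
import math


def haversine_km(lon1, lat1, lon2, lat2):
    """Great-circle distance in kilometres between two lon/lat points.

    Assumes a spherical Earth with a mean radius of 6371 km.
    """
    r = 6371.0
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2)
    return 2 * r * math.asin(math.sqrt(a))


def point_in_polygon(x, y, ring):
    """Ray-casting containment test.

    `ring` is a list of (x, y) vertices; the last vertex is implicitly
    connected back to the first.
    """
    inside = False
    n = len(ring)
    for i in range(n):
        x1, y1 = ring[i]
        x2, y2 = ring[(i + 1) % n]
        # Only edges that straddle the horizontal line through y can be crossed.
        if (y1 > y) != (y2 > y):
            # x coordinate where the edge crosses that horizontal line.
            x_cross = x1 + (y - y1) * (x2 - x1) / (y2 - y1)
            if x < x_cross:
                inside = not inside
    return inside


unit_square = [(0, 0), (1, 0), (1, 1), (0, 1)]
print(point_in_polygon(0.5, 0.5, unit_square))
print(point_in_polygon(2.0, 0.5, unit_square))
print(round(haversine_km(0, 0, 0, 1), 1))
```

A GPU engine would evaluate checks like these across many rows in parallel; the sketch only shows the per-row logic.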
Data Science and Machine Learning
- Integrated Data Science Capabilities: Data scientists can move between visual analytics and deeper, machine-learning-driven exploration. OmniSci integrates with the PyData stack, including JupyterLab, Ibis (a pandas-like API), and Altair for interactive data visualization.
- User-Defined Functions: The platform supports user-defined functions (UDFs), which let users add custom extensions and support ML/AI workflows.
Visual Analytics and Data Fusion
- OmniSci Immerse: A browser-based visualization client that supports instant cross-filtering and real-time querying across multiple chart types, including point maps, heat maps, choropleths, and scatterplots.
- Data Fusion: OmniSci 5.0 introduces an integrated Data Catalog that allows users to import external public or partner datasets and visually join them in geospatial or time series charts.
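The cross-filtering idea mentioned for Immerse can be sketched in a few lines of Python. This is a conceptual illustration under stated assumptions, not Immerse's code: the `cross_filter` helper and the sample `rows` are hypothetical, and the sketch assumes the common convention that a selection made on one chart filters every other chart but not the chart it was made on.

```python
from collections import Counter

# Hypothetical sample data; each dict is one row of a table.
rows = [
    {"region": "west", "category": "a"},
    {"region": "west", "category": "b"},
    {"region": "east", "category": "a"},
    {"region": "east", "category": "a"},
    {"region": "west", "category": "a"},
]


def cross_filter(rows, selections, group_by):
    """Count rows per `group_by` value for one chart.

    `selections` maps a dimension name to the set of values selected on the
    chart for that dimension. Every selection is applied except the one on
    the chart's own dimension.
    """
    active = {dim: vals for dim, vals in selections.items() if dim != group_by}
    counts = Counter()
    for row in rows:
        if all(row[dim] in vals for dim, vals in active.items()):
            counts[row[group_by]] += 1
    return dict(counts)


# The user clicks "west" on the region chart.
selections = {"region": {"west"}}
by_category = cross_filter(rows, selections, "category")
by_region = cross_filter(rows, selections, "region")
print(by_category)
print(by_region)
```

Each selection change recomputes every chart from the underlying rows, which is why the approach depends on fast queries rather than pre-aggregated summaries.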
Performance and Scalability
- GPU-Driven: The platform uses GPUs to deliver lower latency than CPU-only solutions, especially in interactive visualization scenarios.
- High-Speed Ingestion: OmniSci can ingest millions of polygons per second, supporting multi-threaded import and handling complex polygon shapes efficiently.
Interoperability and Integration
- Extensive Integrations: The platform integrates with the PyData stack, JupyterLab, and other data science frameworks, which makes it usable by both business analysts and data scientists.
In summary, OmniSci converges analytics, data science, and location intelligence into a single workflow. Its fast queries over massive datasets, combined with geospatial analysis, visual analytics, and machine learning integrations, make it a fit for organizations working with big, high-velocity data.