Anomaly Detection in the Oracle Database Ecosystem Using Density Based Spatial Clustering (2024)

By Jim Prather, Cox Communications; Debanjali Battacharya, Genpact

In any large modern company, data has become the lifeblood of the organization and databases have become the beating hearts which supply that vital resource to every aspect of the company. Given that the failure of a single database, even for a short amount of time, can potentially lead to hundreds of thousands of dollars in lost revenue, it has become imperative to ensure complete reliability of every database within the ecosystem.

With hundreds to thousands of databases needing to be monitored, it has become increasingly difficult for Database Administrators (DBAs) to maintain adequate vigilance on every single database using standard monitoring techniques. Recently, companies have been turning to Machine Learning algorithms to “study” each database, determine if a database is displaying signs of distress, and then alert a DBA that action may be required on a given database.

One of the newest and most promising algorithms in use at Cox Communications is Density Based Spatial Clustering (DBScan). Fundamentally, the DBScan algorithm looks at groups of points which lie closer together (i.e. have a higher spatial density) and then assigns them to be in the same cluster. The process repeats until every data point has been assigned to a cluster, or else has been labelled an outlier.

It is these outliers, or anomalies, which may be harbingers of database problems.

Each night eight of the most important metrics, in five-minute increments, over the past thirty days of data are fed into the ML algorithm for each database. By using Principal Component Analysis, the data is converted from an eight-dimensional manifold to a three-dimensional surface and then used to create oneDBScan model per database. Given the trained model, whenever a new datapoint arrives, it is simply compared to the data in the pre-trained model to determine if the datapoint is “normal”, or if it is an anomaly which should be investigated further.

By operationalizing DBScan ML techniques on database monitoring data, database alerts have been accelerated by 15 minutes over existing monitors and decreased false positive alerts by a factor of six.

By clicking the "Download Paper" button, you are agreeing to our terms and conditions.

Similar Papers

Artificial Intelligence (AI)-Based Cross-Platform Node Name Mapping (Cmap-NN)
By Jordan Kupersmith, Cox Communications; Wei Cai, Cox Communications; Jeonpaolo Barvez, Cox Communications; Pavan Chandrashekar, Cox Communications; Richard Brown, Cox Communications
2023
Preventing Network Maintenance Collisions: Using Artificial Intelligence Models for Predicting Collisions in Planned Maintenance Activities
By Jordan Kupersmith, Cox Communications; Nate Bila, Cox Communications; Cherie Peirce, Cox Communications; Chase Durham, Cox Communications; Rob Arnold, Cox Communications
2024
Cox's Next Generation Serviceability and Location Based Intelligence Systems
By Sorna T. Dhanabalan, Cox Communications
2024
Upstream OFDMA Anomaly Detection and Triaging
By Jay Zhu & Karthik Sundaresan, CableLabs
2021
How Cox Communications Implemented an Expert System for Service-First Autonomous Operations
By Dave Norris, Cox Communications
2021
FTTH Distance and Density Considerations: Rural Broadband
By Brian Yarbrough, Cox Communications, Inc.; Chris Palmquist, Cox Communications, Inc.
2024
AI for IT Operations (AIOps) - Using AI/ML for Improving IT Operations
By Hongcheng Wang, Applied AI & Discovery, Comcast; Praveen Manoharan, Applied AI & Discovery, Comcast; Nilesh Nayan, Applied AI & Discovery, Comcast; Aravindakumar Venugopalan, Applied AI & Discovery, Comcast; Abhijeet Mulye, Applied AI & Discovery, Comcast; Tianwen Chen, Applied AI & Discovery, Comcast; Mateja Putic, Applied AI & Discovery, Comcast
2022
COX CPEONE Suite Now and in the Future!
By Judy Brown, Cox Communications; Matan Becker, Cox Communications
2024
Designing a Cloud-Based DOCSIS Time Protocol Calibration Database
By Roy Sun, Rahil Gandotra, Ph.D. & Mark Poletti, CableLabs, Inc.; Jennifer Andreoli-Fang, Ph.D., Amazon Web Services (AWS); Elias Chavarria Reyes, Ph.D., Hitron Technologies, Inc.; John Chapman, Cisco Systems, Inc.
2021
Modernizing Cox Communication’s Access and Aggregation Network Infrastructure for Remote PHY Deployment
By Deependra Malla, Cox Communications Inc.
2021
More Results >>