Threat Detection – ML for Cloud Security

4 million logins per day, 50% fewer false alarms – machine learning for cloud security.

Industry Industry & Enterprise

Services Data Engineering

Period 2021

Challenge

Cloud services are constantly under attack. Static rules for login detection produce too many false alarms and unnecessarily burden security teams. With over 4 million login requests daily, a smarter solution was needed that could distinguish real threats from harmless anomalies.

Solution

Replacement of static detection rules with a flexible Gradient Boosting model
Intensive feature engineering phase to identify optimal model characteristics
Development of efficient low-level data connectors with memory optimizations and parallelization strategies
Processing of terabytes of login data from AWS Data Lakes
Integration into the customer's live production system

Our Contribution

Analysis of existing data (regular logins, confirmed account takeovers)
Development of ground truth as data basis for training
Development of the machine learning classification model (Gradient Boosting)
Intensive feature engineering phase to identify optimal model characteristics
Development of specialized data connectors for efficient processing of terabytes from AWS Data Lakes
Testing and validation of the model

Technologies

Python Gradient Boosting AWS Big Data

Results

50–60% fewer false positives at 4 million login requests daily. The security team can focus on real threats.

Back to all projects Get in touch