Rules vs. ML

On this page

Overview
Comparison
When to Use Rules
When to Use ML
The Hybrid Approach
Building Effective Rules
- Rule Anatomy
- Rule Hygiene
Building Effective ML
- Feature Engineering
- Model Considerations
Next Steps
Related Topics

Prerequisites

Before choosing detection approaches, understand:

Fraud types you're trying to detect
Fraud metrics and your current baseline
Available data for rule creation or ML training
Velocity rules as a starting point

TL;DR

Rules: Fast to deploy, interpretable, great for known patterns and regulatory requirements
ML: Better for complex patterns, scale, novel fraud. But needs data science expertise
Best approach: Hybrid. Hard rules (blocklists, OFAC) → ML scoring → soft rules (thresholds, overrides)
Start with rules: Use processor ML (Stripe Radar, Adyen Risk) until you have data science resources
Layer with velocity rules, device fingerprinting, behavioral analytics

Choosing between rule-based and machine learning approaches for fraud detection.

Overview

Both rules and machine learning have their place in fraud detection. The question isn't "which one" but "how to combine them effectively."

Comparison

Aspect	Rules	Machine Learning
Interpretability	High – clear logic	Lower – "black box"
Speed to deploy	Fast – hours/days	Slower – weeks/months
Maintenance	Manual updates	Retraining required
Novel fraud	Misses new patterns	Can detect anomalies
Known fraud	Excellent	Good
False positives	Higher if too strict	Optimizable
Expertise needed	Domain knowledge	Data science + domain

When to Use Rules

Rules excel when:

Known Fraud Patterns

IF transaction_country != billing_country
   AND account_age < 7_days
   AND transaction_amount > $500
THEN block

Regulatory Requirements

IF customer_on_OFAC_list = true
THEN block (no exceptions)

See AML Basics for compliance requirements.

Business Logic

IF order_contains(gift_cards) 
   AND order_total > $1000
   AND first_order = true
THEN manual_review

Immediate Response

When you discover a new fraud pattern, rules can be deployed in minutes.

When to Use ML

Machine learning excels when:

Pattern Complexity

Hundreds of features interacting
Non-linear relationships
Patterns too complex for human rule-writing

Scale

Millions of transactions
Need for real-time scoring
Too many segments for manual rules

Evolution

Fraud patterns shifting constantly
Need to catch novel approaches
Want to optimize over time

Probability Needed

Gradated risk scores (not just yes/no)
Threshold tuning required
Different actions at different confidence levels

The Hybrid Approach

Most production systems combine both:

Layer 1: Hard Rules (Pre-ML)

Blacklists
Regulatory blocks (OFAC, etc.) - See AML Basics
Impossible transactions (velocity > max)
Known fraud fingerprints (via device fingerprinting)

Layer 2: ML Scoring

Probability of fraud
Contextual risk assessment
Feature-rich evaluation

Layer 3: Soft Rules (Post-ML)

Threshold application (score > 80 = block) - See risk scoring
Override rules (VIP customers)
Manual review triggers
Business logic gates

Building Effective Rules

Rule Anatomy

RULE: High-risk first purchase
CONDITIONS:
  - first_order = true
  - order_value > $300
  - shipping_address != billing_address
  - email_age_days < 30
ACTION: manual_review
RATIONALE: New customers with high-value orders 
           to different addresses have 3x fraud rate
PERFORMANCE: 
  - Triggers: 2.3% of orders
  - Fraud rate when triggered: 8.2%
  - FP rate: 45%

Rule Hygiene

Document every rule – Purpose, conditions, rationale
Track performance – Hit rate, precision, recall
Review quarterly – Remove underperformers
Sunset old rules – Don't let rules accumulate
Version control – Track changes over time

Building Effective ML

Feature Engineering

Best features typically include:

Velocity (transactions per hour/day/week)
Deviation from normal (customer's own baseline)
Network features (links to other accounts)
Device/IP reputation
Time-based patterns

Model Considerations

Factor	Recommendation
Algorithm	Gradient boosting (XGBoost, LightGBM) often wins
Training data	Use confirmed fraud, not just chargebacks
Refresh frequency	Monthly minimum, weekly ideal
Feature stability	Monitor for drift
Explainability	Use SHAP values for investigation

Next Steps

Just starting fraud detection?

Start with rules → Known patterns are easier to block with rules
Use your processor's ML → Stripe Radar, Adyen Risk for baseline scoring
Build a review queue → Some transactions need human eyes

Improving your detection?

Analyze your rule performance → Which rules catch fraud? Which just add friction?
Add velocity rules → Velocity Rules guide
Layer signals → Rules + ML + device fingerprinting together

Going advanced?

Build custom ML models → If you have the data science resources
Invest in feature engineering → Better signals beat better algorithms
A/B test continuously → Track precision/recall tradeoffs

Velocity Rules - Detecting abuse patterns
Behavioral Analytics - How users interact
Device Fingerprinting - Tracking devices
Risk Scoring - Combining signals
Manual Review - Human investigation
Fraud Metrics - Measuring performance
Evidence Framework - Tier 1/Tier 2 indicators
3D Secure - Authentication layer
AVS & CVV - Verification signals
Processor Rules Configuration - Processor-level rules
Fraud Vendors - Vendor ML capabilities
Experimentation - Testing rule changes

Overview​

Comparison​

When to Use Rules​

Known Fraud Patterns​

Regulatory Requirements​

Business Logic​

Immediate Response​

When to Use ML​

Pattern Complexity​

Scale​

Evolution​

Probability Needed​

The Hybrid Approach​

Layer 1: Hard Rules (Pre-ML)​

Layer 2: ML Scoring​

Layer 3: Soft Rules (Post-ML)​

Building Effective Rules​

Rule Anatomy​

Rule Hygiene​

Building Effective ML​

Feature Engineering​

Model Considerations​

Next Steps​

Related Topics​