January 28, 20267 min read

Email List Validation: Rules-Based vs AI — Honest Comparison

Regex or a neural network? SMTP handshake or ML scoring? The question is a bit like "fork or spoon" — it depends what's on the plate. We use both approaches and know exactly where each one breaks.

Rules: regex → MX → SMTP

The classic pipeline. Three steps, each cutting a different layer of bad addresses.

Regex is the fastest check. It confirms the address looks like an address: there's an @, the domain isn't empty, no spaces. Runs in microseconds. Catches typos, random input, strings like "asdf". Simple, but it removes a surprising amount of garbage.

MX lookup checks DNS. If a domain has no MX record, it doesn't accept mail. That's final. 50–200 ms, and you already know: sending there is pointless.

SMTP handshake connects to the mail server, issues RCPT TO, and listens. 250 means the mailbox exists. 550 means it doesn't. It's the only way to check a specific address rather than just its domain.

This pipeline is fast, cheap, and predictable. For 70% of addresses it's enough. The remaining 30% is where things get painful.

Three gaps in rules

Catch-all. The server returns 250 OK for any address — even sdkjfhskdjf@company.com. SMTP is useless here. Around 15–20% of corporate domains run catch-all configuration.

Disposable addresses. Guerrilla Mail, Temp Mail, and a few hundred similar services. The domain is live, MX is in place, SMTP responds. Everything looks fine. An hour later the inbox is gone. You can maintain a blocklist, but new services appear faster than you can add them.

Spam traps and bot-generated addresses. test123456@gmail.com passes every check. A recycled trap does too. Sending to either kills your domain reputation. Regex can't tell the difference between john.smith and xk7q9mz2.

ML models: a different lens

AI doesn't replace rules — it looks at the problem from another angle. The model trains on millions of addresses with known outcomes: delivered, bounced, complaint. It learns patterns that don't fit into rules.

Character distribution in the local part. Domain age. Naming conventions within a company. Correlation between provider and address type. All of this becomes a score: the probability the address is live and safe.

Catch-all domain? The model evaluates how much ivanov@company.com fits the actual naming patterns used on that domain. Disposable? No blocklist needed — the model reads the domain's features and classifies even services that launched yesterday. Bot-generated addresses? High local-part entropy gives them away.

But AI isn't magic

A model is only as good as its training data. A skewed dataset means errors on unfamiliar regions or exotic TLDs.

Transparency is a real problem. Regex broke? Open it, look, fix it. Model returned 0.43 instead of 0.87? Good luck debugging that.

Models also decay. Spam patterns shift, new disposable services don't look like old ones. Without regular retraining, accuracy drifts.

And inference has a cost. On a single address it's invisible; on a million addresses it adds up.

Criterion

Rules

AI / ML

Syntax and format

✅ Ideal — regex in microseconds

🔸 Overkill, rules handle it fine

Domain existence

✅ MX lookup, fast and accurate

🔸 Possible, but DNS is cheaper

Mailbox existence

✅ SMTP handshake (when server is honest)

🔸 Complements but doesn't replace SMTP

Catch-all domains

❌ SMTP useless — always returns 250

✅ Scores by pattern probability

Disposable addresses

⚠️ Blocklists go stale

✅ Classifies by domain features

Spam traps

❌ Indistinguishable from real addresses

✅ Scores on entropy and behavior

Speed

✅ Fast, especially regex + MX

⚠️ Inference costs more at scale

Transparency

✅ You can see which rule fired

⚠️ Black box, hard to debug

Adaptability

❌ Manual rule updates required

✅ Retrains on new data

✅ strong · ⚠️ works with caveats · ❌ doesn't work · 🔸 overkill

In practice: two stages

Framing this as a competition gets it wrong. In our experience, the best results come from a two-stage pipeline.

First, rules. Regex, MX, SMTP. Fast, cheap, removes 60–70% of the garbage. No prediction needed here — the answers are binary.

Then ML. Addresses that pass the first filter go through the model. It assigns scores: bounce probability, disposable flag, spam-trap risk. You set the threshold — conservative or aggressive.

Rules handle the bulk. AI handles the edge cases. Together they're more accurate than either alone.

What uChecker does

Exactly this pipeline. Upload a list — every address goes through syntax, DNS, SMTP, then AI scoring. The output isn't just "valid / invalid": it's a report with reason, category, and risk level.

Catch-all domain? We show the probability the mailbox is real. Disposable? Flagged, even if the service launched last week. Suspicious pattern? You get a risk score and make the call.

Want to check your list? Upload your file to uChecker — results in minutes.