Senior Data Architect who
transforms raw, complex data into actionable intelligence — self-taught
I design and build end-to-end systems that turn raw blockchain activity into actionable financial intelligence. OmniSight ingests Base Mainnet USDC flows, decodes EVM payloads, and scores wallet behaviour with ML — production-grade, latency-aware, and API-first.
Why OmniSight matters
Web3 moves fast and ledger data is noisy. OmniSight turns raw USDC transfers into structured, ML-ready intelligence for compliance, fraud detection, and institutional reporting.
Core capabilities
- Autonomous ingestion (Airflow) with 120s cadence
- EVM hex decoding and USDC normalisation via web3.py
- Partitioned PostgreSQL fact tables for scale
- Isolation Forest ML anomaly scoring (RobustScaler)
- FastAPI gateway — authenticated + public endpoints
OmniSight Web3 Data Engine
A production-grade intelligence engine built entirely from scratch — no bootcamp, no team, no starter template. Ingests Base Mainnet USDC flows, decodes them into partitioned fact tables, and scores wallet behaviour with a trained ML model exposed via a documented REST API.
System snapshot
- 120s ingestion cadence via Airflow pulling JSON-RPC event logs
- EVM decoder → typed analytics fields (wallet, amount, block)
- Partitioned tables — usdc_transfers_era_47m, era_48m, growing
- RobustScaler + Isolation Forest — trained on 6,079 real wallet profiles
- FastAPI + Nginx + TLS — live at ericdiamason.tech/docs
- Oracle Cloud Infrastructure — always-on, zero cost (Always Free tier)
Business outcomes
- Compliance-grade monitoring for exchanges and institutions
- Real-time alerts for high-value movements ≥$50,000 USD
- ML wallet risk scoring in under 200ms per request
- Full audit trail: block number, tx hash, ingestion timestamp
Market context
The blockchain analytics market is projected to reach $18.7 billion by 2034 (22% CAGR). OmniSight demonstrates production-level capability in the space occupied by Chainalysis, Elliptic, and TRM Labs — built independently, from scratch, on a free cloud tier.
Public API
Live, documented endpoints. Try the wallet risk scorer below in the Live Metrics section.
GET /api/v1/metrics/whale-alerts
GET /api/v1/public/wallet-risk?wallet_address=0x…
View full API docs →
GitHub
Open source. Production code, clean secrets management, documented architecture.
github.com/ericdiamason/omnisight →From ledger to risk score in one coherent pipeline
Eight stages, zero gaps. Every layer is designed, deployed, and maintained by one engineer.
Pipeline
Key design decisions
- Partitioned tables: block-range partitioning for query performance at scale
- RobustScaler: handles extreme whale outliers without discarding signal
- ON CONFLICT idempotency: pipeline is safe to re-run against any block range
- asyncpg pool: persistent connections, async I/O, handles concurrent requests
- Secrets via EnvironmentFile: zero credentials in source code or git history
- Rate-limited public endpoint: open access with abuse protection
Operational facts
- 214,000+ clean records, zero duplicates enforced by DB constraint
- Model version v20260617 — trained on real Base Mainnet data
- API survives server reboots via systemd service
- Log rotation configured — no unbounded disk growth
Real data. Real pipeline. Right now.
Every number below is pulled live from the OmniSight API — no mock data, no static fixtures.
Whale alerts — live feed
—| Block | Tx | Sender | Receiver | Amount (USD) | Ingested |
|---|---|---|---|---|---|
| Loading whale alerts… | |||||
Wallet Risk Explorer
Enter any wallet address to run it through the live Isolation Forest ML model. Powered by real Base Mainnet data.
Teams that benefit from OmniSight
I partner with Web3 infrastructure teams, fintech and compliance groups, exchanges, and analytics platforms to deliver production-grade telemetry and ML-driven risk intelligence.
Exchanges & Custodians
Real-time monitoring for large movements, counterparty tracing, and compliance workflows.
Compliance & AML
Automated alerts, ML risk tiers, and evidence-ready transaction trails for investigations.
Fintech & Analytics
Operational dashboards, institutional reporting, and data products built on reliable pipelines.
Let's build something that matters
If you need architecture, data engineering, or ML for Web3 risk and compliance, reach out. I consult, partner, and build production systems end-to-end.
Engagements I take
- Proof-of-concept to production pipelines
- ML risk scoring and model operationalisation
- API design and integration for analytics teams
- Architecture reviews and performance tuning
- Web3 data infrastructure from scratch