dataqbs
Cover
Carlos Carrillo

Carlos Carrillo

(He/Him)

AI-Driven Engineer | Data · Developer · DBA | Snowflake · Azure SQL · ADX/KQL · Python | Remote (EN/ES)

Mexico · Remote (Worldwide) · Contact info

500+ connections

dataqbs

Open to work

SQL Developer, ETL Developer, Data Engineer, Integration Lead, AI Engineer roles

About

I'm a Senior Data Engineer and Cloud Data Consultant with 20+ years of experience modernizing analytics ecosystems with Snowflake, Microsoft Fabric, Azure SQL, and SQL Server. I build automated, scalable pipelines and resilient data models that turn raw data into reliable, actionable insight — especially in high-volume, mission-critical environments where performance, cost efficiency, and long-term maintainability are survival. My toolkit is deep SQL + Python, paired with AI-assisted development (GitHub Copilot, ChatGPT, Claude) to deliver solutions that are cloud-native, operationally practical, and designed to evolve beyond prototypes.

"To live in peace, free from rigid structures — building projects that flow naturally through intelligence and awareness. Technology should serve life, not the other way around."

Experience

Data Integration Lead

Hexaware Technologies contract

Mar 2025 — Present

Mexico · Remote

Led Snowflake → Azure SQL integration for Freeport-McMoRan mining operations. Deployed incremental sync pipelines, built regression testing CLI, optimized Snowflake views, and developed production dashboards and AI chatbots for 7 mining sites.

SnowflakeAzure SQLADX/KQLAzure FunctionsApp ServicePythonStreamlitDockerGitHub EnterpriseCopilotMERGE/UpsertCDC/DeltaCTE RefactoringETL/ELTIoT Sensor DataMining AnalyticsEntra ID/KerberosCI/CD

Senior Database Engineer

FussionHit contract

Jan 2025 — Present

Remote

Database engineer for VCA Animal Hospitals on Azure Database for PostgreSQL. Built a full auditing and DDL export framework, performed schema performance reviews, and delivered ticket-based database remediation with TDD-quality documentation across multiple production databases.

PostgreSQLAzure Database for PostgreSQLNode.jsJavaScriptNunjuckspg_stat_statementsEXPLAINJiraHarvest APIGitHub Copilot

Data Engineer & AI Developer

dataqbs self-employed

Jan 2011 — Present

Guadalajara, Mexico · Remote

Independent consultancy providing BI, data engineering, and database solutions for US and LATAM clients. Also building internal R&D projects: crypto arbitrage scanner, grid trading bots, LLM evaluation engine, email classification system, and this portfolio site with RAG chatbot.

PythonSQL ServerPostgreSQLSnowflakeSSIS/SSRS/SSASTableauPower BIDataiku DSSAzure Data FactoryNode.jsccxtpandasAstroSvelte

Certifications

❄️

Generative AI Professional Certificate by Snowflake

Snowflake

Issued by Snowflake · 2026

📊

Microsoft Certified: Fabric Data Engineer Associate

Microsoft

Issued by Microsoft · 2025 · Expires 2026

Credential ID: C3B8D9CD1A3E7F8

Featured Projects

Crypto Arbitrage Scanner

FinTech

Scans 9 exchanges (Binance, Bitget, Bybit, Coinbase, OKX, KuCoin, Kraken, Gate.io, MEXC) for price inefficiencies. Uses Bellman-Ford shortest-path algorithm and triangular arbitrage detection. Includes a Swapper module for executing trades, WebSocket L2 order-book feeds, SDK bootstrapping for native exchange integrations, and a real-time balance monitor.

  • 4,000+ LOC scanner with graph-based arbitrage detection
  • 9 exchange integrations with 4 balance provider backends
  • Live swap executor with dry-run and production modes
PythonccxtpandasWebSocketPyYAMLBinance SDKujson

OAI Code Evaluator

AI / ML

YAML-driven evaluation pipeline with rule-based scoring across Instructions, Accuracy, Optimality, Presentation, and Freshness dimensions. Supports regex/substring matching, threshold conditions, ranking normalization, rewrite post-processing, and structured audit metadata output.

  • 6-stage evaluation pipeline (adjust → rules → rank → rewrite → validate → summary)
  • Declarative YAML rules with regex, substring, and threshold conditions
  • 5-dimension scoring with configurable ideals and tolerances
PythonRichPyYAMLjsonschemaJinja2

Email Collector & Classifier

Automation

Multi-account IMAP collector supporting Gmail, Hotmail (MSAL OAuth device-flow), and Exchange. Classifies emails into Scam/Suspicious/Spam/Clean/Unknown using a weighted scoring engine with 200+ domain rules, URL-shortener detection, phone-pattern matching, and fuzzy deduplication.

  • 5-label classifier with weighted scoring and hard rules
  • 200+ domain classification rules
  • OAuth device-flow for Hotmail/Outlook
Pythonimap-toolsMSALlangdetectPyYAML

dataqbs.com Portfolio

AI / ML

This very website — a LinkedIn-style portfolio with RAG-powered AI chatbot, built with Astro + Svelte + Tailwind on Cloudflare Pages.

  • RAG chatbot with vector embeddings + Groq LLM streaming
  • Knowledge pipeline: markdown → 58 chunks with 768-dim embeddings
  • i18n (EN/ES/DE), dark mode, LinkedIn-style layout
AstroSvelteTailwind CSSCloudflare Workers AIGroqTypeScript

MEMO-GRID

FinTech

Production grid trading microservice using ccxt with Binance Spot. Features Optuna hyperparameter optimization (50K trials), backtest engine with real fee modeling, attribution analysis (alpha vs beta decomposition), Monte Carlo projections, and 22 analysis tools. Includes FIFO inventory tracking, adaptive step sizing, and systemd deployment support.

  • HPO with 50,000 Optuna trials (TPE sampler) for ETH/BTC grid parameters
  • Backtest engine spanning 2017–2026 with maker fee modeling
  • Attribution analysis: alpha vs beta return decomposition
PythonccxtOptunapandasNumPyPyYAMLpytest

VCA PostgreSQL Audits

Data Eng.

Full audit and schema management framework for Azure Database for PostgreSQL. Includes per-object DDL export with Nunjucks templates, automated schema discovery, LLM-friendly schema_knowledge.json generation, and 20+ ticket-based database improvements across index optimization, FK remediation, timestamp normalization, and stored procedure reviews.

  • 20+ tickets: index optimization, FK remediation, schema renames, timestamp fixes
  • Templated per-object DDL exporter (Nunjucks) for CI/CD-friendly snapshots
  • Technical Design Documents for 5+ database systems
PostgreSQLNode.jsJavaScriptNunjucksAzure PostgreSQL

IROC Video Wall Dashboard

Data Eng.

Streamlit-based production monitoring dashboard for IROC operations across 7 Freeport-McMoRan mining sites. Features real-time metrics from Snowflake and Azure Data Explorer (ADX), 34 KPIs covering dig compliance, crusher rates, cycle times, and ROM tonnage. Includes RAG-powered AI chatbot with GitHub Copilot SDK, semantic model with 16 business outcomes per site, and auto-refresh every 60 seconds.

  • 34 KPIs across 7 mining sites with real-time auto-refresh
  • AI chatbot with RAG + GitHub Copilot SDK (zero-cost for enterprise)
  • Semantic model: 16 business outcomes × 7 sites with ADX + Snowflake queries
PythonStreamlitSnowflakeAzure Data ExplorerKQLGitHub Copilot SDK

Skills

💻 Languages

Python Expert
SQL Expert
JavaScript / TypeScript Advanced
KQL (Kusto) Advanced
Bash Advanced
Node.js Advanced
PowerShell Intermediate

☁️ Data & Cloud

Snowflake Expert
Azure (SQL, ADF, Functions) Advanced
Azure Data Explorer (ADX) Advanced
Microsoft Fabric Intermediate
Cloudflare (Pages, Workers, AI) Intermediate

🤖 AI & ML

LLM Evaluation & Prompt Eng. Advanced
RAG (Retrieval-Augmented Gen.) Advanced
Snowflake Cortex AI Advanced
Vector Embeddings & Search Advanced
GitHub Copilot SDK Advanced
Optuna (HPO) Advanced
Fine-Tuning (PEFT / LoRA) Intermediate

📦 Libraries & Frameworks

pandas / NumPy Expert
ccxt (crypto exchanges) Expert
Streamlit Advanced
Playwright Advanced
Astro / Svelte Intermediate
Nunjucks / Jinja2 Advanced
Rich / rapidfuzz Advanced

🔧 DevOps & Tools

GitHub Actions CI/CD Advanced
Poetry / pip Expert
ruff / pre-commit / pytest Advanced
Docker Intermediate
QEMU / KVM Intermediate
Linux (Pop!_OS) Advanced

🗄️ Databases

SQL Server Expert
Snowflake Expert
PostgreSQL Expert
Azure SQL / Azure PostgreSQL Advanced
SQLite Advanced

Contact