Andrew Harris

Data Acquisition & Engineering Leader | Scaling Data Platforms from Startup to IPO

Architect of billion-scale crawlers, agentic extraction pipelines, and AI-powered data intelligence systems

Professional Summary

Director of Web Data Acquisition and engineering leader with nine years at ZoomInfo, scaling the company from startup to public market leader. Architect of billion-scale crawlers, agentic extraction pipelines, and vectorized data platforms powering commercial intelligence. Expert in distributed systems, real-time data ingestion, cloud-native orchestration, and high-precision LLM/SLM-based extraction. Proven builder of high-performance teams across backend, data infrastructure, and machine learning. Delivers scalable, fault-tolerant data platforms that drive product innovation and enterprise revenue.

Professional Experience

Senior Manager, Agentic Web Data Acquisition
ZoomInfo · Mar 2023 - Present
Vancouver, WA

Senior Manager, Software Engineering
ZoomInfo · Apr 2021 - Mar 2023
Vancouver, WA

Manager of Data Strategy
ZoomInfo · Oct 2019 - Mar 2021
Vancouver, WA

Operations Analyst / Special Projects Analyst
DiscoverOrg · Jul 2016 - Oct 2019
Vancouver, WA

Side Projects

Google Search and Maps SERP scraper with Flask web UI, PostgreSQL storage, and batch CSV processing

Agentic RAG platform for navigating the legal system - crawls, indexes, and enables semantic search across legal documents

Fishing spot discovery platform for anglers. React/TypeScript frontend, Express backend, PostGIS for geospatial queries

Technical Skills

Data Infrastructure
Postgres, pgvector, AWS, S3, EC2, Lambda, Kinesis, Redis, Docker, GitHub Actions, Airflow, Kafka, Spark, Beam, Snowflake, BigQuery, GCS, Varnish
Languages & Frameworks
Python, SQL, FastAPI, Flask, Scrapy, Playwright, Selenium, BeautifulSoup, Node.js, Neo4J, Spanner, NLTK, spaCy, Go, PyArrow, LlamaIndex, RAG
Search & Crawling
SERP Orchestration, Proxy Management, Rate Limiting, Diff-based Crawling, Apache Nutch, Frontera, Crawl4AI, Colly, Guzzle, Headless Fingerprinting, CAPTCHA Resolution
Machine Intelligence
OpenAI, Anthropic, LLMs, Custom SLMs, Embeddings, Entity Resolution, Enrichment Models, LangChain, LangGraph, HuggingFace, Vertex AI, Groq
Leadership & Strategy
Hiring & Scaling, Technical Mentorship, OKR Design, CI/CD, Cross-functional Leadership, Vendor Integration, Data Lineage, GDPR, CCPA

Education

Master of Public Affairs (M.P.A.), Applied Policy
Washington State University

Bachelor of Arts (B.A.), History & Development of Language
Winthrop University
Sigma Tau Delta - Chapter President

Notable Achievements

Transformed ZoomInfo's acquisition codebase into a billion-page, enterprise-grade web intelligence platform driving core product data.
Advanced from analyst to director-level scope, assuming ownership of the company's mission-critical acquisition infrastructure and strategy.
One of the first 150 employees, instrumental in growing engineering and data infrastructure as the company scaled to 3,500+ employees and through its IPO.