Who I work with
What I build
- Production scraping pipelines - proxy infrastructure, anti-bot strategies, retry semantics
- Data ingestion for RAG systems and AI agents - markdown-native, LLM-formatted output
- Custom endpoints and extensions on top of the Nodesnack API
- Migrations of legacy or broken scrapers to reliable, maintained systems
- End-to-end data delivery: scheduled CSV, S3, webhook, or direct database writes
Every engagement is scoped to the client's specific targets and delivery requirements.
What you get in the first two weeks
- A working scrape-to-delivery pipeline for one priority source - code, infra, and first data landed in your system
- Documented architecture: proxy strategy, rate-limit policy, retry semantics, failure modes
- Monitoring and a written weekly update so you know where things stand without a meeting
How we work together
- Written intake. You describe the data you need and the systems it feeds. 20 minutes, async, no call required.
- SOW within 48 hours. I send a written scope, deliverables, and start date.
- Start within the week. First deliverables land in the first 7 days of the engagement.
Engagement shape
Pre-scoped monthly feeds
Every feed is scoped to an input list you provide - tickers, company names, SKUs, ZIP codes, counties, VINs, domains - whatever the source takes as a lookup. Avg records and avg price below reflect a typical client volume; the real quote scales with list size and refresh cadence. Delivery is scheduled, normalized, and lands in S3, a webhook, or your database.
| Source | Category | Monthly delivery | Avg Delta | Avg price |
|---|---|---|---|---|
| LinkedIn (public profiles) | Firmographics | For your company list - employee counts, org charts, hiring trends, tech-role mix, company-page updates | ~30k | $1,550 |
| LinkedIn Jobs | Talent | For your company list or role-filter set - open postings, seniority, skills, applicant counts | ~40k | $1,550 |
| Amazon | Retail | For your ASIN, keyword, or category list - pricing, reviews, BSR, availability, seller buy-box | ~20k | $1,550 |
| SEC EDGAR | Financial | For your ticker or CIK list - 10-K / 10-Q / 8-K filings, insider transactions, 13F holdings | ~3k | $300 |
| Google Maps | Local | For your location + keyword combos - POIs with hours, reviews, categories, popular times | ~10k | $400 |
| Zillow | Real Estate | For your ZIP, address, or MLS-area list - listings, Zestimates, transaction history, price changes | ~10k | $500 |
| Indeed | Talent | For your company or role-keyword list - postings, salaries, locations, posting age | ~20k | $500 |
| Glassdoor | Firmographics | For your company list - reviews, salaries, interview questions, CEO approval, benefits | ~8k | $500 |
| Crunchbase | Financial | For your company or investor list - funding rounds, acquisitions, IPOs, board members | ~8k | $450 |
| Walmart | Retail | For your SKU or category list - pricing, reviews, availability, pickup options by store | ~20k | $1,550 |
| County Assessor/Recorder 3,000+ counties | Real Estate | For your target counties - monthly delta of ownership, deeds, tax assessments, liens, mortgages | ~200k | $2,800 |
| Secretary of State 50 states | Corporate | For your entity list - filings, registered agents, annual reports, UCC records across all 50 SoS portals | ~50k | $2,300 |
| OFAC Sanctions (SDN) | Compliance | Daily delta of the full SDN list - new and updated entries, alt names, vessels, IDs | ~300 | $250 |
| PACER | Legal | For your party, district, or docket-type filters - federal filings, opinions, bankruptcy records | ~5k | $500 |
| State Court Systems 50 states | Legal | For your party, docket-type, or county filters - civil and criminal records, judgments, liens, evictions | ~25k | $2,000 |
| NPI Registry (NPPES) | Healthcare | For your NPI, name, or taxonomy filter - provider lookups with specialty, address, affiliations | ~10k | $200 |
| Booking.com / Expedia | Travel | For your property or destination + date-range combos - pricing, availability, reviews, cancellation policies | ~25k | $2,100 |
| BuiltWith | Firmographics | For your domain list - detected technology stack, stack-change deltas, category coverage | ~10k | $400 |
| Google News | News | For your keyword or entity list - daily article aggregation across publishers, deduped and clustered | ~40k | $800 |
| Reddit (public subreddits) | Sentiment | For your keyword or subreddit list - posts, comments, sentiment, thread velocity | ~75k | $800 |
| FINRA BrokerCheck | Compliance | For your CRD or broker-name list - registrations, disciplinary records, employment history | ~10k | $450 |
| Greenhouse / Lever / Workable | Talent | For your company list - open roles, team signals, hiring velocity from public ATS boards | ~10k | $500 |
| Custom careers pages | Talent | For your company list - bespoke scrape of any company's /careers site, including those not on a standard ATS | per company | $0.20–$3 / co. |
| X (Twitter) | Sentiment | For your keywords or account list - mentions, sentiment, engagement, influencer reach | ~100k | $1,900 |
| FDA (openFDA) | Healthcare | For your drug or device list - approvals, adverse events, recalls, 510(k), inspections | ~5k | $300 |
| Yahoo Finance | Financial | For your ticker list - quotes, historicals, fundamentals, analyst estimates, options chains | ~10k | $600 |
| Kayak / Google Flights / Skyscanner | Travel | For your origin-destination + date pairs - airfare pricing, fare calendars, price-change tracking | ~30k | $1,900 |
| CoinGecko / CoinMarketCap | Crypto | For your token list - price, volume, market cap, exchange-level data | ~15k | $800 |
| Etherscan / BscScan / Polygonscan | Crypto | For your wallet or contract list - transactions, token transfers, gas, internal calls | ~100k | $800 |
| Pharmacy & Medical Boards 50 states | Healthcare | For your NPI or name list - license verification and disciplinary status across all 50 state boards | ~40k | $2,200 |
| State Attorney General Actions | Compliance | Daily sweep of all 50 AG sites - new enforcement actions, data-breach notifications, settlements | ~2k | $2,000 |
| MarineTraffic / VesselFinder | Supply Chain | For your vessel, IMO, or fleet list - positions, port calls, ETAs, voyage history | ~30k | $800 |
| State Procurement Portals 50 states | Gov Contracts | Daily sweep across all 50 state portals - new RFPs, contract awards, vendor registrations matching your keyword filters | ~40k | $2,600 |
| Port Authority Sites 50+ ports | Supply Chain | For the ports you select - container volumes, congestion metrics, vessel schedules, berth assignments | ~20k | $2,600 |
| GoodRx | Healthcare | For your drug + ZIP list - pharmacy-level prices, coupons, generic alternatives | ~15k | $550 |
| Carfax (public listings) | Automotive | For your VIN list - history summaries, accident indicators, service records | ~10k | $1,350 |
| Wayfair | Retail | For your SKU or category list - pricing, reviews, availability, sale status | ~15k | $1,350 |
| Instacart / FreshDirect | Grocery | For your SKU + ZIP list - grocery pricing and availability by store and retailer | ~25k | $1,350 |
| StockX / GOAT | Resale | For your product or SKU list - resale pricing, sales history, price premiums, size-level liquidity | ~20k | $2,100 |
| Shodan | Security | For your IP-range, org, or product-string set - open ports, service banners, vulnerabilities | ~40k | $800 |
| NFT Marketplaces (OpenSea / Blur) | Crypto | For your collection list - floor prices, volume, sales, rarity, holder concentration | ~30k | $1,600 |
| Glassnode (public) | Crypto | For your asset list - on-chain metrics, exchange flows, miner data, HODL waves | ~10k | $1,050 |
| Dow Jones / World-Compliance Watchlists | Compliance | For your entity list - PEP flags, adverse media, watchlist screening with change tracking | ~100k | $1,050 |
| State Insurance Rate Filings | Insurance | For your carrier or line-of-business filters - filed rates, forms, actuarial justifications, approval status | ~3k | $1,750 |
| FlightAware / FlightRadar24 | Aviation | For your tail-number, route, or operator list - live flight data, delays, airport performance, history | ~60k | $1,600 |
| DAT Freight & Analytics | Logistics | For your origin-destination lane list - spot rates, rate trends, capacity signals | ~8k | $1,050 |
| Blind (public threads) | Sentiment | For your company or topic list - thread content, sentiment, compensation signals, layoff chatter | ~15k | $1,050 |
No feeds match that search. Try a broader term, or email for a custom quote.
Record volumes and prices are averages - both scale with your input list size and how often you want the feed refreshed. A few feeds (e.g. custom careers pages) are priced per unit rather than a monthly flat. Don't see the source you need? The same infrastructure handles ~any public web source - email for a tailored quote.
Common questions
Can you start faster than one week?
Do you sign NDAs?
Do you work with non-AI companies?
About me
Director-level data acquisition leader at a public company, with nine years of production scraping, pipelines, and agentic extraction work. I run Nodesnack on the side - the same infrastructure I use for client engagements. See resume and portfolio for context.
Get in touch
To discuss an engagement, email andrew@abharrismethods.com.
- Company & product context
- Data sources you need
- Delivery format & cadence
- Rough timeline
- Anything else worth knowing
> replies within one business day