Data Platform Engineer / Distributed Data Systems Engineer
- Islamabad, Islamabad Capital Territory, Pakistan
- Autoscale Ventures
- Full-Time
- On-Site
Job Description:
Location: Remote (Pakistan)
Job Type: Full-time
Work Hours: 09:00 AM – 05:00 PM PST (09:00 PM – 05:00 AM PKT)
Role Summary:
We are looking for a Data Platform Engineer to work on large-scale, production-critical distributed systems powering our global data infrastructure. This role focuses on improving the reliability, scalability, and performance of backend data systems that handle tens of millions of records daily. The ideal candidate is comfortable debugging complex production issues, optimizing distributed database workloads, and improving large-scale export and processing systems without disrupting live client operations.
Key Responsibilities:
- Debug and optimize large-scale data exports and distributed processing pipelines handling multi-million record workloads
- Work directly with distributed databases such as YugaByte (or similar distributed SQL / NoSQL systems)
- Investigate and resolve production performance bottlenecks including query timeouts, slow exports, replication delays, and system instability
- Analyze query execution plans, indexing strategies, schema design, and partitioning approaches to improve reliability and scalability
- Design and implement scalable extraction and processing strategies including batching, streaming, partitioned exports, and asynchronous workflows
- Debug production issues safely without disrupting ongoing client-facing workloads
- Build internal debugging, observability, and monitoring tooling to improve system visibility and operational reliability
- Collaborate closely with backend engineers and architects to improve system performance, scalability, and maintainability
Requirements:
- Strong experience working on distributed systems or large-scale backend infrastructure
- Strong understanding of distributed database behavior, query execution plans, indexing strategies, partitioning, and performance optimization at scale
- Experience handling production systems processing large datasets and high-throughput workloads
- Familiarity with distributed databases such as YugaByte, CockroachDB, Cassandra, ScyllaDB, or similar technologies
- Strong debugging and production incident investigation skills
Nice to Have (Plus Points):
- Experience with high-volume export systems or client delivery pipelines
- Experience with streaming and messaging systems such as Kafka, NSQ, Pulsar, or RabbitMQ
- Familiarity with observability and monitoring tools such as Prometheus, Grafana, OpenTelemetry, or distributed tracing systems
- Experience working on production systems with strict uptime and delivery expectations
What You’ll Work On (early projects may include):
- Designing, building, and maintaining backend systems for data-heavy applications, with a primary focus on large-scale crawlers and market feeds.
- Contributing to projects spanning vehicle history and market data (VinAudit.com), scalable scraping solutions, and proxy infrastructure across datacenter, ISP, and mobile networks (SquidProxies.com), delivering aggregated insights to enterprise customers and building internal systems for scalability.
- Collaborating with cross-functional teams to optimize data pipelines, infrastructure, and performance.
- Solving backend infrastructure challenges across multiple products.
- Exploring opportunities to integrate AI/ML and private models into data-driven solutions.
Benefits & Perks:
- Fully Remote Work: Work from anywhere with reliable internet.
- Healthcare Coverage for you and your family.
- Paid Leave & Holidays, with time to rest and recharge.
- Equipment Funds Support to help set up or upgrade your home office.
- Profit Sharing: Monthly and annual bonuses tied to performance.
- Long-Term Career Growth: Be part of a company that values stability and growth.
- Collaborative Culture: Work with a supportive, globally distributed team.
About Us:
AutoScale Ventures is a technology-driven company with 50+ team members across the Philippines, Pakistan, India, the U.S., China, and Canada. We operate a group of businesses spanning tech, data services, infrastructure, and AI. Our main products and ventures include:
- Vehicle data (VinAudit.com)
- Proxy infrastructure solutions (SquidProxies.com)
Only shortlisted candidates will be contacted. We look forward to meeting the right person for this role!