Systems Engineer - Site Reliability

Regrid

Software Engineering
Remote
USD 140k-180k / year
Posted on Apr 15, 2025

Location: Full-time, remote position open only to applicants eligible to work in the United States.

Description

Regrid is a dynamic spatial data company building software and data products to deliver a nationwide dataset of 158+ million land parcels, 186+ million building footprints, and 180+ million addresses covering 100% of the US population. We offer our data in multiple formats to customers across private and public sector verticals via bulk data files, API, Esri-compatible feature service and our property app.

Over the years we have created a successful suite of products, a growing customer base and project portfolio, and an effective organization. Our products are built in-house from scratch and are a labor of love that has become a great business.

The Role

We are seeking a Systems Engineer - Site Reliability (SRE) to help lead the management of the systems and infrastructure that power our operations. As an SRE, you will leverage your deep experience to enhance the reliability, performance, and scalability of mission-critical systems. You will work closely with our CTO, software developers, and data engineers to ensure the continuity and optimal functioning of our systems and applications.

You’re self-driven and collaborative whether you’re tuning configurations or automating tasks. You’re knowledgeable and passionate about building stable products and environments and also open to learning new pa. You’re comfortable working with senior technical colleagues and customer-facing teams to ensure we’re growing in the best possible way.

We recognize that individuals from marginalized and underrepresented backgrounds are less likely to apply for positions if they do not meet 100% of the job requirements. If you meet the majority of the listed duties and requirements, we encourage you to apply.

Key Responsibilities

  • Collaborate with a fully remote team of software developers and data engineers to maintain reliable internal and external systems
  • Design, maintain, and monitor IT infrastructure to support web applications, APIs, and services
  • Manage and optimize cloud infrastructure (preferably Linode/Akamai), ensuring scalability, security, and cost-efficiency
  • Monitor and troubleshoot systems to identify and resolve bottlenecks, bugs, and performance issues
  • Support and test backup and disaster recovery systems and processes
  • Contribute to the implementation and improvement of the company’s Incident Response plan (on-call rotation required)
  • Evaluate new data and analytical projects, including cluster computing and infrastructure requirements
  • Identify opportunities to automate routine tasks and operational processes
  • Write and maintain clear, comprehensive documentation
  • Promote operational excellence, security best practices, and reliability across teams

Tools & Technologies You’ll Work With

  • Systems & Infrastructure: Linux server management, system security, and network administration
  • Containerization & Orchestration: Docker, Kubernetes, cloud-native tools
  • Cloud Providers: Hands-on experience with Linode/Akamai preferred; experience with any major cloud platform required
  • Networking: DNS, TCP/IP, firewalls, load balancers, and high-availability systems
  • Monitoring & Observability: Grafana, Prometheus, OpenTelemetry, Loki
  • Incident Management: Familiarity with PagerDuty, StatusPage, Sentry, and reliability engineering best practices
  • Automation & Scripting: Scripting for task automation and system management
  • Version Control: Solid understanding of Git
  • On-Call: Willingness to participate in a shared on-call rotation

Job Requirements

  • 4+ years of experience in systems administration, network engineering, IT, and/or software development, ideally within a remote-first and fast-paced environment
  • Strong troubleshooting skills and the ability to architect solutions to address bugs and mission-critical issues effectively
  • Familiarity with geospatial data and tools such as GDAL, Mapbox, PostGIS, ArcGIS, or similar
  • Some programming experience in Ruby, JavaScript/TypeScript, or other modern programming languages
  • Understanding of Infrastructure as Code (IaC) concepts and familiarity with related tools (e.g., Terraform, Ansible, CloudFormation)
  • Experience working with relational and NoSQL databases, including PostgreSQL, MySQL, and MongoDB
  • Exposure to Elasticsearch for search and analytics use cases
  • Basic understanding of data architecture concepts, including data warehouses, data lakes, cluster computing, and large-scale analytics workflows
  • Some experience with CI/CD pipelines and associated tooling for continuous integration and deployment

Compensation and Benefits

  • Salary Range: $140,000 - $180,000 (dependent on experience)
  • 100% Paid Health Insurance for Employee
  • 50% Paid Health Insurance for eligible dependents
  • HSA, Healthcare and Dependent Care FSAs with Company Contribution
  • 401K Retirement Plan with Company Match
  • Unlimited PTO
  • Paid Parental Leave
  • WFH, Learning and Development and Wellness Stipends
  • Flexible Remote Culture
