Amber Group is a global leading digital asset company providing crypto financial services to both institutional and high-net-worth investors globally.
We offer best-in-class liquidity solutions and cutting-edge trading infrastructure across major exchanges, applications, and networks. With over $1 trillion in cumulative trading volume, our deep liquidity helps power the digital asset ecosystem.
Beyond trading, our full-suite of offerings includes wealth management, lending and investing products. But at our core, we focus on building strong relationships and delivering personalized service to help clients navigate this fast-growing industry.
At Amber, security is our #1 priority. We have invested years of effort and millions of dollars in cybersecurity, crypto-security, and operational security across the firm, with industry-leading certifications like SOC 2 Type II and ISO 27001.
Powered by a 400+ team of traders, technologists and engineers operating 24/7 globally, our technology and research capabilities are world-class. Yet we remain entrepreneurial, always seeking fresh ideas and risks worth taking. We are always interested in people who have an appetite for taking calculated risk, demonstrate a high level of original thinking and intellectual curiosity.
Roles & Responsibilities:
Node Management: Deploy, manage, and maintain blockchain nodes across various environments ensuring high availability and reliability. - Infrastructure Automation: Develop and maintain Infrastructure as Code (IaC) to automate node deployment and configuration using tools like Terraform, Ansible, or equivalent.
Shared Infrastructure Management: Follow best practice to deploy and maintain shared infrastructure tools like Airflow, Slurm, Ray, etc.
Monitoring & Alerting: Implement robust monitoring, logging, and alerting solutions for early detection of issues using tools such as Prometheus, Grafana, or ELK Stack. - Performance Optimization: Analyze and optimize node performance and scalability to handle increased load and transaction volume.
Incident Response: Lead incident response efforts, troubleshoot issues, and implement fixes in real-time to minimize downtime.
Collaboration: Work closely with developers and product teams to integrate new features and support ongoing projects.
Security Practices: Ensure compliance with security best practices and contribute to regular security assessments and audits.
Documentation: Maintain comprehensive documentation for node management processes, configurations, and standard operating procedures.
Requirements:
Education: Bachelor's degree in Computer Science, Engineering, or related field, or equivalent practical experience.
Experience: 3+ years of experience in Site Reliability Engineering, DevOps, or equivalent roles, with a focus on blockchain node management. Technical Skills: - Proficient in scripting languages such as Python, Bash, or similar.
Experience with cloud platforms like AWS, GCP, or Azure.
Strong understanding of containerization technologies, such as Docker and Kubernetes. - Familiarity with networking concepts and protocols, particularly those relevant to blockchain networks.
Soft Skills:
Excellent problem-solving abilities and analytical skills.
Strong communication skills and ability to work effectively in a team environment.
Proactive mindset with a passion for reliability and automation.
Preferred Qualifications:
Experience with blockchain technologies like Ethereum, Bitcoin, or other distributed ledger technologies.
Experience in FinOps
Previous contributions to open-source blockchain projects.
Certifications in cloud computing or DevOps practices.