Site Reliability Engineer SRE

BHFT

Not Interested
Bookmark
الإبلاغ عن هذه الوظيفة

profile موقع الوظيفة:

دبي - الإمارات

profile الراتب شهرياً: لم يكشف
تاريخ النشر: نُشرت قبل 2 يوم
عدد الوظائف الشاغرة: 1 عدد الوظائف الشاغرة

ملخص الوظيفة

We are looking for a Site Reliability Engineer who will be responsible for ensuring the reliable operation of our platform working with metrics to improve production process efficiency and participating in testing new product versions.

Responsibilities:

  • Production Stability Management: Ensure continuous compliance with external regulatory requirements and internal standards including risk security technology and trader needs. Support and automate validation and monitoring processes for adherence to necessary standards.

  • Incident Monitoring & Management: Develop and improve monitoring and alerting systems to detect anomalies in key production metrics. Implement rapid response mechanisms and efficient solutions to maintain strategy performance.

  • Release & Change Management: enforce standards for managing releases and changes to minimize deployment risks. Implement strict acceptance testing for all releases.

  • Process Management: Develop and maintain Standard Operating Procedures (SOPs) for the team manage task queues and organize shift schedules to ensure continuous support and high availability of trading strategies.

  • Integration Projects: Lead initiatives to connect with new exchanges brokers and trading platforms ensuring smooth and secure service integration.

  • Technical Performance Optimization: Continuously improve system availability resilience (MTTR MTBF) and latency reduction while optimizing data exchange performance and order routing to maximize profitability.


Qualifications :

Requirements:

  • Deep understanding of trading processes and market microstructure including colocation trading on native exchange protocols and algorithmic trading.
  • Experience in monitoring alerting systems and incident management for highload environments.
  • Knowledge of regulatory compliance and security standards.
  • Proficiency in monitoring and incident management tools such as Grafana ClickHouse Prometheus Opsgenie Grafana OnCall PagerDuty etc.
  • Experience developing and managing SOPs and KPIs for service teams.
  • Experience managing integration projects with brokers and exchanges.

Strong technical skill set including:

  • Linux systems administration and optimization.
  • TCP/UDP multicast networking.
  • FIXbased and native exchange protocols
  • Colocation infrastructure setup and management.
  • Python scripting for automation and monitoring.
  • English proficiency at C1 level or higher.

Remote Work :

Yes


Employment Type :

Fulltime

We are looking for a Site Reliability Engineer who will be responsible for ensuring the reliable operation of our platform working with metrics to improve production process efficiency and participating in testing new product versions.Responsibilities:Production Stability Management: Ensure continuo...
اعرض المزيد view more

المهارات المطلوبة

  • Kubernetes
  • FMEA
  • Continuous Improvement
  • Elasticsearch
  • Go
  • Root cause Analysis
  • Maximo
  • إدارة الصيانة بالحاسب الآلي
  • الصيانة
  • مهندس ميكانيكي
  • التصنيع
  • استكشاف الأخطاء وإصلاحها

عن الشركة

Company Logo

BHFT is a proprietary algorithmic trading firm. Our team manages the full trading cycle, from software development to creating and coding strategies and algorithms. Our trading operations cover key exchanges. The firm trades across a broad range of asset classes, including equities, e ... اعرض المزيد

عرض صفحة الشركة عرض صفحة الشركة