US Jobs US Jobs     UK Jobs UK Jobs     EU Jobs EU Jobs


Sr Software Engineer

• Summary:

Lead design and development of monitoring solutions, automation, and data analysis for Windows Server and Azure environments.

Highly technical, hands-on role focusing on LogicMonitor, scripting, and reliability engineering.

 

Key Responsibilities:

Architect, implement, and optimize monitoring using LogicMonitor (dashboards, alerting, custom collectors, integrations).

Build automation for device onboarding, configuration, and data collection using Python, PowerShell, and JavaScript.

Develop data analysis solutions and reports using T-SQL; enable actionable insights from metrics, logs, and events.

Integrate monitoring with ITSM/ChatOps tools (e.g., ServiceNow, PagerDuty, Teams/Slack).

Create technical documentation, runbooks, presentations, and architecture flow diagrams.

Troubleshoot complex issues; perform root-cause analysis and drive fixes across infra and apps.

Partner with cross-functional teams (Infra, Cloud, Security, App Dev) to improve reliability and performance.

Establish best practices for observability, alert hygiene, SLOs/SLIs, and incident response.

Be open to engage himself on long R&D to develop a solution.

 

Required Skills & Experience:

5+ years in software/automation/observability engineering roles.

Strong proficiency: Python, PowerShell, JavaScript, T-SQL.

LogicMonitor (or similar) hands-on experience: collectors, datasources, API, alerting, dashboards.

Windows Server architecture: AD/LDAP, DNS/DHCP, IIS, WMI/WinRM, Event Logs, perf counters, SQL Server basics.

Azure infrastructure: VM scale sets, Storage, Networking, App Services, Azure Monitor/Log Analytics, App Insights.

CI/CD fundamentals; version control (Git); REST APIs; JSON/YAML.

Excellent debugging skills; systematic problem-solving; performance tuning.

 

Preferred/Bonus:

Experience with Azure automation (Functions, Runbooks), Infrastructure as Code (Bicep/ARM/Terraform).

Observability ecosystem: Prometheus/Grafana, OpenTelemetry, KQL, Splunk.

SRE practices: incident management, postmortems, SLOs.

Security/compliance awareness (least privilege, secrets management).

Visualization tools (Visio, Lucidchart, Draw.io).

 

Soft Skills:

Excellent communication

Quick learner with a growth mindset and ownership mentality.

Team player with strong communication and collaboration skills.

Loves debugging, fixing bugs, and improving systems continuously.

Clear technical writing and presentation abilities.

 

Education:

Bachelor’s in Computer Science or equivalent experience





Share Job