Nathan Claire Group is a leading Information Technology Consulting company that specializes in services supporting digital transformation. We are committed to delivering innovative solutions and cutting-edge technologies to our clients. As an Information Technology Consultant at Nathan Claire Group, you will have the opportunity to gain hands-on experience and work alongside industry professionals in a dynamic and fast-paced environment.
APM Full-Stack Engineer (Java, .NET, & Frontend)
Core Job Responsibilities
APM Agent Development & Maintenance (Backend Focus)
- Platform Expertise: Design, develop, and maintain APM agents and instrumentation for both Java (JVM-based) and .NET (Core and Framework) applications.
- Performance Engineering: Instrument core application code to capture metrics, traces, and logs with minimal overhead, focusing on deep-dive component monitoring (DDCM), transaction profiling, and database query tracing.
- Cross-Language Support: Troubleshoot and resolve complex performance issues in mixed-technology environments where Java and .NET services interact.
- Deployment and Service Management: Configure and manage the deployment of agents as system services (e.g., using NSSM, WinSW, or systemd) and troubleshoot installation issues, including those involving security protocols such as TLS/SSL.
Frontend Development & Visualization (User Focus)
- Data Presentation: Develop and enhance the user interface (UI) and user experience (UX) for the APM platform's dashboards and data visualization tools.
- Real User Monitoring (RUM): Implement and optimize Real User Monitoring (RUM) agents to capture client-side performance metrics, including page load times, AJAX calls, and JavaScript errors.
- Full-Stack Tracing: Ensure seamless correlation between frontend RUM data and backend Java/.NET transaction traces to provide end-to-end visibility.
- Web Performance Optimization: Optimize the frontend application for maximum speed, scalability, and cross-browser compatibility, often dealing with large data sets and complex charts.
Monitoring, Alerting, & Operations
- Pipeline Management: Work with data pipelines (e.g., Kafka, Elasticsearch) to ensure the efficient ingestion, processing, and storage of high-volume telemetry data (metrics, logs, traces).
- Alerting and Triage: Develop and maintain alert rules, thresholds, and anomaly detection models based on critical business and technical metrics (e.g., latency, error rate, throughput).
- Collaboration: Act as the subject matter expert (SME) for APM, collaborating with DevOps, SRE, and development teams to diagnose production incidents and define service level objectives (SLOs).
Required Skills and Qualifications
Core Programming and Frameworks
- Java APM: Deep experience with Java, Golang, JVM internals, and frameworks like Spring Boot or Quarkus. Knowledge of Java agent instrumentation tooling (e.g., Byte Buddy) is a major plus.
- .NET APM: Proficiency in C#, .NET Core, ASP.NET, the .NET CLR, and Golang. Familiarity with Reflection and dynamic code generation for instrumentation.
- Frontend: Expert-level HTML5, CSS3, and JavaScript (ES6+). Strong proficiency in at least one modern framework (e.g., React).
- Databases: Strong SQL skills (e.g., MSSQL, PostgreSQL, Oracle) and understanding of NoSQL concepts for tracing and querying.
APM & Observability Tools
- Hands-on experience with industry-standard APM tools (e.g., New Relic, Dynatrace, AppDynamics, Datadog).
- Familiarity with open-source observability standards, specifically OpenTelemetry (OTel) and its tracing concepts.
- Experience with log aggregation and analysis platforms (e.g., Elastic Stack/ELK, Splunk).
- In-depth knowledge of HTTP protocols, TLS/SSL security, and network traffic analysis.
Development & Operational Skills
- Version Control: Expert use of Git and collaboration on code repositories (GitHub, GitLab, or Bitbucket).
- CI/CD & DevOps: Experience with continuous integration and delivery pipelines (Jenkins, GitLab CI, Azure DevOps) and cloud platforms (AWS, Azure, or GCP).
- Troubleshooting: Exceptional analytical and problem-solving skills to triage complex, inter-service performance bottlenecks.