Cosmon is looking for a Staff Engineer to own the Nexus desktop platform and enterprise deployment experience that every integration and customer deployment relies on. This role sits on the platform team solving native Windows, COM, update, and deployment problems that directly determine onboarding speed, reliability, and whether POCs convert to customers. Your work will shape product reliability across Fortune 100 environments. What you'll do • Own the Nexus desktop app framework, set architecture, and drive runtime decisions. • Build COM and native interop infrastructure including client helpers, lifecycle management, threading, and marshaling. • Design process reliability features such as sandboxing, watchdog supervision, crash isolation, and observability. • Define and implement plugin and extension architecture for versioning, loading, isolation, and independent updates. • Deliver enterprise deployment packages that handle Defender, firewall rules, signing, MSI packaging, Group Policy, SCCM, and Intune edge cases. • Own update and rollback systems, including silent updates, staged rollouts, and air-gapped compatibility. • Partner directly with customer IT/security teams during POCs to resolve installer, Defender, and registration issues. • Drive onboarding time-to-value from download to first successful workflow and remove friction in first-run experiences. • Set and hit MTTR targets for client-reported issues and build triage and debugging infrastructure. • Track and reduce production bug count through severity-weighted metrics and ownership of reliability trends. • Build platform observability (logs, traces, telemetry) that works in locked-down enterprise environments. What Cosmon is looking for • 8+ years building production software, with at least 3 years in a staff or senior IC role owning system-level outcomes. • Deep experience shipping desktop applications into enterprise Windows environments, including MSI, signing, Defender, Group Policy, SCCM/Intune, and firewall issues. • Strong expertise in COM (threading models, marshaling, out-of-process hosting, debugging HRESULTs). • Strong C/C++ skills with working knowledge of Python or TypeScript for tooling and higher-level components. • Experience in process-level reliability engineering (watchdogs, crash recovery, IPC, sandboxing). • Demonstrated production debugging skills with tools like WinDbg, Event Viewer, and MSI verbose logs. • Experience with Windows security tooling (Defender, SmartScreen, EDR, certificate management) and enterprise update systems (Squirrel, MSIX, custom updaters). • Experience operating in air-gapped or heavily network-restricted environments and building plugin/extension architectures. • Exposure to engineering or scientific desktop software (CAD/CAE/PLM) is a plus.
Cosmon shapes the future of computer-aided engineering by building AI that thinks like an engineer. It offers a rare opportunity to join early and make a foundational impact at a company reimagining CAE for the AI era. The company is based in Palo Alto, California, with an address at 531 Howard Street, G/F, San Francisco, CA.
Qualifications
Proficiency in software development, programming languages (e.g., Python, C++, or Java), and algorithm implementation (Bonus) Experience with computer-aided engineering (CAE), computer-aided design (CAD), and product lifecycle management (PLM) tools Strong knowledge of AI, machine learning, and data analysis techniques Ability to work collaboratively with cross-functional teams and communicate technical concepts effectively Understanding of engineering principles in simulation, modeling, and optimization Bachelor’s or advanced degree in Computer Science, Mechanical Engineering, Electrical Engineering, or a related field Problem-solving skills and attention to detail with demonstrated project success Experience working in a fast-paced, startup environment is a plus
Salary
$160,000 - $250,000
Equity
0.5% - 1%
Location
San Francisco, United States