Senior R&D Engineer (GUI Autonomous Agents)
Job Details
Vacancies: 1 position
Experience Required: No experience required
Job Description
About Agentic Labs
Agentic Labs is at the forefront of the next frontier in Artificial Intelligence: Action. While LLMs have mastered text, we are building intelligent agents capable of perceiving, reasoning, and executing complex tasks across graphical user interfaces (GUIs). Our mission is to bridge the gap between digital intent and execution, creating agents that use computers just as humans do.
The Role
We are looking for an exceptional R&D Engineer to join our core team in Singapore. In this role, you will lead the research and development of multimodal AI agents capable of understanding UI layouts, interpreting visual context, and performing accurate actions (clicks, types, scrolls) across Android environments.
You will work at the intersection of Computer Vision, Natural Language Processing, and Reinforcement Learning, translating state-of-the-art research into robust, deployable agents.
Key Responsibilities
- Core Research & Algorithm Design: Design and train Large Multimodal Models (LMMs) specifically optimized for GUI grounding, screen parsing, and action prediction.
- Agent Framework Development: Build end-to-end pipelines for autonomous agents, including environment observation, planning, critique, and execution loops.
- State-of-the-Art Implementation: Keep abreast of the latest advancements in GUI agents (e.g., DroidRun, Surfer 2, AutoGLM). Reproduce, improve, and adapt SOTA methods for our specific use cases.
- Evaluation & Benchmarking: Design rigorous evaluation frameworks to measure agent success rates, reliability, and safety in real-world software environments.
- Publications & IP: Contribute to the company’s intellectual property portfolio and publish findings in top-tier conferences or journals when applicable.
Requirements
- Education: Ph.D. or Master’s degree in Computer Science, Artificial Intelligence, Robotics, or a related field.
- Publication Record: A proven track record of research publications in top-tier AI/ML conferences (e.g., CVPR, NeurIPS, ICLR, ICML, ACL, AAAI, CHI). Please include a link to your Google Scholar profile or portfolio in your application.
- Coding Skills: Strong software engineering skills in Python, with experience in large-scale model training and data processing.
- Language: Fluent Chinese communication skills, enabling close collaboration with China-based teams.
Preferred Qualifications (Bonus)
- Experience specifically with GUI Agents.
- Background in Reinforcement Learning (RLHF, PPO) for agent fine-tuning.
- Experience evaluating agents on benchmarks like World of Bits, MiniWoB++, or OSWorld.
- History of contributing to open-source AI projects.
- Deep understanding of Vision-Language Models (VLMs) and Transformer architectures.
- Proficiency in deep learning frameworks (PyTorch or JAX).
- Experience with Object Detection, OCR, or UI Understanding tasks.
Why Join Agentic Labs?
- Impact: Work on 0-to-1 technology that fundamentally changes Human-Computer Interaction (HCI).
- Culture: A research-driven environment that values scientific rigor and engineering excellence.
- Compensation: Competitive salary package commensurate with experience and technical depth.
XG TECH PTE. LTD.
Ready to Apply?
This is a direct application to XG TECH PTE. LTD. No recruitment agencies involved.