• Research Scientist - Efficient Audio Visual Machine Learning

    MetaRedmond, WA 98073

    Job #2752577638

  • Summary:

    At Meta's Reality Labs Research, our goal is to make world-class consumer virtual, augmented, and mixed reality experiences. Come work alongside industry-leading scientists and engineers to create the technology that makes VR and AR pervasive and universal. Join the adventure of a lifetime as we make science fiction real and change the world. We are a world-class team of researchers and engineers creating the future of augmented and virtual reality, which together will become as universal and essential as smartphones and personal computers are today. And just as personal computers have done over the past 45 years, AR and VR will ultimately change everything about how we work, play, and ~~~ are developing all the technologies needed to enable breakthrough Smartglasses, AR glasses and VR headsets, including optics and displays, computer vision, audio, graphics, brain-computer interfaces, haptic interaction, eye/hand/face/body tracking, perception science, and true telepresence. Some of those will advance much faster than others, but they all need to happen to enable AR and VR that are so compelling that they become an integral part of our lives.The Audio team within RL Research is looking for an experienced and innovative Research Scientist with a specialty in real-time and efficient audio-visual learning and machine learning to join our growing team. You will be doing core and applied research in technologies that improve listener's hearing abilities under challenging listening conditions using wearable computing, and alongside a team of dedicated researchers, developers, and engineers. You will operate at the intersection of egocentric perception, acoustics, computer vision, and signal processing algorithms with hardware and software co-design.

    Required Skills:

    Research Scientist - Efficient Audio Visual Machine Learning Responsibilities:

    1. Develop novel AI algorithms and associated real-time systems for source tracking, source localization, source diarization, and relevant semantic scene understanding with application into egocentric wearable computing in AR and VR.

    2. Design and develop efficient AI frameworks and real-time technical systems with constraints on low-compute, low-power and overall system latency.

    3. Lead the development of systems and methods to enable quick prototyping, proof of concept, or proof-of-experience and demonstrations.

    4. Contribute to datasets designs and large-scale data processing for real-time evaluations of efficient audio-visual machine learning methods.

    5. Contribute to the technical strategy and establish new execution methods where relevant for efficient compute driven AI systems in Audio AR and VR applications.

    6. Summarize technical findings to cross-org collaborators, and influence system design and integration decisions of multi-modal AI systems supporting hearing technologies in AR and VR.

    Minimum Qualifications:

    Minimum Qualifications

    1. PhD degree or equivalent experience in Deep Learning, Artificial Intelligence, Machine Learning, Computer Science, Robotics, Computer Vision, Computational Neuroscience, Signal Processing, Speech and Language technologies, or a related field..

    2. 4+ years of experience working on applied computer vision methods for wearable computing.

    3. 2+ years of experience working on efficient multimodal machine learning algorithms for low-compute and low-power devices.

    4. Research-oriented software engineering skills, including fluency with machine learning (e.g., PyTorch, TensorFlow, Scikit-learn, Pandas) and libraries for scientific computing (e.g. SciPy ecosystem).

    5. Experience with cross-group and cross-cultural collaboration.

    6. Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience.

    Preferred Qualifications:

    Preferred Qualifications

    1. 8+ years of experience working on core and applied computer vision methods.

    2. Experience with real-time AI modeling and systems design for wearable computing.

    3. 3+ years of experience working on audio-visual and multi-modal learning methods for egocentric perception.

    4. Experience with real-time statistical modeling including heuristics driven computer vision methods for egocentric data processing.

    5. Experience developing end-to-end ML pipelines, including dataset design, dataset preprocessing, model development and evaluation, and software integration into platforms.

    6. Experience bridging and adopting machine learning systems from research into potential tech-transferable packages for production.

    7. Experience with large-scale or distributed cluster computing for training, development and offline inference of machine learning models.

    8. Experience with interdisciplinary and/or cross-cultural collaboration with domain researchers in speech processing, auditory perception, psychoacoustics or related.

    Public Compensation:

    $213,000/year to $293,000/year + bonus + equity + benefits

    Industry: Internet

    Equal Opportunity:

    Meta is proud to be an Equal Employment Opportunity and Affirmative Action employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender, gender identity, gender expression, transgender status, sexual stereotypes, age, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics. We also consider qualified applicants with criminal histories, consistent with applicable federal, state and local law. Meta participates in the E-Verify program in certain locations, as required by law. Please note that Meta may leverage artificial intelligence and machine learning technologies in connection with applications for employment.

    Meta is committed to providing reasonable accommodations for candidates with disabilities in our recruiting process. If you need any assistance or accommodations due to a disability, please let us know at ~~~.