Python Job | Вакансии | Стажировки@job_python P.6725

Notice: file_put_contents(): Write of 752 bytes failed with errno=28 No space left on device in /var/www/tgoop/post.php on line 50

Warning: file_put_contents(): Only 16384 of 17136 bytes written, possibly out of free disk space in /var/www/tgoop/post.php on line 50
Python Job | Вакансии | Стажировки@job_python P.6725

JOB_PYTHON Telegram 6725

Python Job | Вакансии | Стажировки

Research Engineer — ML RL Agents
#удаленка
Company: Affine.io
Salary: $150k - $500k
☑️Responsibilities
-Design decentralized RL systems that incentivize miners to train, refine, and host high-quality agentic LLMs on the Bittensor subnet.
-Develop evaluation frameworks to assess model performance, safety, and alignment—including task design, metrics, adversarial testing, and red-teaming.
-Advance RL for agentic models by researching and applying cutting-edge RL and alignment techniques to improve the training–evaluation loop.
-Prototype and scale algorithms: explore new agent architectures and post-training methods, then build reproducible pipelines for finetuning, evaluation, and data flow.
-Contribute to live competitive benchmarks, deploying new approaches in production and ensuring the system rewards genuine intelligence gains rather than gaming.

☑️Requirements
-Reinforcement Learning expertise with deep knowledge and hands-on experience in RL algorithms, design, and tuning. Background in multi-agent systems, mechanism design, or RLHF is a strong plus.
-Strong engineering skills in Python and experience building production-level ML systems with PyTorch, JAX, or TensorFlow.
-Distributed systems experience, with comfort designing and scaling high-performance, reliable infrastructure.
-Knowledge of LLMs and tool use, including how models interact with APIs, external tools, and function calling.
-Advanced academic or practical background: Master’s or PhD in a relevant field, or equivalent applied research and engineering experience.
Contact: https://applicantai.com/affine-io/research-engineer-ml-rl-agents/10570

🔥 Подписаться на наши каналы / @best_itjob / @it_rab

❤1

www.tgoop.com/job_python/6725

2.75K viewsSep 17 at 05:00

tgoop.com/job_python/6725

Create: 2025-09-17
Last Update: 2025-11-18 16:36:44

Research Engineer — ML RL Agents
#удаленка
Company: Affine.io
Salary: $150k - $500k
☑️Responsibilities
-Design decentralized RL systems that incentivize miners to train, refine, and host high-quality agentic LLMs on the Bittensor subnet.
-Develop evaluation frameworks to assess model performance, safety, and alignment—including task design, metrics, adversarial testing, and red-teaming.
-Advance RL for agentic models by researching and applying cutting-edge RL and alignment techniques to improve the training–evaluation loop.
-Prototype and scale algorithms: explore new agent architectures and post-training methods, then build reproducible pipelines for finetuning, evaluation, and data flow.
-Contribute to live competitive benchmarks, deploying new approaches in production and ensuring the system rewards genuine intelligence gains rather than gaming.

☑️Requirements
-Reinforcement Learning expertise with deep knowledge and hands-on experience in RL algorithms, design, and tuning. Background in multi-agent systems, mechanism design, or RLHF is a strong plus.
-Strong engineering skills in Python and experience building production-level ML systems with PyTorch, JAX, or TensorFlow.
-Distributed systems experience, with comfort designing and scaling high-performance, reliable infrastructure.
-Knowledge of LLMs and tool use, including how models interact with APIs, external tools, and function calling.
-Advanced academic or practical background: Master’s or PhD in a relevant field, or equivalent applied research and engineering experience.
Contact: https://applicantai.com/affine-io/research-engineer-ml-rl-agents/10570

🔥 Подписаться на наши каналы / @best_itjob / @it_rab

BY Python Job | Вакансии | Стажировки

Share with your friend now:
tgoop.com/job_python/6725

Open in Telegram

Telegram News

Date: 2025-11-18|

Users are more open to new information on workdays rather than weekends. Activate up to 20 bots Telegram offers a powerful toolset that allows businesses to create and manage channels, groups, and bots to broadcast messages, engage in conversations, and offer reliable customer support via bots. According to media reports, the privacy watchdog was considering “blacklisting” some online platforms that have repeatedly posted doxxing information, with sources saying most messages were shared on Telegram. In the next window, choose the type of your channel. If you want your channel to be public, you need to develop a link for it. In the screenshot below, it’s ”/catmarketing.” If your selected link is unavailable, you’ll need to suggest another option.
from us

Telegram Python Job | Вакансии | Стажировки
FROM American