Filtros
Fecha de publicación
Experiencia laboral
Tipo de empleo
5 vacantes

Trabajo en

5 vacantes
Recibe ofertas de empleo por email.
Resultados de la búsqueda:

SOFTWARE ENGINEER, AI (JAVA) - I792

**software engineer, ai — code evaluation & training (remote)** **list of accepted countries and locations** help train large-language models (llms) to write production-grade code across a wide range of programming languages: - ** compare & rank multiple code snippets**, explaining which is best and why. - ** repair & refactor ai-generated code** for correctness, efficiency, and style. - ** inject feedback** (ratings, edits, test results) into the rlhf pipeline and keep it running smoothly. **end result**: the model learns to propose, critique, and improve code the way _you_ do. **rlhf in one line** generate code ➜ expert engineers rank, edit, and justify ➜ convert that feedback into reward signals ➜ reinforcement learning tunes the model toward code you’d actually ship. **what you’ll need**: - ** 4+ years of professional software engineering experience** in **java** (constraint programming experience is a bonus, but not required) - ** strong code-review instincts**—you can spot logic errors, performance traps, and security issues quickly. - ** extreme attention to detail and excellent written communication skills.** much of this role involves explaining _why_ one approach is better than another. this cannot be overstated. - you **enjoy reading documentation and language specs** and thrive in an asynchronous, low-oversight environment. **what you don’t need**: - no prior rlhf (reinforcement learning with human feedback) or ai training experience. - no deep machine learning knowledge. if you can review and critique code clearly, we’ll teach you the ...


SOFTWARE ENGINEER, AI (JAVA) (L906)

**software engineer, ai — code evaluation & training (remote)** **list of accepted countries and locations** help train large-language models (llms) to write production-grade code across a wide range of programming languages: - ** compare & rank multiple code snippets**, explaining which is best and why. - ** repair & refactor ai-generated code** for correctness, efficiency, and style. - ** inject feedback** (ratings, edits, test results) into the rlhf pipeline and keep it running smoothly. **end result**: the model learns to propose, critique, and improve code the way _you_ do. **rlhf in one line** generate code ➜ expert engineers rank, edit, and justify ➜ convert that feedback into reward signals ➜ reinforcement learning tunes the model toward code you’d actually ship. **what you’ll need**: - ** 4+ years of professional software engineering experience** in **java** (constraint programming experience is a bonus, but not required) - ** strong code-review instincts**—you can spot logic errors, performance traps, and security issues quickly. - ** extreme attention to detail and excellent written communication skills.** much of this role involves explaining _why_ one approach is better than another. this cannot be overstated. - you **enjoy reading documentation and language specs** and thrive in an asynchronous, low-oversight environment. **what you don’t need**: - no prior rlhf (reinforcement learning with human feedback) or ai training experience. - no deep machine learning knowledge. if you can review and critique code clearly, we’ll teach you the ...


SOFTWARE ENGINEER, AI (JAVA) | NEP007

**software engineer, ai — code evaluation & training (remote)** **list of accepted countries and locations** help train large-language models (llms) to write production-grade code across a wide range of programming languages: - ** compare & rank multiple code snippets**, explaining which is best and why. - ** repair & refactor ai-generated code** for correctness, efficiency, and style. - ** inject feedback** (ratings, edits, test results) into the rlhf pipeline and keep it running smoothly. **end result**: the model learns to propose, critique, and improve code the way _you_ do. **rlhf in one line** generate code ➜ expert engineers rank, edit, and justify ➜ convert that feedback into reward signals ➜ reinforcement learning tunes the model toward code you’d actually ship. **what you’ll need**: - ** 4+ years of professional software engineering experience** in **java** (constraint programming experience is a bonus, but not required) - ** strong code-review instincts**—you can spot logic errors, performance traps, and security issues quickly. - ** extreme attention to detail and excellent written communication skills.** much of this role involves explaining _why_ one approach is better than another. this cannot be overstated. - you **enjoy reading documentation and language specs** and thrive in an asynchronous, low-oversight environment. **what you don’t need**: - no prior rlhf (reinforcement learning with human feedback) or ai training experience. - no deep machine learning knowledge. if you can review and critique code clearly, we’ll teach you the ...


SOFTWARE ENGINEER, AI (JAVA) | [HX-084]

**software engineer, ai — code evaluation & training (remote)** **list of accepted countries and locations** help train large-language models (llms) to write production-grade code across a wide range of programming languages: - ** compare & rank multiple code snippets**, explaining which is best and why. - ** repair & refactor ai-generated code** for correctness, efficiency, and style. - ** inject feedback** (ratings, edits, test results) into the rlhf pipeline and keep it running smoothly. **end result**: the model learns to propose, critique, and improve code the way _you_ do. **rlhf in one line** generate code ➜ expert engineers rank, edit, and justify ➜ convert that feedback into reward signals ➜ reinforcement learning tunes the model toward code you’d actually ship. **what you’ll need**: - ** 4+ years of professional software engineering experience** in **java** (constraint programming experience is a bonus, but not required) - ** strong code-review instincts**—you can spot logic errors, performance traps, and security issues quickly. - ** extreme attention to detail and excellent written communication skills.** much of this role involves explaining _why_ one approach is better than another. this cannot be overstated. - you **enjoy reading documentation and language specs** and thrive in an asynchronous, low-oversight environment. **what you don’t need**: - no prior rlhf (reinforcement learning with human feedback) or ai training experience. - no deep machine learning knowledge. if you can review and critique code clearly, we’ll teach you the ...


FZ711 | SOFTWARE ENGINEER, AI (JAVA)

**software engineer, ai — code evaluation & training (remote)** **list of accepted countries and locations** help train large-language models (llms) to write production-grade code across a wide range of programming languages: - ** compare & rank multiple code snippets**, explaining which is best and why. - ** repair & refactor ai-generated code** for correctness, efficiency, and style. - ** inject feedback** (ratings, edits, test results) into the rlhf pipeline and keep it running smoothly. **end result**: the model learns to propose, critique, and improve code the way _you_ do. **rlhf in one line** generate code ➜ expert engineers rank, edit, and justify ➜ convert that feedback into reward signals ➜ reinforcement learning tunes the model toward code you’d actually ship. **what you’ll need**: - ** 4+ years of professional software engineering experience** in **java** (constraint programming experience is a bonus, but not required) - ** strong code-review instincts**—you can spot logic errors, performance traps, and security issues quickly. - ** extreme attention to detail and excellent written communication skills.** much of this role involves explaining _why_ one approach is better than another. this cannot be overstated. - you **enjoy reading documentation and language specs** and thrive in an asynchronous, low-oversight environment. **what you don’t need**: - no prior rlhf (reinforcement learning with human feedback) or ai training experience. - no deep machine learning knowledge. if you can review and critique code clearly, we’ll teach you the ...


Boletín de vacantes

Cree una alerta de empleo y reciba nuevas ofertas que se adaptan a su perfil desde más de 2550 sitios web de empleo

Puede darse de baja en cualquier momento.
trabajosonline.net © 2017–2021
Más información