#robustness — Bluesky Posts — bluesky.baby

Profile Explorer

Home New Trending Search

About Privacy Terms

#

#robustness

Posts tagged #robustness on Bluesky

Sara Magliacane hiring PhDs at UvA

@smaglia.bsky.social

2 days ago

2 PhD Positions on Learning Causally Grounded Concepts for Safe AI Are you interested in improving the interpretability, robustness and safety of AI by integrating causal reasoning? The Causality team in the AMLab group at the University of Amsterdam is looking for 2...

🚨2 PhD positions with me @amlab.bsky.social on learning causally grounded concepts 🚨

Are you interested in improving the #interpretability #robustness and #safety of AI by integrating #causal reasoning? Join us in beautiful Amsterdam 🇳🇱🌷🚲

Deadline: 20 April

www.academictransfer.com/en/jobs/3593...

15 9 0 0

⛵Emmanuel Bethoux ⚓

@emmanuelbethoux.bsky.social

4 days ago

Course au Large 2030 - Accueil

6/6 Tomorrow, teams like Banque Populaire or SVR will be judged on their ability to sail under constraints. Sponsors are becoming transition partners. The 2030 Gold Fleet will be those who mastered their footprint before their final layline. 🏆 #OceanRacing #SailingBusiness #Robustness #Sponsorship

0 0 0 0

@shinyandiknowit.bsky.social

2 weeks ago

#Resolve #Resillience #Robustness #ReadItSomewhere

0 0 0 0

TMLR Published Papers

@tmlr-pub.bsky.social

3 weeks ago

The Geometry of Algorithmic Stability: A Hodge Theoretic View on Structural vs. Statistical Insta...

Karen Sargsyan

Action editor: Alberto Bietti

https://openreview.net/forum?id=rFqsgVXZYO

#robustness #stability #instability

1 0 0 0

TMLR Published Papers

@tmlr-pub.bsky.social

3 weeks ago

Adversarial Vulnerability from On-Manifold Inseparability and Poor Off-Manifold Convergence

Rajdeep Haldar, Yue Xing, Qifan Song, Guang Lin

Action editor: Olivier Cappé

https://openreview.net/forum?id=pa90uRZATF

#adversarial #robustness #classification

0 0 0 0

TMLR Published Papers

@tmlr-pub.bsky.social

1 month ago

End-to-End Conformal Calibration for Optimization Under Uncertainty

Christopher Yeh, Nicolas Christianson, Alan Wu, Adam Wierman, Yisong Yue

Action editor: Jake C. Snell

https://openreview.net/forum?id=yM8qkT0f9H

#optimization #robustness #optimize

1 0 0 0

Michael Beyeler

@mbeyeler.bsky.social

1 month ago

BIRD: Behavior Induction via Representation-structure Distillation Human-aligned deep learning models exhibit behaviors consistent with human values, such as robustness, fairness, and honesty. Transferring these behavioral properties to models trained on different ta...

Most transfer learning assumes shared data, tasks, or domains.

BIRD shows you can transfer behavior itself even when those assumptions break.

All details here:
arxiv.org/abs/2505.23933

#KnowledgeDistillation #Robustness #MachineLearning #AIResearch #ResponsibleAI

0 0 0 0

Michael Beyeler

@mbeyeler.bsky.social

1 month ago

Two-panel schematic illustrating the BIRD framework. Left panel shows independent pre-training of a teacher and a student network on different datasets, each optimized with its own task loss. Right panel shows representation-structure distillation: selected intermediate layers from teacher and student are compared via a representation loss, which aligns the geometry of their internal activations while the student is still trained on its own task loss. A snowflake icon indicates the teacher is frozen. The diagram emphasizes that behavior is transferred by aligning internal representation structure rather than outputs or shared data.

Two-panel schematic illustrating the BIRD framework. Left panel shows independent pre-training of a teacher and a student network on different datasets, each optimized with its own task loss. Right panel shows representation-structure distillation: selected intermediate layers from teacher and student are compared via a representation loss, which aligns the geometry of their internal activations while the student is still trained on its own task loss. A snowflake icon indicates the teacher is frozen. The diagram emphasizes that behavior is transferred by aligning internal representation structure rather than outputs or shared data.

We introduce BIRD: Behavior Induction via Representation-structure Distillation.

Instead of transferring outputs, BIRD aligns the geometry of internal representations between teacher and student, enabling weak → strong generalization.

#KnowledgeDistillation #TransferLearning #Robustness

0 0 1 0

TMLR Published Papers

@tmlr-pub.bsky.social

1 month ago

Robust Reinforcement Learning in a Sample-Efficient Setting

Siemen Herremans, Ali Anwar, Siegfried Mercelis

Action editor: Marcello Restelli

https://openreview.net/forum?id=iij6nLYLjF

#reinforcement #robustness #robust

0 0 0 0

TMLR Published Papers

@tmlr-pub.bsky.social

1 month ago

Mirror Descent Policy Optimisation for Robust Constrained Markov Decision Processes

David Mark Bossens, Atsushi Nitanda

Action editor: Alberto Maria Metelli

https://openreview.net/forum?id=tmfdqtFUqO

#adversarial #robustness #optimise

0 0 0 0

TMLR Published Papers

@tmlr-pub.bsky.social

1 month ago

Consistency Aware Robust Learning under Noisy Labels

Fahad Sarfraz, Bahram Zonooz, Elahe Arani

Action editor: Yu Yao

https://openreview.net/forum?id=pZulfLkARr

#robust #consistency #robustness

0 0 0 0

@ccahua.bsky.social

1 month ago

Digital Red Queen: Adversarial Program Evolution in Core War with LLMs A self-play algorithm that uses LLMs to evolve adversarially competing programs in Core War

When LLMs masturbate in #DRQ safe spaces, they maintain fitness pub.sakana.ai/drq/ #Sandbox #CoreWar #DigitalRedQueen #Generalists #ObjectiveShifting #Adaptation #Evolution #Robustness

0 0 0 0

TMLR Published Papers

@tmlr-pub.bsky.social

2 months ago

Robustness in Large Language Models: A Survey of Mitigation Strategies and Evaluation Metrics

PANKAJ KUMAR, Subhankar Mishra

Action editor: Aditya Menon

https://openreview.net/forum?id=Bchvaaod6g

#robustness #nlp #adversarial

0 0 0 0

@richb118.bsky.social

2 months ago

Stories About Control: Why Optimisation So Often Produces Fragility – Traders Outpost “What looked like control was only stability borrowed from the past.” The feeling of control is not the same thing as control. The traders most vulnerable to catastrophic failure are rarely naive. …

Stories About Control: Why Optimisation So Often Produces Fragility

👉 Read here:
atstradingsolutions.com/stories-abou...

#StoriesAboutControl #Robustness #Fragility #ProcessOverPrediction

0 0 0 0

TMLR Published Papers

@tmlr-pub.bsky.social

2 months ago

Are Domain Generalization Benchmarks with Accuracy on the Line Misspecified?

Olawale Elijah Salaudeen, Nicole Chiou, Shiny Weng, Sanmi Koyejo

Action editor: Ozan Sener

https://openreview.net/forum?id=fNywRyqPQo

#robustness #generalization #benchmarks

0 0 0 0

TMLR Published Papers

@tmlr-pub.bsky.social

2 months ago

Rethinking Robustness in Machine Learning: A Posterior Agreement Approach

João B. S. Carvalho, Víctor Jiménez Rodríguez, Alessandro Torcinovich et al.

Action editor: Mohammad Emtiyaz Khan

https://openreview.net/forum?id=Bpc9uZ6kcg

#robustness #adversarial #generalization

0 0 0 0

TMLR Published Papers

@tmlr-pub.bsky.social

3 months ago

New #Featured Certification, #Reproducibility Certification, #J2C Certification:

Robust Reinforcement Learning in a Sample-Efficient Setting

Siemen Herremans, Ali Anwar, Siegfried Mercelis

https://openreview.net/forum?id=iij6nLYLjF

#reinforcement #robustness #robust

0 0 0 0

TMLR Published Papers

@tmlr-pub.bsky.social

3 months ago

New #J2C Certification:

Mirror Descent Policy Optimisation for Robust Constrained Markov Decision Processes

David Mark Bossens, Atsushi Nitanda

https://openreview.net/forum?id=tmfdqtFUqO

#adversarial #robustness #optimise

0 0 0 0

@taeseokmoon.bsky.social

3 months ago

The most impressive feature of our microbial #consortium is its #robustness. It can efficiently process #mixed #plastic #waste with #fluctuating plastic #compositions, maintaining its capabilities and population balance for 21 days. It's crucial for applications, where compositions of waste vary.

0 0 0 0

TPI Global Climate Transition Centre (TPI Centre) at LSE

@tpicatlse.bsky.social

3 months ago

Carbon Performance for Banks: methodology note v1.0, December 2026

Photo of Manhattan skyline

Carbon Performance for Banks: methodology note v1.0, December 2026 Photo of Manhattan skyline

Our 𝗖𝗮𝗿𝗯𝗼𝗻 𝗣𝗲𝗿𝗳𝗼𝗿𝗺𝗮𝗻𝗰𝗲 𝗳𝗼𝗿 𝗕𝗮𝗻𝗸𝘀 assessments are guided by key design principles of #transparency, #accountability and #robustness, essential for ensuring the #credibility of the Centre’s assessment process.

Check out the methodology note: www.transitionpathwayinitiative.org/publications...

3 0 0 0

Claire Nicholson

@clairendigital.bsky.social

3 months ago

I’ve been testing a prompt-level operator that acts like a soft control layer for #LLMs.

It produces a 7.4× contraction in behavioural manifolds and suppresses adversarial drift in repeated generations.

Methods + metrics👉 zenodo.org/records/1771...

#AI #PromptEngineering #Robustness #AIEvaluation

2 0 0 0

Global Advisors - Quantified Strategy

@globaladvisors.bsky.social

3 months ago

“The #robustness of people is really staggering.” - Ilya #Sutskever - Safe #Superintelligence

Understanding what Sutskever means by robustness requires examining not just human capabilities but the specific ways in which #AI systems are fragile by comparison... - https://with.ga/fjaz5
#quote

1 0 0 0

@marcwilson1000.bsky.social

3 months ago

“The #robustness of people is really staggering.” - Ilya #Sutskever - Safe #Superintelligence

Understanding what Sutskever means by robustness requires examining not just human capabilities but the specific ways in which #AI systems are fragile by comparison... - https://with.ga/fjaz5
#quote

1 0 0 0

TMLR Published Papers

@tmlr-pub.bsky.social

3 months ago

Certified Robustness to Data Poisoning in Gradient-Based Training

Philip Sosnin, Mark Niklas Mueller, Maximilian Baader, Calvin Tsay, Matthew Robert Wicker

Action editor: Chuan Guo

https://openreview.net/forum?id=9WHifn9ZVX

#robustness #backdoor #attacks

0 0 0 0

TMLR Published Papers

@tmlr-pub.bsky.social

4 months ago

Latent Adversarial Training Improves Robustness to Persistent Harmful Behaviors in LLMs

Abhay Sheshadri, Aidan Ewart, Phillip Huang Guo et al.

Action editor: Daphne Ippolito

https://openreview.net/forum?id=6LxMeRlkWl

#adversarial #adversary #robustness

0 0 0 0

TMLR Published Papers

@tmlr-pub.bsky.social

4 months ago

AlignFix: Fixing Adversarial Perturbations by Agreement Checking for Adversarial Robustness again...

Ashutosh Kumar Nirala, Jin Tian, Olukorede Fakorede, Modeste Atsague

Action editor: Pin-Yu Chen

https://openreview.net/forum?id=XgK05fssnx

#adversarial #adversarially #robustness

0 0 0 0

Marco Casassa Mont

@marcocasassamont.bsky.social

4 months ago

Launching the AI Model Arena The Defence AI Centre has worked with industry to develop a new tool that will help redefine how Defence evaluates and procures AI technologies.

FYI - the Defence AI Centre is launching the AI Model Arena to help redefine how Defence evaluates and procures artificial intelligence technologies ... www.gov.uk/government/n...
#DAIC #Defence #MOD #AI #AIModelArena #JSP936 #performance #reliability #robustness #security

1 0 0 0

TMLR Published Papers

@tmlr-pub.bsky.social

4 months ago

Set-Based Training for Neural Network Verification

Lukas Koller, Tobias Ladner, Matthias Althoff

Action editor: Kuldeep S. Meel

https://openreview.net/forum?id=n0lzHrAWIA

#adversarial #robustness #robust

0 0 0 0

TMLR Published Papers

@tmlr-pub.bsky.social

4 months ago

New #J2C Certification:

Are Domain Generalization Benchmarks with Accuracy on the Line Misspecified?

Olawale Elijah Salaudeen, Nicole Chiou, Shiny Weng, Sanmi Koyejo

https://openreview.net/forum?id=fNywRyqPQo

#robustness #generalization #benchmarks

1 0 0 0

TMLR Published Papers

@tmlr-pub.bsky.social

4 months ago

Model Tampering Attacks Enable More Rigorous Evaluations of LLM Capabilities

Zora Che, Stephen Casper, Robert Kirk et al.

Action editor: Chuan Sheng Foo

https://openreview.net/forum?id=E60YbLnQd2

#tampering #robustness #attacks

0 0 0 0