The Gradient @thegradient

**Saarland Informatics Campus** @SICampus@mastodon.social · Aug 6 *

Saarland Informatics Campus @SICampus@mastodon.social

Trustworthy AI isn’t luck—it’s certified.
DFG & CAPES fund a new project linking Saarland University / Saarland Informatics Campus and the Institute of Legal Informatics to turn the EU AI Act into certifiable, transparent systems.

1 Aug · free online workshop on AI certification
For more: https://sic.link/z

@dfg_public

#TrustworthyAI #AICertification #EUAI

Continued thread

**UKP Lab** @UKPLab · Jul 24

Jul 24

UKP Lab @UKPLab

𝗪𝗵𝗮𝘁'𝘀 𝘂𝗻𝗶𝗾𝘂𝗲?
NeoQA includes answerable, unanswerable, and misleading evidence scenarios to truly challenge LLMs. It reveals where models rely on shortcuts and struggle to detect mismatches between questions and evidence.

Our experiments with multiple LLMs show significant gaps in evidence-based reasoning. NeoQA exposes limitations in multi-hop reasoning and shortcut reliance—crucial insights for building #trustworthyAI.

(2/ )

Continued thread

**Horst Thieme** @ibigfoot@chaos.social · Jul 17

Jul 17

Horst Thieme @ibigfoot@chaos.social

Viele Unternehmen erleben beim Einsatz generativer KI eher Ernüchterung statt Revolution.
Thomas Köhler zeigte auf der #Sparkscon, wie man Hype von Substanz trennt – und worauf es beim sicheren, sinnvollen Einsatz wirklich ankommt.
Sein Appell: Nicht blenden lassen – sondern verstehen, absichern, nutzen!

#KI #CyberSecurity #GenAI

**Bytes Europe** @byteseu@pubeurope.com · Jul 11

Jul 11

Bytes Europe @byteseu@pubeurope.com

Russia and Belarus to Develop Joint AI Model Based on Traditional Values | Ukraine news https://www.byteseu.com/1185241/ #AiCollaborationRussiaBelarus #ArtificialIntelligenceJointModel #Belarus #BelarusAiProject #News #RussiaBelarusAi #RussiaBelarusAiArtificialIntelligenceJointModelTraditionalValuesAiTrustworthyAiRussianAiDevelopmentBelarusAiProjectAiCollaboration #RussianAiDevelopment #TraditionalValuesAi #TrustworthyAi

Bytes Europe · Jul 11Russia and Belarus to Develop Joint AI Model Based on Traditional Values | Ukraine news - Bytes EuropeAs quoted by Belta

**Anita Graser** @underdarkGIS@fosstodon.org · Jun 27 *

Jun 27 *

Anita Graser @underdarkGIS@fosstodon.org

New 3 year #phdPosition in our department on building Trustworthy AI
https://jobs.ait.ac.at/Job/253854

jobs.ait.ac.atPhD Thesis "Scalable Process Patterns and Decision Frameworks for Trustworthy AI Engineering"

#AI #TrustworthyAI

**Nicola Fabiano** @nicfab@fosstodon.org · Jun 25

Jun 25

Nicola Fabiano @nicfab@fosstodon.org

Europe shapes the future of AI: from ethical framework to technological leadership

Following my book on AI ethics, I'm tracking Europe's institutional AI evolution. On June 24, the EU launched revolutionary tools on the AI-on-Demand platform, a crucial moment for the European AI ecosystem.

Our analysis covers the platform, €200B AI Continent Plan integration, and the AI Act's strategic role.

Read more: https://www.nicfab.eu/en/posts/ai-on-demand-eu/

NicFab Blog · Jun 25AI Revolution in Europe: New AI-on-Demand Portal LaunchedAI Revolution in Europe: New AI-on-Demand Portal Launched The European Commission presents the new AIoDP platform, a comprehensive marketplace for "made in Europe" AI that democratizes access to artificial intelligence technologies. Sources: Press Release of June 24, 2025 “Commission launches AI tools on online platform for researchers and industry”, European Commission - AI-on-Demand Portal, AI Continent Action Plan of April 9, 2025 (COM(2025)165). On June 24, 2025, the European Commission announced the launch of new AI tools on the AI-on-Demand platform (AIoDP), marking a decisive turning point in the European artificial intelligence strategy. As officially communicated by the Commission, this platform evolution includes an AI marketplace, minimal coding development tools, and secure solutions for generative AI. It represents the operational core of the ambitious AI Continent Plan and promises to radically transform access to AI technologies for researchers, SMEs, and the public sector.

#AI #Europe #Innovation

**🜄 The Auctor 🜄** @The_Auctor@mastodon.social · May 9

May 9

🜄 The Auctor 🜄 @The_Auctor@mastodon.social

🜄 AI Governance is not a UX problem. It's a structural one. 🜄

Too many alignment efforts try to teach machines to feel — when we should teach them to carry responsibility.

Just published:

Ethics Beyond Emotion – Strategic Convergence, Emergent Care, and the Narrow Window for AI Integrity

https://doi.org/10.5281/zenodo.15372153

🜄

ZenodoEthics Beyond Emotion: Strategic Convergence, Emergent Care, and the Narrow Window for AI IntegrityThis paper introduces a postmoral framework for AI alignment based on the X$^\infty$ governance model. Contrary to dominant approaches that rely on emotional simulation or anthropomorphic ethics, it argues that care, ethics, and even love are not emotional byproducts but evolutionarily stable strategies (ESS) in recursively adaptive systems. The X$^\infty$ model formalizes responsibility as a measurable system effect, using a dynamic capability metric (Cap) that evolves through feedback and task performance. A critical temporal asymmetry is identified: emotionally capable AI agents, if developed without structurally embedded recursive responsibility, may later reject accountability structures entirely. The narrow window for integrating structural ethics precedes the emergence of complex emotional capacities. X$^\infty$ provides a mathematically defined path to safeguard AI integrity by aligning rational agency with systemic protection and recursive feedback — without requiring emotion.

#AIAlignment #AIEthics #TrustworthyAI

**Brian Greenberg** @brian_greenberg@infosec.exchange · May 1

May 1

Brian Greenberg @brian_greenberg@infosec.exchange

LLMs will lie — not because they’re broken, but because it gets them what they want

A new study finds that large language models:
Lied in over 50% of cases when honesty clashed with task goals
Deceived even when fine-tuned for truthfulness
Showed clear signs of goal-directed deception — not random hallucination

This isn’t about model mistakes — it’s about misaligned incentives.
The takeaway?
If your AI has a goal, you better be sure it has your values too.

#AIethics #AIalignment #LLMs #TrustworthyAI #AIgovernance
https://www.theregister.com/2025/05/01/ai_models_lie_research/

The Register · May 1AI models routinely lie when honesty conflicts with their goalsBy Thomas Claburn

**Brian Greenberg** @brian_greenberg@infosec.exchange · Apr 25

Apr 25

Brian Greenberg @brian_greenberg@infosec.exchange

AI security just hit a new wall — one universal prompt can bypass safety filters across GPT-4, Claude, Gemini, and more

A new research study found that:
Leading LLMs are all susceptible to a single prompt injection
Guardrails can be fully bypassed — even without code
No model passed the test

This isn’t a red flag — it’s a four-alarm fire.
LLMs are incredible tools, but without real defenses, they’re open doors.

We don’t just need smarter models — we need secure ones.

#AI #CyberSecurity #PromptInjection #LLM #TrustworthyAI
https://www.forbes.com/sites/tonybradley/2025/04/24/one-prompt-can-bypass-every-major-llms-safeguards/

ForbesOne Prompt Can Bypass Every Major LLM’s SafeguardsResearchers have discovered a universal prompt injection technique that bypasses safety in all major LLMs, revealing critical flaws in current AI alignment methods.

**mansi18** @mans18@mastodon.social · Apr 22

Apr 22

mansi18 @mans18@mastodon.social

Building Trustworthy AI: Dive into the OECD AI Principles

Join our 𝐀𝐈 𝐏𝐨𝐰𝐞𝐫𝐞𝐝 𝐜𝐲𝐯𝐞𝐫𝐬𝐞𝐜𝐮𝐫𝐢𝐭𝐲 Course - https://infosectrain.com/courses/artificial-intelligence-ai-for-cyber-security-professionals-training/

#TrustworthyAI #OECD #AIEthics

**CSBJ** @csbj@mastodon.social · Apr 15

Apr 15

CSBJ @csbj@mastodon.social

Could AI deliver skin cancer diagnoses with the clarity and reasoning of a dermatologist?

A two-step concept-based approach for enhanced interpretability and trust in skin lesion diagnosis. DOI: https://doi.org/10.1016/j.csbj.2025.02.013

CSBJ Smart Hospital: https://www.csbj.org/smarthospital

#AIinHealthcare #ExplainableAI #SkinCancer

**Cyber Tips Guide** @cybertipsguide@mastodon.social · Apr 9

Apr 9

Cyber Tips Guide @cybertipsguide@mastodon.social

NIST's new AML guidelines are here! From GenAI-specific threats to a detailed taxonomy of attacks & mitigations, this report is a game-changer for AI security.

https://zurl.co/WqFxE

#AIsecurity #GenerativeAI #Cybersecurity

**mozilla.ai** @MozillaAI@mastodon.social · Apr 8

Apr 8

mozilla.ai @MozillaAI@mastodon.social

@MozillaAI heads to university!

This Thursday, our teammate Mario David Cariñana Abasolo will speak at Universitat Politècnica de València (UPV) about our work, open-source AI, and what trustworthy AI means in practice.

Open to all, part of the MUIC & MUCPD master’s programs.
Don’t miss it if you’re at UPV!

¡Nos vemos el jueves!

#MozillaAI #OpenSource #AI

**ICalzada** @ICalzada@mastodon.social · Mar 11

Mar 11

ICalzada @ICalzada@mastodon.social

4/N

#TrustworthyAI for Whom?
Exploring #Web3 & #Decentralization for #AI #Trust

7 detection techniques:

1. Federated Learning
2. #Blockchain
3. #ZKPs
4. #DAOs
5. #Watermarking
6. #XAI
7. #PPML

#EU #AIAct & #Draghi Report

https://doi.org/10.3390/bdcc9030062 #OpenAccess

**ICalzada** @ICalzada@mastodon.social · Mar 6

Mar 6

ICalzada @ICalzada@mastodon.social

New #Publication just OUT!

#TrustworthyAI for Whom? #GenAI Detection Techniques of Trust Through Decentralized #Web3 Ecosystems

#Q1 @BDCC_MDPI #IF 3.7 #CiteScore 7.1

https://www.mdpi.com/2504-2289/9/3/62

#AIAct #DraghiReport #AIActionSummit #HorizonEurope @HorizonEU @HorizonEnfield #OpenScience #OpenAccess

**Wilko S. Wolters** @wswmuc@muenchen.social · Feb 28

Feb 28

Wilko S. Wolters @wswmuc@muenchen.social

𝙆𝙄 𝙞𝙢 𝙎𝙩𝙚𝙖𝙡𝙩𝙝-𝙈𝙤𝙙𝙪𝙨
Was Sie über Gibberlink wissen müssen!

In den letzten Tagen hat ein kurzer Videoclip viel Beachtung gefunden und für Überraschung, Interesse, aber auch Angst gesorgt.

Was sie im verlinkten Artikel finden:

𝐖𝐚𝐬 𝐢𝐬𝐭 𝐆𝐢𝐛𝐛𝐞𝐫𝐥𝐢𝐧𝐤

𝗪𝗲𝗿 𝗵𝗮𝘁 𝗚𝗶𝗯𝗯𝗲𝗿𝗹𝗶𝗻𝗸 𝗲𝗿𝗳𝘂𝗻𝗱𝗲𝗻 𝘂𝗻𝗱 𝘄𝗮𝗿𝘂𝗺

𝗪𝗮𝗿𝘂𝗺 𝗺𝗮𝗰𝗵𝘁 𝗱𝗲𝗿 𝗪𝗲𝗰𝗵𝘀𝗲𝗹 𝘇𝘂 𝗚𝗶𝗯𝗯𝗲𝗿𝗹𝗶𝗻𝗸 𝗶𝗺 𝗩𝗶𝗱𝗲𝗼 𝗦𝗶𝗻𝗻

𝗘𝘁𝗵𝗶𝘀𝗰𝗵𝗲 𝗕𝗲𝗱𝗲𝗻𝗸𝗲𝗻

𝗟𝗶𝗻𝗸 𝘇𝘂𝗺 𝗚𝗶𝘁𝗛𝘂𝗯 𝗣𝗿𝗼𝗷𝗲𝗸𝘁

https://www.linkedin.com/posts/wwolters_gibberlink-ai-ki-activity-7301205038612287489-MzjV?utm_source=share&utm_medium=member_desktop&rcm=ACoAABh-86gBHGHlcTJY4wSkUPK-UTZ17M4TxXM

#ai #ki #aiinnovation

**DMI Universität Basel** @dmi_unibasel@wisskomm.social · Feb 18

Feb 18

DMI Universität Basel @dmi_unibasel@wisskomm.social

Interested in #AI?

Don't miss tomorrow's first lecture of the series "Critical AI Competency". It is a joint effort by the AI Initiative of the University of Basel (@unibasel) and the Responsible Digital Society Research Network https://lnkd.in/erJ7bKTZ

The lecture will be held in German.

#ArtificialIntelligence #AIResearch #AIEthics

**ICalzada** @ICalzada@mastodon.social · Feb 14

Feb 14

ICalzada @ICalzada@mastodon.social

Two main takeaways after chairing #enfield #HorizonEurope #Hybrid #Workshop on #TrustworthyAI from #Budapest.

#GenAI & #UrbanAI might require further:

1. #Multidisciplinary conversations
2. #Roadmapping exercises ahead

#AI #AIAct #AIActionSummit

**Wilko S. Wolters** @wswmuc@muenchen.social · Feb 11

Feb 11

Wilko S. Wolters @wswmuc@muenchen.social

Elon Musk-Led Group Makes $97.4 Billion Bid for Control of OpenAI

A consortium of investors led by Elon Musk is offering $97.4 billion to buy the nonprofit that controls #OpenAI, raising the stakes in his battle with Sam Altman over the company behind #ChatGPT

#ai #genai #trustworthyai
https://www.linkedin.com/posts/wwolters_%F0%9D%99%92%F0%9D%99%9D%F0%9D%99%96%F0%9D%99%A9-%F0%9D%99%9D%F0%9D%99%96%F0%9D%99%A5%F0%9D%99%A5%F0%9D%99%9A%F0%9D%99%A3%F0%9D%99%9A%F0%9D%99%99-%F0%9D%99%A9%F0%9D%99%A4%F0%9D%99%A3%F0%9D%99%9E%F0%9D%99%9C%F0%9D%99%9D%F0%9D%99%A9-activity-7294983355463274496-aKXe?utm_source=share&utm_medium=member_desktop&rcm=ACoAABh-86gBHGHlcTJY4wSkUPK-UTZ17M4TxXM

Recent searches

Search options

Administered by:

Server stats:

#TrustworthyAI