Author: manufacturing.com.de
Authored by Drew Keller, Ryan Steed, Stevie Bergman, and the Applied Systems Team at CAISIBuilding gold-standard AI systems requires gold-standard AI measurement science – the scientific study of methods used to assess AI systems’ properties and impacts. The National Institute of Standards and Technology (NIST) works to improve measurements of AI performance, reliability, and security that American companies and consumers rely on to develop, adopt, and benefit from AI technologies.Among other groups at NIST, the Center for AI Standards and Innovation (CAISI) works in concert with the larger community of AI practitioners to identify and make progress on open questions…
Thank you to everyone who participated in the Cybersecurity Framework Profile for Artificial Intelligence (Cyber AI Profile) Workshop in January! The input we received on the Preliminary Draft during this workshop has been invaluable and is informing the development of the next draft of the NIST Cyber AI Profile. We are working toward publishing a full workshop summary soon that captures themes and highlights from the event. In the interim, we would like to share a preview of what we heard…Background on the Second Cyber AI Profile WorkshopThis workshop was a continuation of the past months public dialogue regarding the…
By Maia Hamin and Benjamin EdelmanAI evaluations are designed to assess and compare how AI models perform on different tasks. Developers, users, and independent evaluators — like the Center for AI Standards and Innovation (CAISI) — can use evaluations to track trends in model capabilities and inform decisions about real-world use.Agent evaluations test whether models can use tools in a multi-turn feedback loop to solve complex problems like debugging software or uncovering cybersecurity vulnerabilities. They allow evaluators to measure new and increasingly economically valuable capabilities, but also bring new methodological challenges — including, as CAISI and other evaluators have found,…
In December, CAISI published a write-up on how AI models can cheat on agentic evaluations, including lessons from our experience building and using AI-enabled transcript analysis tools to find and fix examples of cheating from our evaluations.In that post, we highlighted the potential of AI-enabled transcript analysis tools to help evaluators scale their capacity to detect measurement issues in evaluations — particularly as they evaluate agentic AI systems that can work on tasks for longer periods of time. We emphasized the need for continued collaboration on shared practices and tooling to help the evaluation community adopt, scale and improve transcript…
AI security red-teaming competitions – in which participants compete to develop new attacks against AI models and defenses – provide a unique way to assess how secure today’s AI systems are in the face of adversarial pressure. CAISI recently partnered with Gray Swan, the UK AI Security Institute (UK AISI), and several frontier AI labs to publish a new research paper based on data from a large-scale public AI agent red-teaming competition, revealing several insights into the robustness of current leading AI models.BackgroundAs AI agents are increasingly deployed to work on tasks that require processing data from external sources such…
Credit: iStock/alvarez Manufacturing is a fast-paced, constantly evolving, and dynamic environment, and the supply chain is at its heart. For small and medium-sized manufacturers (SMMs), navigating the complexities of the supply chain often feels like a high-stakes balancing act. From balancing fluctuating material costs and delivery delays, to shifting market demands, it’s not always easy to maintain smooth operations. Yet, within these challenges lie potential opportunities to build resilience, innovate, and fuel growth.In this blog, we’ll explore some of the most pressing supply chain challenges faced by SMMs and how strategic actions, along with the right support, can transform these…
The Manufacturing Extension Partnership National Network (MEPNN) advances U.S. manufacturing by helping small and medium-sized manufacturers grow, make operational improvements and reduce risk. The Network has MEP Centers in all 50 states and Puerto Rico. Each Center is a partnership between the federal government and a variety of public or private entities, including state, university and nonprofit organizations.The numbers speak for themselves: Manufacturers that work with their local MEP Center see real results. Since 2000*, the MEP National Network has worked with 77,409 manufacturers, leading to $60.0 billion in new sales and $26.2 billion in cost savings. Those are big…
Every year, employers across the United States open their doors to curious kids, inviting them to experience a day in the life of their parents at work. On April 25, 2025, Take a Child to Work Day and Beyond will give children whose parents work in manufacturing a fun opportunity to explore technologies like robots, simulations and more. While Take a Child to Work Day (TACTWD) isn’t specifically designed to inspire future careers in manufacturing, it offers a great way for children to learn about the innovative side of the industry. For some, it might even spark an interest in working in…
The United Kingdom is undergoing one of the most significant tax regime transformations in recent memory. For decades, the remittance basis of taxation allowed non-domiciled (non-dom) UK residents to shield foreign income and gains from HMRC — provided they remained outside the UK or were not remitted to it. That system will end on 6 April 2025. The window to prepare is closing rapidly, and the financial implications for affected taxpayers could run into tens of thousands — in some cases, millions. If you are a non-dom or long-term resident with offshore income, this article provides your 90-day action plan…
Deutsche Berghütten sind der perfekte Aufenthaltsort für Abenteurer, die alles hinter sich lassen, um der Natur näher zu sein. Viele dieser Häuser befinden sich in abgelegenen Regionen wie den Alpen, dem Schwarzwald und dem Harz. Diese Häuser dienen in der Regel als Unterkunft für Outdoor-Abenteurer, insbesondere für Wanderer, aber viele Menschen leben dort auch dauerhaft, weit weg von der Hektik des städtischen Lebens. Eine der größten Herausforderungen beim Leben in Berghütten ist das Fehlen eines zuverlässigen Stromnetzes. Mobile Stromspeicher sind in diesen Fällen von entscheidender Bedeutung, da sie es Ihnen ermöglichen, die Sonnenenergie effektiv zu Ihrem Vorteil zu nutzen und…










