SCIoT 2025 July 7-9, 2025 in Dallas, USA. The 2nd International Symposium on System and Software Reliability.

SCIoT 2025 Keynote 1

Test or Trust? Why LLM Applications and Generative AI Need Bulletproof Security


Abstract


As Large Language Models (LLMs) rapidly transform software development and deployment, a critical question emerges: Can we trust these powerful AI systems, and how rigorously must they be tested? This talk explores the urgent need for specialized security testing in LLM applications. An example for testing will be AI-ALT (Automated LLM Testing), a novel platform designed to detect and prevent vulnerabilities in generative AI systems. This platform was sponsored by Secure IVAI over the past year as part of SE4485.

We'll dive into the unique security challenges posed by LLMs, including the OWASP Top 10 vulnerabilities specific to AI applications, from prompt injection attacks and data poisoning to unauthorized information disclosure and excessive agency risks. Through live demonstrations and real-world examples, you'll see how seemingly innocent interactions can expose critical vulnerabilities that traditional security testing misses.

Perfect for software engineering students interested in the intersection of AI, security, and software architecture, this talk will equip you with practical knowledge about:

Join us to discover why "move fast and break things" doesn't work when the "things" are AI systems that millions depend on, and learn how the next generation of security tools can help build a safer AI-powered future.

Speaker


Mark Bentsen's avatar
Dr. Mark Bentsen USA

Director of AI

Secure IV AI


Mark Bentsen leads AI adoption and quality assurance, drawing on 20 years of experience in industry and academia. At Secure IVAI, which he co-founded, he guides companies in responsible AI implementation, building on his success integrating generative AI and leading large engineering teams at FedEx and with a finance and healthcare solutions company.

He shapes academic AI research through university partnerships while advancing AI literacy through conference presentations and corporate training. He holds the CTAL (Full) certification from the International Software Testing Qualifications Board (ISTQB) and is a certified Project Management Professional (PMP).