ΑΙhub 5小时前
Interview with Flávia Carvalhido: Responsible multimodal AI
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

本文采访了葡萄牙波尔图大学的博士生Flávia Carvalhido,她专注于负责任的多模态人工智能研究,特别是针对医疗影像报告生成中的图像-文本多模态模型进行压力测试。Carvalhido的博士研究旨在通过识别和理解这些模型的局限性,来促进其在医疗这一安全关键领域的负责任使用,并推动压力测试成为AI开发的核心流程。她分享了研究的进展、遇到的挑战(如平衡AI的快速发展与负责任AI的缓慢步伐),以及她对AI领域的热情和对未来研究的展望。Carvalhido还分享了AI研究的挑战与机遇,以及AAAI博士生研讨会的宝贵经验。

💡 **负责任的多模态AI在医疗领域的应用**:Flávia Carvalhido的研究核心是“压力测试图像-文本多模态模型在医疗影像报告生成中的应用”,旨在提升AI在医疗这一对安全性要求极高的任务中的负责任使用。通过对现有模型进行压力测试,以发现其局限性,从而促使AI在医疗领域的应用更加明智和安全。

🚀 **研究方法与领域**:Carvalhido的研究涵盖了多模态AI、医疗影像报告生成、可解释AI、模型鲁棒性、负责任AI和压力测试等多个领域。她已完成了文献综述和研究规划,并制定了为期三年的研究方法论,以回答关键的研究问题,并积极参与学术活动以拓展视野和交流思想。

🌟 **探索未开发领域与应对挑战**:Carvalhido的研究亮点在于“压力测试”这一AI领域中尚待深入探索的方向,这既带来了发现新知识和提出开创性方案的机会,也伴随着作为先驱者的挑战。她需要确保研究的原创性、高质量和有实际影响力的论证,同时还要应对AI领域快速的知识更新迭代,特别是如何在多模态生成AI的飞速发展与负责任AI研究相对缓慢的步伐之间取得平衡。

🔬 **未来研究方向**:接下来的研究将集中于对医疗影像报告生成任务中的不同方面进行探索,包括利用最先进的模型并结合XAI技术进行比较评估,以建立一个可用于压力测试方法学的基线。目标是理解生成模型的行为,并通过引入“挑战性”或“压力诱导性”输入来探测模型,记录性能和行为的变化,并进一步明确“压力诱导性输入”和“压力测试”在AI中的概念定义。

💡 **选择AI研究的动力**:Carvalhido对AI的热情源于对技术的好奇心和探索世界运作方式的渴望。她认为AI领域充满无限可能,并且在不断变化,这满足了她持续学习和发现的动力。她希望通过AI研究的发现,为世界留下积极的印记。

In this interview series, we’re meeting some of the AAAI/SIGAI Doctoral Consortium participants to find out more about their research. In this latest interview, we hear from Flávia Carvalhido who is a PhD student at the University of Porto. We find out about her work on responsible multimodal AI, what inspired her to study AI, and how she found the Doctoral Consortium experience.

Tell us a bit about your PhD – where are you studying, and what is the topic of your research?

My PhD programme is on Informatics Engineering in the Faculty of Engineering at the University of Porto, where I also got both my Bachelor’s and Master’s in the same field. My thesis research project is focused on responsible multimodal AI, titled “Stress Testing of Image-Text Multimodal Models in Medical Image Report Generation”, supervised by Professor Henrique Lopes Cardoso and Professor Vítor Cerqueira and developed in the LIACC research laboratory. It is a research work developed within the Center for Responsible AI project agenda in a collaboration managed by Professor Carlos Soares.

The goal of this research project is to enhance the responsible usage of multimodal AI for medical image report generation, a safety-critical healthcare task, by employing stress testing techniques on state-of-the-art models in order to find their limitations. By doing so, we hope to foster a more informed usage of these models and establish stress testing as a core procedure of responsible AI development.

Could you give us an overview of the research you’ve carried out so far during your PhD?

Having already defended my thesis proposal plan, I am currently at a developmental stage of the PhD work. A complete literature review has already been carried out, along with a planning stage to guide the research. I have defined a methodology plan in order to answer our research questions over the span of three years, conducting research that covers several topics: Multimodal AI, Medical Image Report Generation, Explainable AI, Model Robustness, Responsible AI, and Stress Testing. Additionally, I have taken this planning stage as an opportunity to acquire further knowledge and share my ideas with other researchers by participating in a plurality of events: summer schools, symposiums, and conferences, such as AAAI’25.

Is there an aspect of your research that has been particularly interesting?

Working on a topic that is underdeveloped in AI – stress testing – gives me a unique opportunity to navigate a novel research direction. This brings a lot of opportunities but also challenges to the table. It allows me to potentially make new discoveries and groundbreaking proposals to define this concept in AI, but I also have to be original and pioneer this research direction, conducting high-quality and well-founded research, that can have a real impact.

Moreover, staying updated with the novel discoveries in the field has given me an interesting perspective on how fast research work is carried out in multimodal generative AI. Every week, new papers are published, meaning my own research is in constant need of development and updating to be in line with the most recent findings. Given that I am working with a safety-critical multimodal healthcare task – medical image report generation – the usage of AI must follow safe and responsible principles, and many times these values aren’t thoroughly prioritized. My research aims to address this gap, but balancing the fast progress in multimodal AI with the slower pace of responsible AI research poses a key challenge in my PhD.

Flávia presenting her poster at the Doctoral Consortium at AAAI 2025.

What are your plans for building on your research so far during the PhD – what aspects will you be investigating next?

Regarding medical image report generation, the object of my PhD, I will be exploring the different aspects of the task. By using state-of-the-art models and further comparing them using XAI techniques, while evaluating their performance, I hope to establish a representative baseline of approaches on which to apply my initial stress testing methodologies. Understanding generative model behavior is our end goal, so exploring the many models used for this task will be my next step.

After doing so, I will start by probing the models with generally “challenging” or stress-inducing inputs, as our first attempt at stress testing, to then document any changes in model performance and generative behavior. Further establishing what constitutes a stress-inducing input and, overall, stress testing as a concept within AI is a crucial step in this stage.

What made you want to study AI?

AI has always been fascinating to me. I have always had this love for technology. Since I was little, I have always wanted to know how computers work, how the world works. My parents had to get me a computer really early, to get me off theirs. Driven by this curiosity, I ended up studying engineering, specifically Informatics and Computer Engineering. That was my first real contact with AI. Out of all the classes I took, any AI class would always be the one I would be most interested in. Because of all the possibilities AI could unlock, and because it is forever changing.

AI will never get boring; there will always be something new to learn, something new to discover, and that is exactly what drives me and my curiosity. The positive impact someone can have with new AI discoveries is just the kind of legacy I want to leave behind.

What advice would you give to someone thinking of doing a PhD in the field?

Generally, PhDs are a challenge and a promise to oneself. A challenge to your determination, motivation, and work. A promise to commit your everyday to your own work and career. You are your own boss, but as your boss, you have to empower yourself and push forward.

A PhD in AI has these same challenges with a twist. AI research moves fast, and so should you. Be prepared for changes and new discoveries, be adaptable and, most of all, stay focused on your goals. AI research has room for all perspectives and contributions, so if you are thinking about doing a PhD in AI, be prepared to jump right into it and make the most of it, for yourself and for research.

How was the AAAI Doctoral Consortium, and the AAAI conference experience in general?

Having attended the AAAI Doctoral Consortium was an amazing experience that opened my eyes to the multitude of different research ideas out there and different perspectives on the present and future of AI. Meeting fellow young researchers and getting to listen to experienced professionals in the area at such a massive event was an incredible and enriching experience that I will never forget. It was also my very first conference experience, and what better way than to start with one of the best and biggest conferences in AI? I hope I get the chance to attend the AAAI conference again in the near future!

Could you tell us an interesting (non-AI related) fact about you?

I live in Portugal, but I love to travel around. One would think that living in a country that has an amazing coastline, lush mountains, and a long historical background would make it less interesting to visit other countries, but I just love experiencing different cultures. I have travelled to more than 10 different European countries, most of which I visited while volunteering! My first trip outside of Europe was actually while travelling to AAAI’25 in Philadelphia, which is just an awesome city to learn about the history of the USA and its establishment as an independent country. My dream destination is Japan, simply because of the cultural clash I would experience. Bonus: I am a foodie, so travelling gives me plenty of opportunities to try dishes that I could never try at home.

About Flávia

Flávia Carvalhido is a 24-year-old PhD candidate from Porto, Portugal. She started her PhD studies in 2023, having obtained her Master’s in Informatics Engineering earlier that year. She is working under the supervision of Professor Henrique Lopes Cardoso and Professor Vítor Cerqueira as a student researcher in the LIACC laboratory. Her research areas of interest include Responsible AI, Multimodal Models, Natural Language Processing, Computer Vision, and AI for Healthcare. Aside from her academic interests, she is passionate about volunteering, having been involved in educational and environmental projects.

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

负责任AI 多模态AI 医疗影像报告生成 压力测试 Flávia Carvalhido
相关文章