MarkTechPost@AI · July 1, 09:40
Building Advanced Multi-Agent AI Workflows by Leveraging AutoGen and Semantic Kernel

This article shows how to combine the AutoGen framework with Semantic Kernel and Google's Gemini Flash model to build an advanced AI assistant. By setting up a GeminiWrapper and a SemanticKernelGeminiPlugin, Gemini's generative capabilities are bridged into AutoGen's multi-agent orchestration. The article demonstrates how to configure specialist agents, from a code reviewer to a creative analyst, using AutoGen's ConversableAgent API together with Semantic Kernel functions for text analysis, summarization, code review, and creative problem-solving. Combining AutoGen's robust agent framework with Semantic Kernel's function-driven approach yields an advanced AI assistant that adapts to a variety of tasks and delivers structured, actionable insights.

⚙️ First, the article covers installing the necessary dependencies (pyautogen, semantic-kernel, google-generativeai, and python-dotenv), ensuring all the libraries required for the multi-agent and semantic-function setup are in place.

🔑 Next, a GEMINI_API_KEY is defined and the genai client is configured so that subsequent Gemini calls are authenticated. A config_list is then built with the Gemini Flash model settings, including the model name, API key, endpoint type, and base URL, which is handed off to the agents for LLM interactions.

💻 The GeminiWrapper class then encapsulates all Gemini Flash interactions: it initializes a GenerativeModel and exposes a generate_response method that passes the prompt and temperature to Gemini's generate_content API and returns the raw text or a formatted error.

💡 The Semantic Kernel logic is encapsulated in the SemanticKernelGeminiPlugin, which initializes a Kernel and a GeminiWrapper to power custom AI functions. Methods such as analyze_text, generate_summary, code_analysis, and creative_solution are declared with the @kernel_function decorator; each builds a structured prompt and delegates the heavy lifting to Gemini Flash.

🤖 Finally, the article introduces the AdvancedGeminiAgent class, which initializes the SemanticKernelGeminiPlugin and GeminiWrapper and sets up several AutoGen agents, including GeminiAssistant, GeminiCodeReviewer, GeminiCreativeAnalyst, GeminiDataSpecialist, and UserProxy, each with its own system message and role.

In this tutorial, we walk you through the seamless integration of AutoGen and Semantic Kernel with Google’s Gemini Flash model. We begin by setting up our GeminiWrapper and SemanticKernelGeminiPlugin classes to bridge the generative power of Gemini with AutoGen’s multi-agent orchestration. From there, we configure specialist agents, ranging from code reviewers to creative analysts, demonstrating how we can leverage AutoGen’s ConversableAgent API alongside Semantic Kernel’s decorated functions for text analysis, summarization, code review, and creative problem-solving. By combining AutoGen’s robust agent framework with Semantic Kernel’s function-driven approach, we create an advanced AI assistant that adapts to a variety of tasks with structured, actionable insights.

!pip install pyautogen semantic-kernel google-generativeai python-dotenv

import os
import asyncio
from typing import Dict, Any, List

import autogen
import google.generativeai as genai
from semantic_kernel import Kernel
from semantic_kernel.functions import KernelArguments
from semantic_kernel.functions.kernel_function_decorator import kernel_function

We start by installing the core dependencies: pyautogen, semantic-kernel, google-generativeai, and python-dotenv, ensuring we have all the necessary libraries for our multi-agent and semantic function setup. Then we import essential Python modules (os, asyncio, typing) along with autogen for agent orchestration, genai for Gemini API access, and the Semantic Kernel classes and decorators to define our AI functions.

GEMINI_API_KEY = "Use Your API Key Here" genai.configure(api_key=GEMINI_API_KEY)config_list = [   {       "model": "gemini-1.5-flash",       "api_key": GEMINI_API_KEY,       "api_type": "google",       "api_base": "https://generativelanguage.googleapis.com/v1beta",   }]

We define our GEMINI_API_KEY placeholder and immediately configure the genai client so all subsequent Gemini calls are authenticated. Then we build a config_list containing the Gemini Flash model settings (model name, API key, endpoint type, and base URL), which we'll hand off to our agents for LLM interactions.
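Since python-dotenv is among the installed dependencies but the snippet above hard-codes the key, a safer variant is to load it from the environment. The following is a minimal sketch, assuming a .env file that defines GEMINI_API_KEY (the file and variable names are our choices, not from the original tutorial):

# Minimal sketch: load the Gemini key from a .env file instead of hard-coding it.
# Assumes a .env file containing a line such as: GEMINI_API_KEY=your-key-here
import os
from dotenv import load_dotenv
import google.generativeai as genai

load_dotenv()  # reads .env into the process environment
GEMINI_API_KEY = os.getenv("GEMINI_API_KEY")
if not GEMINI_API_KEY:
    raise RuntimeError("GEMINI_API_KEY not found; add it to .env or export it")
genai.configure(api_key=GEMINI_API_KEY)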

class GeminiWrapper:
    """Wrapper for Gemini API to work with AutoGen"""

    def __init__(self, model_name="gemini-1.5-flash"):
        self.model = genai.GenerativeModel(model_name)

    def generate_response(self, prompt: str, temperature: float = 0.7) -> str:
        """Generate response using Gemini"""
        try:
            response = self.model.generate_content(
                prompt,
                generation_config=genai.types.GenerationConfig(
                    temperature=temperature,
                    max_output_tokens=2048,
                )
            )
            return response.text
        except Exception as e:
            return f"Gemini API Error: {str(e)}"

We encapsulate all Gemini Flash interactions in a GeminiWrapper class, where we initialize a GenerativeModel for our chosen model and expose a simple generate_response method. In this method, we pass the prompt and temperature into Gemini’s generate_content API (capped at 2048 tokens) and return the raw text or a formatted error.
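As a quick smoke test (our own illustrative snippet, not part of the original tutorial), we can instantiate the wrapper and issue a one-off prompt once genai has been configured:

# Hypothetical smoke test: verify the wrapper returns text before wiring up agents.
wrapper = GeminiWrapper()
reply = wrapper.generate_response("Explain AutoGen in one sentence.", temperature=0.2)
print(reply)  # prints either model output or a formatted "Gemini API Error: ..." string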

class SemanticKernelGeminiPlugin:
    """Semantic Kernel plugin using Gemini Flash for advanced AI operations"""

    def __init__(self):
        self.kernel = Kernel()
        self.gemini = GeminiWrapper()

    @kernel_function(name="analyze_text", description="Analyze text for sentiment and key insights")
    def analyze_text(self, text: str) -> str:
        """Analyze text using Gemini Flash"""
        prompt = f"""
        Analyze the following text comprehensively:

        Text: {text}

        Provide analysis in this format:
        - Sentiment: [positive/negative/neutral with confidence]
        - Key Themes: [main topics and concepts]
        - Insights: [important observations and patterns]
        - Recommendations: [actionable next steps]
        - Tone: [formal/informal/technical/emotional]
        """
        return self.gemini.generate_response(prompt, temperature=0.3)

    @kernel_function(name="generate_summary", description="Generate comprehensive summary")
    def generate_summary(self, content: str) -> str:
        """Generate summary using Gemini's advanced capabilities"""
        prompt = f"""
        Create a comprehensive summary of the following content:

        Content: {content}

        Provide:
        1. Executive Summary (2-3 sentences)
        2. Key Points (bullet format)
        3. Important Details
        4. Conclusion/Implications
        """
        return self.gemini.generate_response(prompt, temperature=0.4)

    @kernel_function(name="code_analysis", description="Analyze code for quality and suggestions")
    def code_analysis(self, code: str) -> str:
        """Analyze code using Gemini's code understanding"""
        prompt = f"""
        Analyze this code comprehensively:

        ```
        {code}
        ```

        Provide analysis covering:
        - Code Quality: [readability, structure, best practices]
        - Performance: [efficiency, optimization opportunities]
        - Security: [potential vulnerabilities, security best practices]
        - Maintainability: [documentation, modularity, extensibility]
        - Suggestions: [specific improvements with examples]
        """
        return self.gemini.generate_response(prompt, temperature=0.2)

    @kernel_function(name="creative_solution", description="Generate creative solutions to problems")
    def creative_solution(self, problem: str) -> str:
        """Generate creative solutions using Gemini's creative capabilities"""
        prompt = f"""
        Problem: {problem}

        Generate creative solutions:
        1. Conventional Approaches (2-3 standard solutions)
        2. Innovative Ideas (3-4 creative alternatives)
        3. Hybrid Solutions (combining different approaches)
        4. Implementation Strategy (practical steps)
        5. Potential Challenges and Mitigation
        """
        return self.gemini.generate_response(prompt, temperature=0.8)

We encapsulate our Semantic Kernel logic in the SemanticKernelGeminiPlugin, where we initialize both the Kernel and our GeminiWrapper to power custom AI functions. Using the @kernel_function decorator, we declare methods like analyze_text, generate_summary, code_analysis, and creative_solution, each of which constructs a structured prompt and delegates the heavy lifting to Gemini Flash. This plugin lets us seamlessly register and invoke advanced AI operations within our Semantic Kernel environment.
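In this tutorial the plugin's methods are called directly as plain Python; if you want the Kernel itself to route the calls, you can register the plugin first. Below is a minimal sketch assuming the semantic-kernel 1.x Python API (the plugin name gemini_ops is our own placeholder; verify the exact invoke signature against your installed version):

# Hypothetical registration sketch (semantic-kernel 1.x API; verify against your version).
import asyncio
from semantic_kernel.functions import KernelArguments

async def demo_kernel_invoke():
    plugin = SemanticKernelGeminiPlugin()
    # Register the @kernel_function-decorated methods under a plugin name.
    plugin.kernel.add_plugin(plugin, plugin_name="gemini_ops")
    # Route the call through the kernel instead of calling the method directly.
    result = await plugin.kernel.invoke(
        plugin_name="gemini_ops",
        function_name="analyze_text",
        arguments=KernelArguments(text="Gemini Flash pairs well with AutoGen."),
    )
    print(result)

# In a script: asyncio.run(demo_kernel_invoke()); in a notebook cell: await demo_kernel_invoke()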

class AdvancedGeminiAgent:
    """Advanced AI Agent using Gemini Flash with AutoGen and Semantic Kernel"""

    def __init__(self):
        self.sk_plugin = SemanticKernelGeminiPlugin()
        self.gemini = GeminiWrapper()
        self.setup_agents()

    def setup_agents(self):
        """Initialize AutoGen agents with Gemini Flash"""
        gemini_config = {
            "config_list": [
                {
                    "model": "gemini-1.5-flash",
                    "api_key": GEMINI_API_KEY,
                    "api_type": "google",  # match the top-level config_list so AutoGen routes to Gemini
                }
            ],
            "temperature": 0.7,
        }

        self.assistant = autogen.ConversableAgent(
            name="GeminiAssistant",
            llm_config=gemini_config,
            system_message="""You are an advanced AI assistant powered by Gemini Flash with Semantic Kernel capabilities.
            You excel at analysis, problem-solving, and creative thinking. Always provide comprehensive, actionable insights.
            Use structured responses and consider multiple perspectives.""",
            human_input_mode="NEVER",
        )

        self.code_reviewer = autogen.ConversableAgent(
            name="GeminiCodeReviewer",
            llm_config={**gemini_config, "temperature": 0.3},
            system_message="""You are a senior code reviewer powered by Gemini Flash.
            Analyze code for best practices, security, performance, and maintainability.
            Provide specific, actionable feedback with examples.""",
            human_input_mode="NEVER",
        )

        self.creative_analyst = autogen.ConversableAgent(
            name="GeminiCreativeAnalyst",
            llm_config={**gemini_config, "temperature": 0.8},
            system_message="""You are a creative problem solver and innovation expert powered by Gemini Flash.
            Generate innovative solutions and provide fresh perspectives.
            Balance creativity with practicality.""",
            human_input_mode="NEVER",
        )

        self.data_specialist = autogen.ConversableAgent(
            name="GeminiDataSpecialist",
            llm_config={**gemini_config, "temperature": 0.4},
            system_message="""You are a data analysis expert powered by Gemini Flash.
            Provide evidence-based recommendations and statistical perspectives.""",
            human_input_mode="NEVER",
        )

        self.user_proxy = autogen.ConversableAgent(
            name="UserProxy",
            human_input_mode="NEVER",
            max_consecutive_auto_reply=2,
            is_termination_msg=lambda x: x.get("content", "").rstrip().endswith("TERMINATE"),
            llm_config=False,
        )

    def analyze_with_semantic_kernel(self, content: str, analysis_type: str) -> str:
        """Bridge function between AutoGen and Semantic Kernel with Gemini"""
        try:
            if analysis_type == "text":
                return self.sk_plugin.analyze_text(content)
            elif analysis_type == "code":
                return self.sk_plugin.code_analysis(content)
            elif analysis_type == "summary":
                return self.sk_plugin.generate_summary(content)
            elif analysis_type == "creative":
                return self.sk_plugin.creative_solution(content)
            else:
                return "Invalid analysis type. Use 'text', 'code', 'summary', or 'creative'."
        except Exception as e:
            return f"Semantic Kernel Analysis Error: {str(e)}"

    def multi_agent_collaboration(self, task: str) -> Dict[str, str]:
        """Orchestrate multi-agent collaboration using Gemini"""
        results = {}

        agents = {
            "assistant": (self.assistant, "comprehensive analysis"),
            "code_reviewer": (self.code_reviewer, "code review perspective"),
            "creative_analyst": (self.creative_analyst, "creative solutions"),
            "data_specialist": (self.data_specialist, "data-driven insights"),
        }

        for agent_name, (agent, perspective) in agents.items():
            try:
                prompt = f"Task: {task}\n\nProvide your {perspective} on this task."
                response = agent.generate_reply([{"role": "user", "content": prompt}])
                results[agent_name] = response if isinstance(response, str) else str(response)
            except Exception as e:
                results[agent_name] = f"Agent {agent_name} error: {str(e)}"

        return results

    def run_comprehensive_analysis(self, query: str) -> Dict[str, Any]:
        """Run comprehensive analysis using all Gemini-powered capabilities"""
        results = {}

        analyses = ["text", "summary", "creative"]
        for analysis_type in analyses:
            try:
                results[f"sk_{analysis_type}"] = self.analyze_with_semantic_kernel(query, analysis_type)
            except Exception as e:
                results[f"sk_{analysis_type}"] = f"Error: {str(e)}"

        try:
            results["multi_agent"] = self.multi_agent_collaboration(query)
        except Exception as e:
            results["multi_agent"] = f"Multi-agent error: {str(e)}"

        try:
            results["direct_gemini"] = self.gemini.generate_response(
                f"Provide a comprehensive analysis of: {query}", temperature=0.6
            )
        except Exception as e:
            results["direct_gemini"] = f"Direct Gemini error: {str(e)}"

        return results

We implement our end-to-end AI orchestration in the AdvancedGeminiAgent class, where we initialize our Semantic Kernel plugin and Gemini wrapper and configure a suite of specialist AutoGen agents (assistant, code reviewer, creative analyst, data specialist, and user proxy). With simple methods for Semantic Kernel bridging, multi-agent collaboration, and direct Gemini calls, we enable a seamless, comprehensive analysis pipeline for any user query.
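Before running the full demo below, a minimal invocation might look like this (our own usage sketch; it assumes a valid API key has already been configured):

# Hypothetical minimal usage of the orchestrator class defined above.
agent = AdvancedGeminiAgent()
report = agent.run_comprehensive_analysis("How can AI transform education?")
for section, output in report.items():
    # Sections map to pipeline stages: sk_text, sk_summary, sk_creative, multi_agent, direct_gemini.
    print(section, str(output)[:120])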

def main():
    """Main execution function for Google Colab with Gemini Flash"""
    print("Initializing Advanced Gemini Flash AI Agent...")
    print("Using Gemini 1.5 Flash for high-speed, cost-effective AI processing")

    try:
        agent = AdvancedGeminiAgent()
        print("Agent initialized successfully!")
    except Exception as e:
        print(f"Initialization error: {str(e)}")
        print("Make sure to set your Gemini API key!")
        return

    demo_queries = [
        "How can AI transform education in developing countries?",
        "def fibonacci(n): return n if n <= 1 else fibonacci(n-1) + fibonacci(n-2)",
        "What are the most promising renewable energy technologies for 2025?",
    ]

    print("\nRunning Gemini Flash Powered Analysis...")

    for i, query in enumerate(demo_queries, 1):
        print(f"\n{'='*60}")
        print(f"Demo {i}: {query}")
        print('='*60)

        try:
            results = agent.run_comprehensive_analysis(query)

            for key, value in results.items():
                if key == "multi_agent" and isinstance(value, dict):
                    print(f"\n{key.upper().replace('_', ' ')}:")
                    for agent_name, response in value.items():
                        print(f"   {agent_name}: {str(response)[:200]}...")
                else:
                    print(f"\n{key.upper().replace('_', ' ')}:")
                    print(f"   {str(value)[:300]}...")

        except Exception as e:
            print(f"Error in demo {i}: {str(e)}")

    print(f"\n{'='*60}")
    print("Gemini Flash AI Agent Demo Completed!")
    print("To use your own key, replace the 'Use Your API Key Here' placeholder above")
    print("Get your free Gemini API key at: https://makersuite.google.com/app/apikey")


if __name__ == "__main__":
    main()

Finally, we run the main function that initializes the AdvancedGeminiAgent, prints out status messages, and iterates through a set of demo queries. As we run each query, we collect and display results from semantic-kernel analyses, multi-agent collaboration, and direct Gemini responses, ensuring a clear, step-by-step showcase of our multi-agent AI workflow.

In conclusion, we showcased how AutoGen and Semantic Kernel complement each other to produce a versatile, multi-agent AI system powered by Gemini Flash. We highlighted how AutoGen simplifies the orchestration of diverse expert agents, while Semantic Kernel provides a clean, declarative layer for defining and invoking advanced AI functions. By uniting these tools in a Colab notebook, we’ve enabled rapid experimentation and prototyping of complex AI workflows without sacrificing clarity or control.


