Large Multimodal Model Prompting with Gemini

Course Syllabus

What You'll Learn

About This Course

This course explores how to utilize Google's Gemini model family to build powerful multimodal applications that combine text, images, and videos. Through hands-on examples, you will learn how to optimize prompts, leverage cross-modal reasoning, and integrate real-time data for dynamic and interactive applications.

Note: Due to technical requirements, downloadable notebooks are provided to enable hands-on practice.

Course Outline

Who Should Join?

This course is for developers aiming to build advanced multimodal applications using text, images, and videos. Prior experience with AI and basic programming knowledge is recommended.