Generate Smarter Generative AI Outputs

school 4 activities

update Last updated 6 months

person Managed by Google Cloud

This learning path is for developers who want to build applications with generative AI. Learn how to develop an AI project on Google Cloud, use diffusion models for image generation, and build a search application with Vector Search and embeddings. Then, dive deeper into multimodal prompts and multimodal RAG with Gemini to generate text and visual data.

Start learning path

Activity Thumbnail for Introduction to AI and Machine Learning on Google Cloud

01 Introduction to AI and Machine Learning on Google Cloud

book Course

access_time 8 hours

show_chart Introductory

This course introduces the AI and machine learning (ML) offerings on Google Cloud that build both predictive and generative AI projects. It explores the technologies, products, and tools available throughout the data-to-AI life cycle, encompassing AI foundations, development, and solutions....

Start course

Activity Thumbnail for Introduction to Image Generation

02 Introduction to Image Generation

book Course

access_time 30 minutes

show_chart Introductory

This course introduces diffusion models, a family of machine learning models that recently showed promise in the image generation space. Diffusion models draw inspiration from physics, specifically thermodynamics. Within the last few years, diffusion models became popular in both research...

Start course

Activity Thumbnail for Vector Search and Embeddings

03 Vector Search and Embeddings

book Course

access_time 2 hours

show_chart Intermediate

This course introduces Vertex AI Vector Search and describes how it can be used to build a search application with large language model (LLM) APIs for embeddings. The course consists of conceptual lessons on vector search and text embeddings, practical...

Start course

Activity Thumbnail for Inspect Rich Documents with Gemini Multimodality and Multimodal RAG

04 Inspect Rich Documents with Gemini Multimodality and Multimodal RAG

book Course

access_time 4 hours 45 minutes

show_chart Intermediate

Complete the intermediate Inspect Rich Documents with Gemini Multimodality and Multimodal RAG skill badge to demonstrate skills in the following: using multimodal prompts to extract information from text and visual data, generating a video description, and retrieving extra information beyond...

Start course

Google Cloud Skills Boost

Generate Smarter Generative AI Outputs