Share on LinkedIn Feed Twitter Facebook

Generate Smarter Generative AI Outputs

school 4 activities
update Last updated 6 months
person Managed by Google Cloud
This learning path is for developers who want to build applications with generative AI. Learn how to develop an AI project on Google Cloud, use diffusion models for image generation, and build a search application with Vector Search and embeddings. Then, dive deeper into multimodal prompts and multimodal RAG with Gemini to generate text and visual data.
Start learning path
Activity Thumbnail for Introduction to AI and Machine Learning on Google Cloud
01 Introduction to AI and Machine Learning on Google Cloud
book Course
access_time 8 hours
show_chart Introductory

This course introduces the AI and machine learning (ML) offerings on Google Cloud that build both predictive and generative AI projects. It explores the technologies, products, and tools available throughout the data-to-AI life cycle, encompassing AI foundations, development, and solutions....

Start course
Activity Thumbnail for Introduction to Image Generation
02 Introduction to Image Generation
book Course
access_time 30 minutes
show_chart Introductory

This course introduces diffusion models, a family of machine learning models that recently showed promise in the image generation space. Diffusion models draw inspiration from physics, specifically thermodynamics. Within the last few years, diffusion models became popular in both research...

Start course
Activity Thumbnail for Vector Search and Embeddings
03 Vector Search and Embeddings
book Course
access_time 2 hours
show_chart Intermediate

This course introduces Vertex AI Vector Search and describes how it can be used to build a search application with large language model (LLM) APIs for embeddings. The course consists of conceptual lessons on vector search and text embeddings, practical...

Start course
Activity Thumbnail for Inspect Rich Documents with Gemini Multimodality and Multimodal RAG
04 Inspect Rich Documents with Gemini Multimodality and Multimodal RAG
book Course
access_time 4 hours 45 minutes
show_chart Intermediate

Complete the intermediate Inspect Rich Documents with Gemini Multimodality and Multimodal RAG skill badge to demonstrate skills in the following: using multimodal prompts to extract information from text and visual data, generating a video description, and retrieving extra information beyond...

Start course