Agentic AI Learning Month (S1) - Building Multimodal Systems on AI


May 29, 09:00 AM PT (04:00 PM GMT).
  • Free 547 Attendees

Join us for an exclusive series of webinars and workshops designed to enhance your AI skills and accelerate productivity, with a focus on AI coding, DeepSeek, Hugging Face, GraphRAG, Intel OPEA, and more. Whether you're building local LLMs, optimizing AI workflows, or deploying AI agents, these live sessions, led by industry experts, will provide valuable insights and hands-on experience.

Session 1: Building Multimodal Systems on AI

This hands-on workshop goes from voice to text and back to voice, using PyTorch to build GPU-accelerated multimodal applications for AI PCs. Participants learn how to implement and optimize audio-processing pipelines that combine speech recognition and speech synthesis into coherent applications. The workshop explores transformer-based models, GPU acceleration, and optimization techniques on AI PCs. Participants gain practical experience building responsive applications that process and generate audio effectively.
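The voice-to-text-to-voice flow described above can be sketched as a small pipeline of three stages. This is only an illustrative outline, not the workshop's actual code: `pick_device`, `VoicePipeline`, and the `fake_*` stages are hypothetical names, the middle `respond` step is an assumed text-processing stage, and the placeholder stages would be replaced by real transformer-based speech recognition and speech synthesis models. The device check assumes a PyTorch build that may or may not expose `torch.xpu` (Intel GPUs), so that lookup is guarded.

```python
import time
from dataclasses import dataclass, field
from typing import Callable, List, Tuple


def pick_device() -> str:
    """Return a PyTorch device string, falling back to CPU."""
    try:
        import torch
    except ImportError:
        return "cpu"  # PyTorch not installed; the sketch still runs
    if torch.cuda.is_available():
        return "cuda"
    # torch.xpu (Intel GPUs) only exists in newer PyTorch builds, so guard it.
    xpu = getattr(torch, "xpu", None)
    if xpu is not None and xpu.is_available():
        return "xpu"
    return "cpu"


@dataclass
class VoicePipeline:
    """Chains speech-to-text, a text step, and text-to-speech; times each stage."""
    transcribe: Callable[[bytes], str]
    respond: Callable[[str], str]
    synthesize: Callable[[str], bytes]
    timings: List[Tuple[str, float]] = field(default_factory=list)

    def _timed(self, name, fn, arg):
        start = time.perf_counter()
        out = fn(arg)
        self.timings.append((name, time.perf_counter() - start))
        return out

    def run(self, audio_in: bytes) -> bytes:
        text = self._timed("speech_to_text", self.transcribe, audio_in)
        reply = self._timed("respond", self.respond, text)
        return self._timed("text_to_speech", self.synthesize, reply)


# Placeholder stages (assumptions): real ones would wrap models moved to pick_device().
def fake_transcribe(audio: bytes) -> str:
    return audio.decode("utf-8")


def fake_respond(text: str) -> str:
    return text.upper()


def fake_synthesize(text: str) -> bytes:
    return text.encode("utf-8")


pipeline = VoicePipeline(fake_transcribe, fake_respond, fake_synthesize)
audio_out = pipeline.run(b"hello")
print(pick_device(), audio_out, [name for name, _ in pipeline.timings])
```

Recording per-stage timings is one simple way to see which stage dominates latency when aiming for near real-time responses.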

Using PyTorch for GPU acceleration and exploring the capabilities of AI PCs, these workshop sessions will cover:

  • Configuring and leveraging GPU resources effectively for audio AI workloads
  • Implementing speech-to-text conversion using transformer-based models on consumer hardware
  • Building and optimizing text-to-speech systems for responsive audio generation
  • Creating end-to-end audio processing pipelines combining multiple AI models
  • Developing responsive multi-modal applications that process and generate audio in near real-time

    The workshop is designed for developers and AI practitioners at the expert level.

    Python programming – Requires intermediate-level Python skills, including familiarity with functions, classes, and error handling

    PyTorch basics – Assumes familiarity with PyTorch and basic neural network concepts

    Deep Learning Fundamentals – Requires understanding of basic concepts, such as training, inference, and model architecture

    Development environment – Presumes experience with Jupyter notebooks and package management (pip/conda)

    Speakers:
    - Praveen Kundurthy, AI Solutions Engineer at Intel
    - Rakshith Krishnappa, Software Engineer at Intel

    Venue:
    Virtual; join from anywhere.

    Upcoming Sessions:

  • Session 1 (May 29th): Building Multimodal Systems on AI
  • Session 2 (June 4th): Generating Code with DeepSeek
  • Session 3 (June 11th): Enabling High-Performance Execution of GNN
  • Session 4 (June 12th): Deploying Agentic AI Applications with OPEA
  • Session 5 (June 19th): Up and Running with AWS and OPEA GraphRAG
  • Session 6 (June 26th): Building an LLM-Powered Chatbot with Streamlit and Hugging Face

    The event has ended.
    Watch Recording
    *Recordings are hosted on YouTube; clicking the link opens the YouTube page.