Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Yifan Peng's picture
9 10 20

Yifan Peng

pyf98
tuanio's profile picture almjanx's profile picture shylockasr's profile picture
ยท
https://pyf98.github.io
  • pyf98

AI & ML interests

Multimodal LLMs, Speech-to-Speech, Speech Recognition

Recent Activity

updated a dataset about 1 month ago
espnet/yodas_owsmv4
updated a model about 1 month ago
espnet/owsm_ctc_v3.2_ft_1B
updated a model about 1 month ago
espnet/owsm_ctc_v3.1_1B
View all activity

Organizations

NVIDIA's profile picture ESPnet's profile picture Blog-explorers's profile picture YODAS Sharing inc's profile picture Nvidia Data&Tools team's profile picture

pyf98 's collections 1

Open Whisper-style Speech Models (OWSM)
Fully open Whisper-style speech foundation models developed by CMU WAVLab: https://www.wavlab.org/activities/2024/owsm/
  • Running on Zero
    9
    9

    OWSM V4 Demo

    ๐ŸŒ

    This is a demo for OWSM-V4 CTC and medium model.

  • Runtime error
    55
    55

    OWSM Demo

    ๐Ÿ”Š

  • espnet/yodas_owsmv4

    Viewer โ€ข Updated Sep 1 โ€ข 4 โ€ข 2.11k โ€ข 15
  • espnet/owsm_ctc_v4_1B

    Automatic Speech Recognition โ€ข Updated Aug 30 โ€ข 157 โ€ข 5
Open Whisper-style Speech Models (OWSM)
Fully open Whisper-style speech foundation models developed by CMU WAVLab: https://www.wavlab.org/activities/2024/owsm/
  • Running on Zero
    9
    9

    OWSM V4 Demo

    ๐ŸŒ

    This is a demo for OWSM-V4 CTC and medium model.

  • Runtime error
    55
    55

    OWSM Demo

    ๐Ÿ”Š

  • espnet/yodas_owsmv4

    Viewer โ€ข Updated Sep 1 โ€ข 4 โ€ข 2.11k โ€ข 15
  • espnet/owsm_ctc_v4_1B

    Automatic Speech Recognition โ€ข Updated Aug 30 โ€ข 157 โ€ข 5
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs