Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Manik Hossain 's picture

Manik Hossain

manik-hossain
·

AI & ML interests

None yet

Organizations

Hugging Face Discord Community's profile picture Hugging Face MCP Course's profile picture Hugging Science's profile picture Agents & MCP Hackathon - Winter 25's profile picture

manik-hossain 's collections 3

startup
  • PhotoMaker: Customizing Realistic Human Photos via Stacked ID Embedding

    Paper • 2312.04461 • Published Dec 7, 2023 • 62
  • InstantID: Zero-shot Identity-Preserving Generation in Seconds

    Paper • 2401.07519 • Published Jan 15, 2024 • 57
  • StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

    Paper • 2306.07691 • Published Jun 13, 2023 • 12
asr
  • Robust Speech Recognition via Large-Scale Weak Supervision

    Paper • 2212.04356 • Published Dec 6, 2022 • 40
Audio
  • Robust Speech Recognition via Large-Scale Weak Supervision

    Paper • 2212.04356 • Published Dec 6, 2022 • 40
startup
  • PhotoMaker: Customizing Realistic Human Photos via Stacked ID Embedding

    Paper • 2312.04461 • Published Dec 7, 2023 • 62
  • InstantID: Zero-shot Identity-Preserving Generation in Seconds

    Paper • 2401.07519 • Published Jan 15, 2024 • 57
  • StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

    Paper • 2306.07691 • Published Jun 13, 2023 • 12
Audio
  • Robust Speech Recognition via Large-Scale Weak Supervision

    Paper • 2212.04356 • Published Dec 6, 2022 • 40
asr
  • Robust Speech Recognition via Large-Scale Weak Supervision

    Paper • 2212.04356 • Published Dec 6, 2022 • 40
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs