AbstractPhila PRO

AbstractPhil

https://civitai.com/user/AbstractPhila

AbstractEyes

AI & ML interests

datasets, research papers, experimentation, vision, classification, text encoders, tokenization, llms, diffusion, distillation, and more.

Recent Activity

updated a model 1 day ago

AbstractPhil/agatha-diffusion-proto

published a model 1 day ago

AbstractPhil/agatha-diffusion-proto

replied to prithivMLmods's post 4 days ago

One speech model with seven voices, streamlined with multimodal capabilities for vision tasks. Performs vision (image-text) to audio inference with Qwen2.5-VL + VibeVoice-Realtime-0.5B. Vision to VibeVoice (EN) - The demo is live. 🗣️🔥

🤗 Vision-to-VibeVoice-en [Demo]: https://huggingface.co/spaces/prithivMLmods/Vision-to-VibeVoice-en
✨ Collection: https://huggingface.co/collections/prithivMLmods/multimodal-implementations
✨ Speech [VibeVoice-Realtime-0.5B]: https://huggingface.co/microsoft/VibeVoice-Realtime-0.5B
✨ Vision [Qwen2.5-VL]: https://huggingface.co/Qwen/Qwen2.5-VL-7B-Instruct

To know more about it, visit the app page or the respective model page!

Organizations

published an article 3 months ago

Article

Moving forward with fixated vocabularies aimed at general, symbolic, and medical.

Sep 26

published an article 4 months ago

Article

The Crystalline Engine

Aug 26