π Silver Medalist | Machine Learning Engineer | AI Researcher
I specialize in Speech AI, NLP, Computer Vision, and Generative AI with a proven track record of impactful research and innovative projects. From fine-tuning TTS/ASR systems to creating real-time multilingual bots, my work spans academic brilliance and industrial innovation.
π Currently:
- Lead Machine Learning Engineer at Idrak AI Ltd.
- Pursuing Ph.D. opportunities in Generative AI and Multimodal AI
- Exploring collaborations to scale AI-driven products
π Achievements:
- π₯ Silver Medalist (Ranked 2nd out of 80 students)
- π Programming Champion: Vigorous Spark Competition (2016)
- Speech & Audio: TTS, ASR, Noise Cancellation, Speaker Diarization
- NLP: LLM Fine-tuning, RAG, LangChain, LangGraph, Function Calling
- Computer Vision: YOLOv8, DINO, Image Matting, Object Detection
- Time Series: ECG Analysis, Anomaly Detection, Trading Indicators
- Languages: Python, C#, JavaScript, SQL
- Frameworks: PyTorch, TensorFlow, Huggingface, LangChain
- DevOps: Docker, AWS
- Databases: Neo4j, MongoDB, Qdrant, FAISS
- Fine-tuned Whisper ASR for low-latency transcription
- Built multilingual TTS systems (Urdu, SoVits, VITS)
- Created Speech Analytics Dashboards with insights on tone, sentiment, and diarization
- Designed calling bots for customer interaction, improving sales by 6%
- π Website: Portfolio
- πΌ LinkedIn: Muhammad Ali Abbas
- π GitHub: m-aliabbas
- βοΈ Medium: @m-aliabbas
- π Google Scholar: Scholar Profile
- π ORCID: ORCID Profile
- alee_tts: Advanced TTS systems for multilingual speech synthesis
- m-aliabbas: Portfolio and core projects repository
- dino_modified_mim: Enhanced DINO with Masked Image Modeling for self-supervised learning
- papia_language_modeling: Pretraining BERT and GPT models for the Papia language
- Swin Transformer for COVID-19 Diagnosis (Read)
- TuRF-based Feature Selection for Health Factors (Read)
- Hybrid Deep Learning for Pneumonia Detection (Read)
π» Always Exploring, Always Learning