AI Engineer | I Help Build High-Performance GenAI & Computer Vision Solutions | PyTorch - TensorFlow - Built 5+ AI Projects | Python Expert | Open to Entry-Level AI & ML Roles
Working on Perceptive Multimodal Generative AI - A unified platform for creative AI
Learning Agentic AI, Advanced LLMs, and Deployment Strategies
Looking to collaborate on AI Research, Generative AI, and Open Source Projects
Ask me about Deep Learning, Computer Vision, Stable Diffusion, GANs
Reach me at vimalvimal1293@gmail.com
Tech Stack: Python | PyTorch | Stable Diffusion | CLIP | GANs | Latent Diffusion Models
A unified multimodal AI platform integrating comic generation, image style transfer, text-to-image synthesis, animation, inpainting, and 2D-to-3D reconstruction. Implements cutting-edge deep learning architectures including Stable Diffusion, PersonaGPT, VGG19, AdaIN, and voxel-based 3D modeling.
Key Features:
- AI-powered comic generation with narrative synthesis
- Neural style transfer preserving 98% structural similarity
- CLIP-based text-to-image with 96% semantic accuracy
- Depth estimation & 3D reconstruction (91% accuracy)
- Research-backed implementation (2 published papers)
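The AdaIN step named in the description above can be illustrated with a minimal sketch. It assumes a single feature channel flattened to a list of floats; the function name `adain_channel` and the `eps` value are illustrative choices, not taken from the project code. It shifts the content statistics to match the style statistics, following AdaIN(x, y) = σ(y) · (x − μ(x)) / σ(x) + μ(y).

```python
import math


def adain_channel(content, style, eps=1e-5):
    """Align the mean/std of one content channel to those of a style channel.

    Hypothetical helper for illustration; real pipelines apply this per
    channel on 4D feature tensors produced by an encoder such as VGG19.
    """

    def mean_std(values):
        mu = sum(values) / len(values)
        var = sum((v - mu) ** 2 for v in values) / len(values)
        # eps keeps the division stable when a channel is constant
        return mu, math.sqrt(var + eps)

    c_mu, c_std = mean_std(content)
    s_mu, s_std = mean_std(style)
    # Normalize the content channel, then rescale and shift with style statistics
    return [s_std * (v - c_mu) / c_std + s_mu for v in content]


content = [0.0, 1.0, 2.0, 3.0]
style = [10.0, 10.5, 11.0, 11.5]
stylized = adain_channel(content, style)
```

After the call, `stylized` keeps the ordering of the content values but carries the mean and (approximately) the standard deviation of the style channel.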
Tech Stack: Python | YOLOv5 | YOLOR | VGG16 | TensorFlow | OpenCV
Real-time deep learning system for road safety monitoring using state-of-the-art object detection models. Achieves high-accuracy speed breaker detection for autonomous vehicles and driver assistance systems.
Key Achievements:
- Multi-model ensemble (YOLOv5, YOLOR, VGG16)
- Real-time processing for ADAS applications
- Comprehensive dataset creation & annotation
- Deployment-ready inference pipeline
Tech Stack: Python | Scikit-Learn | SVM | Random Forest | KNN | Signal Processing
Real-time EMG sensor-based hand gesture recognition system for intuitive human-computer interaction and assistive device control. Implements multiple ML classifiers with feature engineering for robust gesture classification.
Key Features:
- Real-time EMG signal processing & classification
- Multi-algorithm comparison (SVM, KNN, RF, Naive Bayes)
- Virtual mouse controller implementation
- Feature extraction (MAV, ZCR, RMS, Variance)
- 2 GitHub Stars
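The four time-domain features listed above (MAV, ZCR, RMS, Variance) can be sketched for one window of EMG samples as follows. This is a minimal standard-library illustration, not the project's code: the function name `emg_features`, the `zc_threshold` parameter, and the sample window are assumptions made for the example.

```python
import math


def emg_features(window, zc_threshold=0.0):
    """Compute MAV, ZCR, RMS and sample variance for one EMG window.

    Hypothetical helper; expects at least two samples.
    """
    n = len(window)
    mean = sum(window) / n

    # Mean absolute value: average rectified amplitude
    mav = sum(abs(x) for x in window) / n

    # Root mean square: related to signal power
    rms = math.sqrt(sum(x * x for x in window) / n)

    # Sample variance (n - 1 in the denominator)
    variance = sum((x - mean) ** 2 for x in window) / (n - 1)

    # Zero-crossing rate: fraction of adjacent pairs that change sign,
    # ignoring changes smaller than the threshold to reduce noise sensitivity
    crossings = sum(
        1
        for a, b in zip(window, window[1:])
        if a * b < 0 and abs(a - b) > zc_threshold
    )
    zcr = crossings / (n - 1)

    return {"mav": mav, "zcr": zcr, "rms": rms, "variance": variance}


window = [0.5, -0.5, 0.5, -0.5]
features = emg_features(window)
```

A feature vector like this, computed per window and per channel, is the kind of input that classifiers such as SVM, KNN, or Random Forest can then be trained on.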
Perceptive Multimodal Generative AI (PMG-1) | Authors: Rithani M., Sidesh Sundar S., Vimal Dharan N., SyamDev R. S. | Focus: Comic generation, style transfer, and multimodal content creation
Perceptive Multimodal Generative AI (PMG-2) | Authors: Rithani M., Sidesh Sundar S., Vimal Dharan N. | Focus: Text-to-image synthesis, animation, and 2D-to-3D reconstruction
EMG-Based Hand Gesture Recognition for Assistive Technologies | Authors: Vimal Dharan N., [Co-authors TBD] | Focus: Real-time EMG signal processing, feature engineering, and ML-based gesture classification for human-computer interaction
AI & Machine Learning
├── Deep Learning (CNNs, RNNs, Transformers, GANs)
├── Computer Vision (Object Detection, Segmentation, Style Transfer)
├── Natural Language Processing (CLIP, GPT-based models)
├── Generative AI (Stable Diffusion, Latent Diffusion Models)
└── 3D Vision (Depth Estimation, Voxel Modeling, Mesh Generation)
Specializations
├── Multimodal AI Systems
├── Real-time Object Detection (YOLO family)
├── Neural Style Transfer (VGG19, AdaIN)
├── Signal Processing (EMG, Time-series)
└── Research & Technical Writing
Development
├── Python (Advanced), C/C++ (Intermediate)
├── PyTorch, TensorFlow, Keras
├── Git, Docker, Linux
└── Jupyter, Google Colab, Weights & Biases
- Enhancing PMG-AI with ControlNet and LCM integration
- Exploring AI Safety and alignment research
- Pursuing certifications in Advanced Deep Learning
- Building Agentic AI systems with LangChain
- Contributing to open-source AI projects
- Actively seeking AI Engineer / ML Engineer roles (10+ LPA)
Currently seeking: AI Engineer, ML Engineer, Deep Learning Engineer, Computer Vision Engineer roles | Location: Chennai, Tamil Nadu (Open to Bengaluru & Remote) | Expected CTC: 10+ LPA | Availability: Immediate / 1 month notice
Collaborating on AI Research Papers | Open Source Contributions | Startup Opportunities in AI/GenAI | Freelance ML/DL Projects
- B.Tech in CSE (AI Specialization) - Amrita Vishwa Vidyapeetham
- 2 Research Papers on Multimodal Generative AI
- Top Performer in Deep Learning and Computer Vision projects
- GitHub Stars: Contributing to open-source AI projects
- Technical Skills: 15+ AI/ML frameworks and tools mastered
- Innovation: Created unified platform for multimodal generative AI
- Agentic AI - Building autonomous AI agents with LangChain & AutoGPT
- LLM Fine-tuning - PEFT, LoRA, QLoRA techniques
- MLOps - Docker, Kubernetes, CI/CD for ML models
- AI Safety - Alignment, interpretability, and robustness
- Advanced Computer Vision - NeRF, Gaussian Splatting, 3D Vision