Research Interests

My research spans machine learning and deep learning, with a focus on computer vision and image processing. This includes object detection, segmentation, tracking, pattern recognition, pose and depth estimation, and motion capture. I work with neural networks and attention mechanisms like CNNs, transformers (ViTs), convolutional attention, and models using cross-feature dependencies. I’m also interested in semi- and weakly-supervised learning approaches. I explore vision-language models and generative AI (VLMs, LLMs), and work with 3D and multimodal data—such as LiDAR, point clouds, volumetric imaging, and temporal data. Much of my work is applied in scientific and real-world contexts, including medical imaging, microscopy, diagnostic systems, anomaly detection, surveillance, and OCR.

Publications