About Me

Results-driven AI Automation Engineer & Full-Stack Architect specializing in designing production-ready LLM ecosystems, scalable RAG pipelines, and enterprise SaaS solutions. Proven track record of orchestrating foundational models (Gemini, CLIP, VLMs) with FAISS and Genkit to automate complex workflows, reducing manual data processing time by 60%+.

I am an AI/ML researcher specializing in multimodal AI, computer vision, and natural language processing. My work focuses on practical applications in medical imaging and computational methods for under-resourced languages.

I am currently a Research Assistant at the North South University, also contributing to the Centre for UnlieAl Artificial Intelligence Research Lab at CU Boulder. I have co-authored 3 high-impact research papers currently under review at top-tier venues including KDD, ACL, and SaTC, and my research involves processing 500GB+ of complex multimodal datasets.

My technical expertise includes building and training deep learning models using PyTorch and Hugging Face Transformers. I have extensive experience with Vision-Language Models (VLMs) such as CLIP, ViLT, BLIP, and BioViL-T for developing radiology benchmarks and nuanced hate detection. Adept at deploying multi-tenant, containerized cloud architectures (AWS, GCP, Kubernetes) supporting high-volume daily requests with 99.9% uptime.

Skills & Expertise

AI & Machine Learning

PyTorch TensorFlow Hugging Face Transformers Scikit-learn XGBoost LightGBM

Domains & Applications

Multimodal AI Computer Vision Natural Language Processing (NLP) Medical Imaging (Radiology) Time Series Forecasting Under-resourced Languages

AI & LLMOps

RAG Architecture Vector Databases (FAISS) Google Genkit Semantic Search Chunking Strategies Multi-agent Systems

Backend & Cloud

FastAPI Node.js AWS GCP Docker Kubernetes CI/CD (GitHub Actions) PostgreSQL Firebase REST API Design Multi-tenant Architecture

Frontend

Next.js 15 React 19 TypeScript Tailwind CSS Strapi CMS Prisma ORM Shadcn UI

Languages

Python TypeScript C/C++ Java SQL/NoSQL

News

  • 2025: Paper “Bridging the Gap in Low-Resource Languages” submitted to Plos One (First Author).

  • 2025: Paper “Face2Profile” submitted to KDD 2026 (Co-author).

  • 2025: Paper “Engineering Spatiotemporal Forecasts at Scale” submitted to SaTC 2026 (Co-author).

  • Feb 2025: Started role as Lab Instructor at North South University.

  • Jan 2025: Joined the Centre for UnlieAl Artificial Intelligence Research Lab at CU Boulder as a Research Assistant.

  • Oct 2024: Our paper, "RadMultiBench," was submitted to ICDE 2026.

  • Oct 2024: Our paper on "Consensus Feature Selection Pipeline" was submitted to ICDE 2026.

  • Apr 2024: Graduated from North South University with a B.S. in Computer Science and Engineering (Summa Cum Laude).

Publications

Preprints & Under Review

RadMultiBench: A Multimodal Benchmark Reveals Architectural Failures in Modern Vision-Language Models for Radiology

Md. Nahin Alam, et al. (First Author)

Under Review — Health-2025

Bridging the Gap in Low-Resource Languages: Multilingual Multimodal Hate Detection in Memes with Image Descriptions

Md. Nahin Alam (First Author)

Submitted to Plos One

Face2Profile: A Face-and-URL Dataset for Open-Ended Profile Construction

Co-author

Submitted to KDD 2026

Engineering Spatiotemporal Forecasts at Scale: Dhaka Weather with State-of-the-Art Machine Learning Models

Co-author

Submitted to SaTC 2026

Live Apps

AI • RAG • SaaS

MySoftHeaven AI Suite 🏢

A production-ready AI ecosystem featuring an intelligent lead processing engine and a Retrieval-Augmented Generation (RAG) chatbot. It automates business workflows by transforming raw data into structured leads and providing context-aware responses using vector similarity search. Built with FastAPI, FAISS, and Gemini 1.5 Flash.

  • RAG-powered Chatbot with FAISS vector indexing and semantic chunking
  • AI Lead Orchestrator for intent classification and entity extraction
  • Multi-tenant Architecture design supporting schema-per-tenant isolation
  • Automated CRM Sync concepts for Salesforce and HubSpot integration
  • • Deployed using Docker & Docker Compose for full stack portability
FastAPI Python FAISS RAG Docker Gemini AI
● Live

RBAC & Project Management System

An enterprise-grade administration platform engineered on the PERN Stack (PostgreSQL, Express.js, React 18, Node.js). It implements a strict MVC Architecture with TypeScript for type safety across the full stack. Security features include granular Role-Based Access Control (RBAC), cryptographically secure invitation workflows, and JWT stateless authentication. The backend leverages Prisma ORM for complex relation handling and Soft Delete mechanisms to ensure data integrity. The frontend utilizes Redux Toolkit for deterministic state management and Axios Interceptors for seamless token injection, wrapped in a responsive Material UI (MUI) interface.

TypeScript React 18 Redux Toolkit Node.js Express.js PostgreSQL Prisma ORM REST API MUI JWT & Bcrypt Render/Vercel
● Live

AntHive: Real-time Language Exchange Platform

A comprehensive social application built on the MERN Stack (MongoDB, Express.js, React 19, Node.js). It features high-performance real-time messaging powered by Stream Chat and seamless video conferencing integrated via the Stream Video SDK. State management is handled efficiently with Zustand, while server-side state and caching are optimized using TanStack Query. The UI is responsive and themeable, crafted with Tailwind CSS and DaisyUI. Security is enforced via JWT Authentication and HTTP-only cookies.

MERN Stack React 19 Vite Stream Chat SDK Stream Video SDK Zustand TanStack Query MongoDB Express.js Tailwind CSS DaisyUI JWT Auth
● Live

CodeGenius - AI Learning Platform

A personalized coding education platform built with Next.js 15 and React 19. It leverages Google Genkit and the Gemini 2.5 Flash model to generate custom learning paths and real-time code examples. The backend is fully serverless using Firebase (Auth, Firestore, App Hosting) for seamless scalability. Features include an interactive dashboard, secure authentication, and a polished interface styled with Tailwind CSS and Shadcn UI.

React 19 Next.js 15 TypeScript Google Genkit Gemini 2.5 Flash Firebase Tailwind CSS Shadcn UI Zod Validation Lucide Icons App Hosting
● Live

AI-Powered Resume Builder

A full-stack SaaS application that leverages Generative AI to automate resume creation. Built with React and Vite for a high-performance frontend, it integrates Google Gemini AI to generate professional summaries and experience points contextually. The backend is powered by a Strapi Headless CMS providing robust content management and **REST APIs**. Features include Clerk Authentication for secure user sessions, real-time live preview, one-click PDF export, and social sharing capabilities. Styled with Tailwind CSS and Shadcn UI for a modern, responsive aesthetic.

React 18 Vite Full Stack Google Gemini AI Strapi CMS REST API Tailwind CSS Shadcn UI Clerk Auth Real-time Preview Vercel Deployment
● Live

Duari: AI Study Abroad Companion

A full-stack education platform engineered with Next.js 15 (App Router) and TypeScript. It leverages Google's Genkit and the Gemini 2.5 Flash model to generate personalized university recommendations. The system features a robust backend using Firebase for auth/data and Prisma ORM, interactive UI with Shadcn UI and Tailwind CSS, and data visualization via Recharts. Includes drag-and-drop tracking (DnD Kit) and strict schema validation with Zod.

Gemini 2.5 Flash Genkit AI Next.js 15 React 18 TypeScript Firebase Prisma ORM Tailwind CSS Shadcn UI Zod React Hook Form Recharts DnD Kit Radix UI
● Live

Nexus: AI Knowledge Cartographer

An interactive research assistant that visualizes complex topics as a dynamic knowledge graph using the Gemini 2.5 Flash model. Built with a React/TypeScript frontend, it features a real-time chat interface to initiate graph generation. The application leverages the Gemini API's Structured Output (JSON) capability to precisely define Knowledge Nodes and Edges, which are rendered in an interactive SVG visualizer.

Gemini 2.5 Flash React & TypeScript Knowledge Graph Generation Structured Output (JSON) Tailwind CSS & Vite
● Live

CodePrep | Interview Platform

A comprehensive Full-Stack algorithm practice platform engineered with Next.js 14 (App Router) and TypeScript. Features a secure Remote Code Execution sandbox, robust authentication via Supabase Auth, and persistent data storage using PostgreSQL. Implements Row Level Security (RLS) for data protection, Server Actions for mutation, and a responsive UI built with Tailwind CSS and Shadcn UI.

Next.js 14 TypeScript Supabase & PostgreSQL Server Actions Tailwind & Shadcn
● Live

Real-Time Chat Platform

A full-stack messaging application built with the MERN stack. Features real-time communication via Socket.io, secure JWT authentication, global state management with Zustand, and live user status updates.

MERN Stack Socket.io Tailwind & DaisyUI JWT
● Live

Full-Stack E-Commerce App

A production-ready platform engineered with the MERN stack and Ant Design. Features a comprehensive RESTful API, secure JWT authentication, admin dashboard for inventory management, and integrated Braintree payment processing.

MERN Stack Ant Design Braintree Payments Context API

Experience

AI Research Assistant (Remote)

Univ. of Colorado Boulder — Centre for UnlieAI Artificial Intelligence Research Lab

Jan 2025 – Present

  • Constructed automated data pipelines to process 500GB+ of complex multimodal medical imaging and NLP datasets, decreasing data preparation time by 50%.
  • Co-authored 3 high-impact research papers currently under review at top-tier venues (KDD, ACL, SaTC) by driving experimental design and scalable AI methodologies.

Cloud Infrastructure & Automation Specialist

The Data Island, Dhaka

Jun 2024 – Oct 2025

  • Architected and deployed highly scalable multi-tenant cloud infrastructure across AWS and GCP using Docker and Kubernetes, achieving 99.9% system availability.
  • Accelerated deployment cycles by 40% by engineering automated CI/CD pipelines for enterprise AI applications.
  • Optimized REST APIs and high-availability PostgreSQL databases, reducing average query response time by 35% and enabling 5,000+ concurrent requests.

Vision-Language Model Researcher

Machine Learning Lab, North South University

Jan 2025 – Oct 2025

  • Spearheaded the benchmarking of state-of-the-art VLMs (CLIP, ViLT, BioViL-T) across 10,000+ data points to identify architectural bottlenecks.
  • Developed automated evaluation scripts for RadMultiBench, accelerating multimodal medical model testing cycles by 3x.

Research Assistant

Cyber Physical Systems Lab, North South University

Jun 2023 – Jun 2024

Developed intelligent sensor networks and implemented ML algorithms for predictive maintenance in IoT systems.

Teaching

AI/ML Lab Instructor

North South University

Feb 2025 – Current

  • Instructed core AI/ML laboratory sessions for 50+ students, delivering hands-on PyTorch and TensorFlow demonstrations that improved average class performance by 15%.
  • Engineered an automated Python-based grading system, reducing assignment evaluation time by 80% while providing instant, accurate feedback.

University Teaching Assistant

North South University

Jun 2024 – Jan 2025

Assisted faculty with undergraduate courses. Supervised labs for programming (Python, Java, C/C++) and held sessions on algorithms and data structures.

Education & Certifications

Education

Bachelor of Science, Computer Science and Engineering

North South University, Dhaka

Jan 2020 – Apr 2024

  • Final Grade: 3.81/4.00 (Summa Cum Laude)
  • Honors: Dean's List recognition for academic excellence. EQF Level 6.
  • IELTS: C1 (Advanced)

Higher Secondary School Certificate (12th)

Uttara High School & College, Dhaka

2017 – 2019

Result: 80–100% (A*/High First)

Secondary School Certificate (10th)

Uttara High School & College, Dhaka

2015 – 2017

Result: 80–100% (A*/High First)

Certifications

  • FastAI Deep Learning Course
  • Stanford Machine Learning Specialization (Coursera)
  • Andrew Ng Deep Learning Specialization (Coursera)
  • freeCodeCamp Data Science Certification
  • Google Cloud Platform Fundamentals