Hi, I'm Smyan

I'm a 3rd year Computer Science student at Northeastern University, passionate about building software and solving problems.

Smyan Sengupta

Experience

Machine Learning and Data Analysis Co-op

(MSAT Modeling)
July 2025 - Present
PfizerAndover, MA
  • Performing data analysis on lab datasets for an $11 billion project using XGBoost, Random Forest, Scikit-learn Gradient Boosting, and Support Vector Regression
  • Executing and automating extensive data cleaning, organization, and analysis processes
  • Engineering a spaCy-based chatbot to provide automated insights on experimental data and process documentation

Co-Founder and Vice President

May 2025 - Present
MedCS LabBoston, MA
  • Leading the initiative to establish an interdisciplinary undergraduate research group at the intersection of computer science/data science and medical fields
  • Managing communication platforms (Discord, Notion) and facilitating coordination between members and advisors

Founding Member and Hackathon Head

August 2025 - Present
Northeastern Claude Builder ClubBoston, MA
  • Spearheading the planning and execution of a 24-hour competitive programming event
  • Managing budgeting and sponsorship coordination

Founding Member and Treasurer

September 2024 - Present
Northeastern Association for Computing MachineryBoston, MA
  • Managing all finances, fund disbursements, and meeting requirements
  • Organizing meetings, mixers, and collaborative events with other clubs
  • Coordinating with professors for speaking engagements

Teaching Assistant – Foundations of Data Science

September 2024 - April 2025
Khoury College of Computer SciencesBoston, MA
  • Facilitating 6+ office hours per week, leading project meetings, and proctoring labs/exams
  • Grading 90+ assignments per week and providing useful feedback on data analysis, linear algebra, statistics, and machine learning concepts

Tools I Use

Projects

Guardrails: Atomic

1st Place ALIHacks 2025

Built an AI-powered formal verification platform that generates mathematically verified code from natural language inputs. Engineered a YAML-to-Z3 conversion pipeline with automated counterexample generation for program verification.

TypeScriptNext.jsZ3 Theorem ProverMongoDBOpenRouter

NewsFactChecker

Developed and trained a fact-checker AI model to determine the amount of misinformation in news articles. Utilized Bayesian Inference with Hamiltonian Monte Carlo sampling to calculate probabilities of misinformation.

PythonBayesian InferenceMachine Learning

OpenLegislation

1st Place HackHarvard 2024 - Open Data Track

Developed a web application to make current Congress bills more accessible and understandable for common users, using the OpenAI API for vector search and simplification.

ReactTailwindCSSCongress.gov APIOpenAI API

HealthSync

Developed a mobile application that analyzes user health data and health-related journal entries to provide users with AI-powered health analysis and recommendations.

FlutterPythonMongoDB AtlasGemini API

Second Sight

Developed an AI-powered journaling application where users can log their mood for the day, create journal entries, and view entries over time. Leveraged Gemini AI to analyze entries and mood trends to assist users in understanding themselves better.

FlutterKotlinGemini API

Hoo Wants A Degree

Developed a degree planner web application to aid University of Virginia School of Engineering students in creating AI-generated four-year degree plans.

ReactPerplexity AI API

Stocks Simulator

Developed a full-stack Java application using MVC architecture for users to create, update, and maintain stock portfolios. Incorporated algorithms to calculate stock gain/loss, moving averages, and portfolio rebalancing with real-time data for 1000+ stocks.

JavaRESTful APIsMVC ArchitectureSwingAlpha Vantage API

Contact

Feel free to reach out if you'd like to connect or collaborate!