cv
Super fortunate to have been part of lot of these!
General Information
| Full Name | Arnav Sareen |
| Location | Charlotte, NC, USA |
| asareen2@charlotte.edu | |
| https://linkedin.com/in/arnavsareen | |
| GitHub | https://github.com/aesareen |
Education
-
2023–2027 University of North Carolina at Charlotte
Charlotte, NC - Bachelor of Science in Computer Science (AI Concentration), Bachelor of Science in Data Science.
- Honors: Levine Scholar (Premier Merit Scholarship), Chancellor's List — Fall 2023 - Spring 2025.
- GPA: 4.0/4.0
-
Jan–May 2026 Tampere University
Tampere, Finland - Exchange Studies.
Experience
-
May 2026– Incoming Fulbright Canada MITACS Globalink Research Intern
University of Waterloo, Waterloo, ON, Canada -
Aug 2025–Present Teaching Assistant
UNC Charlotte College of Computing & Informatics, Charlotte, NC - Develop and grade hands-on data mining exercises on clustering, classification, and regression, providing detailed feedback that enhances student comprehension and problem-solving skills.
- Lead student code review and office hours for a Data Mining course of 70+ students, guiding them through classical machine learning techniques.
-
Jul 2025–Present E3 Technical Fellow
CodePath, Charlotte, NC - Mentor 20+ students in computer science and interview preparation, increasing technical interview confidence and results through 1:1 guidance.
- Facilitate mock technical interviews, helping students improve problem-solving efficiency and coding quality through targeted feedback and debugging support.
-
Sep–Dec 2025 Ignite Fellow
Teach for America, Remote - Taught middle school students key mathematical topics, increasing engagement, comprehension, and performance via tailored tutoring sessions.
-
May–Dec 2025 Data Science and Analytics Intern
Trane Technologies, Davidson, NC - Pioneered a hybrid, ensemble-based recommendation engine using matrix factorization and deep learning collaborative filtering algorithms to drive a projected $3M in sales and increase user engagement by 30%.
- Designed a complementary retrieval-augmented generation (RAG) system leveraging AWS Bedrock Agents and Lambda to query Redshift and generate contextualized responses from enterprise data, increasing recommendation effectiveness by 60%.
- Engineered custom metrics to measure recommendation quality by analyzing business impact via transaction patterns, customer overlap, and item taxonomy that prioritizes more relevant and grounded recommendations.
- Performed customer segmentation using K-Means clustering to support a waterfall pricing strategy, improving customer spend and optimizing discount thresholds.
-
May–Jun 2025 SPOA Research Assistant
UNC Charlotte Athletics, Davidson, NC - Developed intuitive spreadsheets and databases to provide Charlotte Athletics administrators and coaches with clear financial insights, enabling informed decision-making across 17 varsity sports.
- Architected and implemented a tracking system to manage NIL and performance-based bonus payments, streamlining financial operations within Charlotte Athletics' annual $35M budget.
-
Jun 2024–Jun 2025 Data Collector
Pro Football Focus (PFF), Remote - Collected player participation data for FCS and FBS teams during the 2024-2025 season, tracking personnel and position data across 60–70 plays per game.
- Tracked over 120 data points per play with a 98.5% accuracy rate, enabling timely and accurate delivery of game data to clients and improving the quality of downstream analyses.
-
Jun 2024–Present Editor-in-Chief & Web Developer and Media Editor
ETHEL Undergraduate Research Journal, Charlotte, NC - Re-established the Undergraduate Research Journal at UNC Charlotte, spearheading administrative and student outreach efforts behind the university's premier journal for undergraduate research.
- Develop the website and curate the digital version of the journal using WordPress and Bricks Builder, implementing SEO and accessibility best practices.
- Review student submissions and provide editorial feedback to improve clarity, structure, and impact across a variety of disciplines.
-
Aug 2023–Present AI & HPC Researcher
Data Intelligence Research (DIR) Lab, UNC Charlotte - Analyze the performance of large language models for interpreting complex high-performance computing (HPC) logs via a scientific paper-based RAG system, reducing hallucination rates.
- Enhance the performance of language models for I/O trace log analysis, increasing the consistency and accuracy of results by 80% through prompt engineering and evaluation techniques.
- Utilize LLM evaluation frameworks to benchmark system improvements and guide modifications to the RAG pipeline, increasing development speed and impact.
-
May–Sep 2024 Data Science and AI Engineer Intern
Charlotte Works, Charlotte, NC - Developed a custom AI assistant and chatbot, improving model performance by 70% through advanced RAG techniques such as HyDE and hybrid retrieval and custom data preprocessing.
- Analyzed labor market data using Python and statistical tools to uncover key labor trends and growth opportunities in Mecklenburg County, leading to actionable suggestions for economic development.
-
Jun–Aug 2022 ML Research Assistant
PICTure Research Group, Raleigh, NC - Improved the efficiency of machine learning-based learned indexes in high-write environments through an optimized retraining model, reducing accuracy degradation by 60%.
- Presented the capabilities of learned indexes at the NCSSM Mentorship Research Symposium to a diverse audience.
Publications
- C. Egersdoerfer, A. Sareen, J. L. Bez, S. Byna, D. D. Xu, and D. Dai, "IOAgent: Democratizing Trustworthy HPC I/O Performance Diagnosis Capability via LLMs," 2025 IEEE International Parallel and Distributed Processing Symposium (IPDPS), Milano, Italy, 2025, pp. 322–334. doi: 10.1109/IPDPS64566.2025.00036.
- C. Egersdoerfer, A. Sareen, J. L. Bez, S. Byna, and D. Dai, "ION: Navigating the HPC I/O Optimization Journey using Large Language Models," Proceedings of the 16th ACM Workshop on Hot Topics in Storage and File Systems (HotStorage), Santa Clara, CA, USA, 2024, pp. 86–92. https://dl.acm.org/doi/10.1145/3655038.3665950.
Technical Presentations
- "Exploring the Performance of AI-Based HPC I/O Performance Diagnoses Across Multiple Models," State of North Carolina Undergraduate Research and Creativity Symposium, November 2025, Elon, NC (Poster).
- "I/O Navigator: Enhancing and Benchmarking LLMs for HPC I/O Performance Diagnosis," National Conference on Undergraduate Research, April 2025, Pittsburgh, PA (Poster).
- "ION: Navigating the HPC I/O Optimization Journey using Large Language Models," Undergraduate Research Initiative Final Poster Presentation, May 2024, Charlotte, NC (Poster).
- "ION: Navigating the HPC I/O Optimization Journey using Large Language Models," UNC Charlotte Undergraduate Research Symposium, April 2024, Charlotte, NC (Poster).
Awards
- George Barthalmus Undergraduate Research Award: promotes early involvement in research by supporting sophomores in projects of their own design. Awarded 2024.
- Levine Scholarship: UNC Charlotte's premier undergraduate merit scholarship for students who exemplify intellectual curiosity, commitment to community service, and ethical leadership. Provides up to eight semesters of educational costs plus funding for four summer experiences in outdoor leadership, nonprofit service, pre-professional development, and international education. Awarded 2023.
- National Cyber Scholar with Honors: earned a top score among high school students nationwide by solving real-world cybersecurity problems and completed the GIAC Foundational Cybersecurity Technologies (GFACT) exam with one of the top 500 scores in the cohort. Awarded 2023.
Technical Skills and Other Affiliations
| Courses | Data Structures & Algorithms, Computer Systems, Database Design, Operating Systems & Networks, Natural Language Processing, Artificial Intelligence, Machine Learning, Data Mining, Applied Regression, Multivariate Analysis. |
| Languages | Python, Java, SQL, C/C++, SAS, JavaScript, HTML/CSS, RISC-V. |
| Developer Tools | Git, VS Code, AWS SageMaker, AWS Bedrock, AWS Redshift, BigQuery, PyCharm, IntelliJ. |
| Libraries and Tools | Pandas, Polars, NumPy, Matplotlib, DsPy, ChromaDB, Plotly, Scikit-learn, spaCy, BeautifulSoup, Streamlit, Selenium, Flask. |
| Clubs and Organizations | Sports Analytics Club Member, Charlotte AI Research Club Member. |