cv

Super fortunate to have been part of lot of these!

General Information

Full Name Arnav Sareen
Location Charlotte, NC, USA
Email asareen2@charlotte.edu
LinkedIn https://linkedin.com/in/arnavsareen
GitHub https://github.com/aesareen

Education

  • 2023–2027
    University of North Carolina at Charlotte
    Charlotte, NC
    • Bachelor of Science in Computer Science (AI Concentration), Bachelor of Science in Data Science.
    • Honors: Levine Scholar (Premier Merit Scholarship), Chancellor's List — Fall 2023 - Spring 2025.
    • GPA: 4.0/4.0
  • Jan–May 2026
    Tampere University
    Tampere, Finland
    • Exchange Studies.

Experience

  • May 2026–
    Incoming Fulbright Canada MITACS Globalink Research Intern
    University of Waterloo, Waterloo, ON, Canada
  • Aug 2025–Present
    Teaching Assistant
    UNC Charlotte College of Computing & Informatics, Charlotte, NC
    • Develop and grade hands-on data mining exercises on clustering, classification, and regression, providing detailed feedback that enhances student comprehension and problem-solving skills.
    • Lead student code review and office hours for a Data Mining course of 70+ students, guiding them through classical machine learning techniques.
  • Jul 2025–Present
    E3 Technical Fellow
    CodePath, Charlotte, NC
    • Mentor 20+ students in computer science and interview preparation, increasing technical interview confidence and results through 1:1 guidance.
    • Facilitate mock technical interviews, helping students improve problem-solving efficiency and coding quality through targeted feedback and debugging support.
  • Sep–Dec 2025
    Ignite Fellow
    Teach for America, Remote
    • Taught middle school students key mathematical topics, increasing engagement, comprehension, and performance via tailored tutoring sessions.
  • May–Dec 2025
    Data Science and Analytics Intern
    Trane Technologies, Davidson, NC
    • Pioneered a hybrid, ensemble-based recommendation engine using matrix factorization and deep learning collaborative filtering algorithms to drive a projected $3M in sales and increase user engagement by 30%.
    • Designed a complementary retrieval-augmented generation (RAG) system leveraging AWS Bedrock Agents and Lambda to query Redshift and generate contextualized responses from enterprise data, increasing recommendation effectiveness by 60%.
    • Engineered custom metrics to measure recommendation quality by analyzing business impact via transaction patterns, customer overlap, and item taxonomy that prioritizes more relevant and grounded recommendations.
    • Performed customer segmentation using K-Means clustering to support a waterfall pricing strategy, improving customer spend and optimizing discount thresholds.
  • May–Jun 2025
    SPOA Research Assistant
    UNC Charlotte Athletics, Davidson, NC
    • Developed intuitive spreadsheets and databases to provide Charlotte Athletics administrators and coaches with clear financial insights, enabling informed decision-making across 17 varsity sports.
    • Architected and implemented a tracking system to manage NIL and performance-based bonus payments, streamlining financial operations within Charlotte Athletics' annual $35M budget.
  • Jun 2024–Jun 2025
    Data Collector
    Pro Football Focus (PFF), Remote
    • Collected player participation data for FCS and FBS teams during the 2024-2025 season, tracking personnel and position data across 60–70 plays per game.
    • Tracked over 120 data points per play with a 98.5% accuracy rate, enabling timely and accurate delivery of game data to clients and improving the quality of downstream analyses.
  • Jun 2024–Present
    Editor-in-Chief & Web Developer and Media Editor
    ETHEL Undergraduate Research Journal, Charlotte, NC
    • Re-established the Undergraduate Research Journal at UNC Charlotte, spearheading administrative and student outreach efforts behind the university's premier journal for undergraduate research.
    • Develop the website and curate the digital version of the journal using WordPress and Bricks Builder, implementing SEO and accessibility best practices.
    • Review student submissions and provide editorial feedback to improve clarity, structure, and impact across a variety of disciplines.
  • Aug 2023–Present
    AI & HPC Researcher
    Data Intelligence Research (DIR) Lab, UNC Charlotte
    • Analyze the performance of large language models for interpreting complex high-performance computing (HPC) logs via a scientific paper-based RAG system, reducing hallucination rates.
    • Enhance the performance of language models for I/O trace log analysis, increasing the consistency and accuracy of results by 80% through prompt engineering and evaluation techniques.
    • Utilize LLM evaluation frameworks to benchmark system improvements and guide modifications to the RAG pipeline, increasing development speed and impact.
  • May–Sep 2024
    Data Science and AI Engineer Intern
    Charlotte Works, Charlotte, NC
    • Developed a custom AI assistant and chatbot, improving model performance by 70% through advanced RAG techniques such as HyDE and hybrid retrieval and custom data preprocessing.
    • Analyzed labor market data using Python and statistical tools to uncover key labor trends and growth opportunities in Mecklenburg County, leading to actionable suggestions for economic development.
  • Jun–Aug 2022
    ML Research Assistant
    PICTure Research Group, Raleigh, NC
    • Improved the efficiency of machine learning-based learned indexes in high-write environments through an optimized retraining model, reducing accuracy degradation by 60%.
    • Presented the capabilities of learned indexes at the NCSSM Mentorship Research Symposium to a diverse audience.

Publications

  • C. Egersdoerfer, A. Sareen, J. L. Bez, S. Byna, D. D. Xu, and D. Dai, "IOAgent: Democratizing Trustworthy HPC I/O Performance Diagnosis Capability via LLMs," 2025 IEEE International Parallel and Distributed Processing Symposium (IPDPS), Milano, Italy, 2025, pp. 322–334. doi: 10.1109/IPDPS64566.2025.00036.
  • C. Egersdoerfer, A. Sareen, J. L. Bez, S. Byna, and D. Dai, "ION: Navigating the HPC I/O Optimization Journey using Large Language Models," Proceedings of the 16th ACM Workshop on Hot Topics in Storage and File Systems (HotStorage), Santa Clara, CA, USA, 2024, pp. 86–92. https://dl.acm.org/doi/10.1145/3655038.3665950.

Technical Presentations

  • "Exploring the Performance of AI-Based HPC I/O Performance Diagnoses Across Multiple Models," State of North Carolina Undergraduate Research and Creativity Symposium, November 2025, Elon, NC (Poster).
  • "I/O Navigator: Enhancing and Benchmarking LLMs for HPC I/O Performance Diagnosis," National Conference on Undergraduate Research, April 2025, Pittsburgh, PA (Poster).
  • "ION: Navigating the HPC I/O Optimization Journey using Large Language Models," Undergraduate Research Initiative Final Poster Presentation, May 2024, Charlotte, NC (Poster).
  • "ION: Navigating the HPC I/O Optimization Journey using Large Language Models," UNC Charlotte Undergraduate Research Symposium, April 2024, Charlotte, NC (Poster).

Awards

  • George Barthalmus Undergraduate Research Award: promotes early involvement in research by supporting sophomores in projects of their own design. Awarded 2024.
  • Levine Scholarship: UNC Charlotte's premier undergraduate merit scholarship for students who exemplify intellectual curiosity, commitment to community service, and ethical leadership. Provides up to eight semesters of educational costs plus funding for four summer experiences in outdoor leadership, nonprofit service, pre-professional development, and international education. Awarded 2023.
  • National Cyber Scholar with Honors: earned a top score among high school students nationwide by solving real-world cybersecurity problems and completed the GIAC Foundational Cybersecurity Technologies (GFACT) exam with one of the top 500 scores in the cohort. Awarded 2023.

Technical Skills and Other Affiliations

Courses Data Structures & Algorithms, Computer Systems, Database Design, Operating Systems & Networks, Natural Language Processing, Artificial Intelligence, Machine Learning, Data Mining, Applied Regression, Multivariate Analysis.
Languages Python, Java, SQL, C/C++, SAS, JavaScript, HTML/CSS, RISC-V.
Developer Tools Git, VS Code, AWS SageMaker, AWS Bedrock, AWS Redshift, BigQuery, PyCharm, IntelliJ.
Libraries and Tools Pandas, Polars, NumPy, Matplotlib, DsPy, ChromaDB, Plotly, Scikit-learn, spaCy, BeautifulSoup, Streamlit, Selenium, Flask.
Clubs and Organizations Sports Analytics Club Member, Charlotte AI Research Club Member.