Shane Zarechian - Portfolio
Profile of Shane Zarechian

Shane Zarechian

software

data

developer

UNH, Durham, New Hampshire

About

Computer Science student at the University of New Hampshire that likes to explore data science and engineering, ML, and fintech. Currently learning more about reinforcement learning, OpenCV, and clustering algorithms.

Experience

  • -

    Data Science Intern @ HouseNovel

    Durham, NH - Remote

    Summary:

    • Developed OCR pipelines using PaddleOCR, Tesseract, and OpenAI API to digitize historic records; processed 85k+ pages and transformed 5M+ entries into structured data.

    Responsibilities:

    • Performed OCR on 85,000+ scanned historic documents using PaddleOCR, OpenAI API, and Tesseract
    • Improved transcription accuracy by 40% by fine-tuning an open-source OCR engine
    • Preprocessed 100+ GB of images for OCR extraction with OpenCV, scikit-image, and regex
    • Grouped structural elements into 3 categories by implementing DBSCAN clustering on documents with scikit-learn
    • Converted 5M+ unformatted entries into structured JSON using regular expressions, ingested results into PostgreSQL
    • -

      Backend Developer @ UNH Center for Business Analytics & NHADC

      Durham, NH - Hybrid

      Summary:

      • Enhanced scraping pipelines and backend architecture for NHADC using Python, multithreading, OpenAI API, and FastAPI microservices.

      Responsibilities:

      • Enhanced scraping and data processing speed by 50%+ by leveraging multithreading in Python
      • Automated data extraction for 350+ Pydantic fields using the OpenAI API, enabling rapid analysis of raw HTML
      • Refactored Streamlit app into FastAPI microservices across 16 endpoints and deployed on Render
      • Implemented end-to-end Pytest suite with 100% coverage of API endpoints, ensuring stability and detecting bugs
      • Contributed to backend development strategy and collaborated with a team of 4 to ensure efficient implementation
      • -

        Sales Representative @ Sunrun

        Portsmouth, NH - On-site

        Summary:

        • Sales Representative responsible for establishing trust with clients, setting appointments for energy consultations, and connecting with hundreds of clients to build value in services.

        Responsibilities:

        • Established trust and served as first point of contact for clients to set appointments for energy consultations
        • Connected with hundreds of clients and established value in company services
        • Reached monthly sales goals through effective communication

        Projects

        Education

        Skills

        • Python
        • Pandas
        • SQLAlchemy
        • DuckDB
        • BeautifulSoup
        • FastAPI
        • Polars
        • NumPy
        • PostgreSQL
        • Django
        • Flask
        • Dagster
        • Selenium
        • CSS
        • Pydantic
        • Git
        • Pytest
        • HTML5
        • Apache Superset
        • Docker
        • Ollama
        • OpenAI API
        • Plotly
        • Anaconda
        • Metabase
        • Bootstrap
        • Numba
        • Jupyter
        • Railway
        • OpenCV
        • PaddlePaddle
        • Airbyte
        • Astro
        • GitLab
        • PyTorch
        • scikit-Learn
        • Azure
        • Render
        • JSON
        • Apache Parquet
        • Linux
        • Bash
        • PyPy
        • PyPI
        • Optuna
        • Loguru
        • Grafana
        • MongoDB
        • Supabase
        • PyCharm
        • DataGrip
        • IntelliJ
        • Discord API
        • Java
        • Kubernetes
        • AWS
        • Spring
        • Linear
        • Clickhouse
        • Cloudflare