CV

Objective

I am a program manager and data analyst focused on govtech and digital public service design. I blend tech policy, responsible innovation, and technical expertise to drive impact by using technology to solve problems related to social justice, sustainability, and public policy.

With over a decade of social science research, I apply both qualitative and quantitative methods to create data-driven solutions that improve public services.

Education

Viterbi School of Engineering 2021

  • Data Analytics Certificate

Middlebury College   2018

  • Summer Language School, Russian                 

Aberystywth University  2015

  • Master’s International Politics

University of Rochester 2014

  • Bachelors of Arts: History, International Relations, Russian

Experience

AI and Data Governance Consultant: 

Remote | October 2018 - December 2021, January 2023-Present

Self-Employed

  • Content creator for GovTech and Government data visualization (2024-present)

  • Analyzed and designed visualizations from NYC’s algorithmic open-source compliance data (2024)

  • Published 12 articles on the beneficial uses of AI and emerging technology in society (2018-present)

  • Designing and implementing BetaNYC’s Associate Board bylaws and standard operating procedure (2024-present)

  • Advised product managers at Rossum AI on policy implications of their AI tool, quoted in Read/Write (2021)

  • Collaborated with water engineers to develop an XPRIZE prize design on black-to-potable water technology (2020)


Progressive Policy Institute: 

Washington, D.C. | January 2022 - December 2023

Economic and Data Policy Analyst, Director of Innovation Frontier Project

  • Lead data analyst for Investment Heroes Project. 

  • Managed ten years of Investment Heroes Project data, over 130,000 data points

  • Pioneered a Python script for automatic data collection using the Security and Exchange Commission API, the first in the project’s 10-year history

  • Supervised junior staff to clean and impute Investment Heroes Data

  • Set emerging technology policy strategy for PPI, Program Director managing a budget of $100,000

  • Wrote 10 papers, blogs, articles, and reports on AI, privacy and data protection


Global Student Embassy:

Berkeley, CA | August 2016 - August 2018                                         

Director of Operations, Travel and Outreach Manager

  • Managed client database of over 10,000 clients

  • Updated and maintained entity-relationship diagram for client database

  • Updated and maintained documentation for Salesforce database as well as educating non-technical staff database use and best practices

  • Program manager with $70,000 budget for international development projects between California and Latin America

  • Developed sustainable development training and programming content for 10 projects across 4 countries


Senator Bernie Sanders:

Burlington, VT | January 2015 - June 2016

Constituent Advocate Intern

  • Managed 20 constituent advocacy cases for the Senator’s district office

  • Liaised with the Department of Veterans Affairs, the Department of Corrections, and the Social Security Administration to help Vermont constituents engage with federal agencies

Select Projects and Speaking Engagements

Data Projects: https://github.com/jshapi16

NYC Administrative Code LLM,

https://github.com/jshapi16/nyc_admin_llm

  • A domain-specific large language model (LLM) that uses retrieval-augmented generation (RAG) 

  • Currently Retrieves Title 1, 8 and 10 of the NYC Administrative Code

  • Uses flan-t5-base for retrieval, claude-3-7-sonnet for question/answering, bart-base for embedding

NYC Algorithmic Tools Compliance Analysis, https://github.com/jshapi16/NYC_alg_compliance

  • Using the reporting data from Local Law 35, 2022, which requires city agencies to report on their algorithmic usage. Performed data cleaning ran NLP analysis on descriptive columns to extract vendor information from descriptive columns. Created data visualizations using Matplotlib, which were published on  @GovTechGal instagram. 

  • Languages, libraries: Python, Matplotlib, NTLK, GenismSeaborn

Investment Heroes 2022: Washington, DC,  https://www.progressivepolicy.org/publication/investment-heroes-2023/

  • Study Question: How to measure Fortune 500 companies' U.S. capital spending?

  • Languages, libraries, API: Python, Pandas, Securities and Exchange Commission (SEC) API

  • Methods and Results: Lead data analyst for a dataset with ten years of data featuring 13,000 new data points per year. I wrote a Python script to pull new financial data from the SEC Edgar Filing API. I then performed statistical analysis using a proprietary methodology to estimate companies' U.S. capital spending.

Policy Research:

Op-Eds:

Panelist/Speaker:

  • Sprite+ Hub, An Overview of American Digital Privacy

  • Women's History Month Forum at the House of Representatives, Privacy in a Post-Roe World, March 2023 

Interview/Quotes

  • Bloomberg, Musk, Zuckerberg Lead Parade of Tech Titans to Senate AI Event

  • Bloomberg Government, What to Know in Washington: Congress Faces A.I. Learning Curve

  • Politico, "How Governments can keep up with the future."

  • Channel News Asia Documentary, "The Deepening US-China Tech War"


 

Languages

English (Fluent)

Spanish (Advanced)

Russian (Intermediate)

Chinese (Beginner)

Fellowships, Awards, and Affiliations

  • BetaNYC Associate Board Member (Co-Chair of Governance) 2024-present

  • Sprite+ Expert Fellow (2023-Present)

  • Rotary Global Grant Scholar, 2014-2015

  • Americorp’s Urban Fellow, 2012

  • Fulbright US-UK Summer Institute, Wales, 2011