Alfonso Bonilla

Data Scientist, DJ, & Language Enthusiast

Experienced NLP data scientist and linguist who leverages deep learning technologies to tackle various challenges and understand language


I am a data scientist who specializes in natural language processing and machine learning. I have built an automatic abstractive summarizer, a fraud detector, and a similar-products recommender. I have also analyzed data on HIV infection rates, doctor-patient relationships, and language perception. At the core of all my work is human language and the goal of understanding how it's used and how we can use it for novel and informative purposes.


    • Key Skills

      • Machine Learning & Statistics
      • Research & Experimental Design
      • NLP & Computational Linguistics
      • Data Mining & Processing
      • Visualization & Exploratory Analysis
    • Programming Languages

      • Python
      • MATLAB
      • R
      • Bash
      • SQL
      • JavaScript
      • HTML, CSS, & jQuery
    • Software

      • NLTK (Natural Language ToolKit)
      • scikit-learn (Machine Learning Toolkit)
      • IPython & Jupyter Notebooks
      • Version Control (Git & SVN)
      • MapReduce, Hadoop, & Spark
      • reStructuredText (reST)
      • Praat (Acoustic Analysis)
      • Audacity & Ableton (Audio Processing)
      • REDCap & XNAT
      • Microsoft Office (Word, Excel, PowerPoint, Access, Outlook)
    • Languages

      • English (Native)
      • Spanish (Native)
      • Korean
      • Mandarin Chinese
      • Portuguese


    M.S. Computational Linguistics

    • University of Washington
    • GPA: 3.74
    • July 2016 - Dec 2017

      B.A. Linguistics & Molecular Cellular Biology - Neurobiology

      • UC Berkeley
      • GPA: 3.50
      • Jan 2011 - May 2015


    Data Scientist

    • Walmart Labs
    • Sunnyvale, CA
    • Oct 2017 - Current
    • Create text-based similar products models for Walmart's grocery website
    • Develop production data pipelines using Hive and Python
    • Conduct deep learning research

    Data Science Intern

    • WePay
    • Redwood City, CA
    • June 2017 - Oct 2017
    • Conducted research in anomaly detection using character-based deep learning models
    • Detected fraud using neural networks, random forests, and other machine learning techniques
    • Designed and coded deep learning models using TensorFlow
    • Explored the impact of character-based deep learning models in anomaly detection

    Dialog Systems Intern

    • KITT.AI
    • Seattle, WA
    • Aug 2016 - Nov 2016
    • Developed technical documentation for ChatFlow
    • Created video tutorials explaining ChatFlow and the process of building bots with ChatFlow
    • Built and integrated dialogue systems into the Alexa, Facebook Messenger, & Telegram environments
    • Integrated third-party APIs into dialogue systems

    Research Associate

    • SRI International
    • Menlo Park, CA
    • Aug 2015 - May 2016
    • Maintained a HIPAA-compliant database for various health-related projects
    • Evaluated data quality by leveraging statistical algorithms
    • Created tutorials for Python libraries and REDCap (technical software)
    • Developed data pipelines for human-generated data using Python & Bash
    • Resolved data quality issues by coordinating with five data acquisition sites throughout the US

    Research Assistant

    • UCB Helen Wills Neuroscience
    • Berkeley, CA
    • Oct 2012 - May 2015
    • Formulated the research question and experimental design for a perceptual experiment
    • Collected human perception data using a forced-choice experimental paradigm
    • Presented results using Microsoft PowerPoint and a poster at national conferences (ABRCMS)

    Research Assistant

    • JHU School of Public Health
    • Baltimore, MD
    • May 2014 - Aug 2014
    • Cleaned and processed text data using Python
    • Identified statistically significant factors associated with doctor-patient interactions
    • Modeled doctor-patient interactions using regression
    • Presented the study and results using PowerPoint and a poster at national conferences (SACNAS)
    • Wrote up the study and results in a research paper

    Research Assistant

    • UCSF
    • San Francisco, CA
    • May 2012 - Aug 2012
    • Performed apoptotic assays measuring the impact of cell death on the cellular community
    • Engineered an artificial receptor to test new technology and troubleshoot non-specific binding
    • Presented results using PowerPoint and a poster at a university-wide symposium

    Research Intern

    • Stanford University
    • Palo Alto, CA
    • June 2009 - Aug 2009
    • Engineered an artificial protein, transduced it into mammalian cells, and visualized the modified cells using microscopy
    • Presented the study and results using PowerPoint and a poster at a university-wide symposium


    Surveillance or Engagement: Children's Conflicts During Health Maintenance Visits


  • NIH-MARC (National Institutes of Health Maximizing Access to Research Careers) Fellowship: 2 years of Research Funding
  • UCLEADS (University of California Leadership Excellence through Advanced DegreeS) Fellowship: 2 years of Research Funding
  • Biology Scholars Program
  • Poster Presentations at SACNAS and ABRCMS (National Conferences)
  • Special Merit Poster Presentation Award, UCLEADS Annual Symposium


Design & Visual Work

Programming & Coding


Back in 2006, a teenage boy discovered K-pop and the magic of mixing different genres of music together. Over the years, he left home and took his mixing out of the bedroom, performing for drunken frat boys, raging liberals, and fellow music fans. He has grown from a little caterpillar into a fiery disco butterfly!