About Me

Who Am I?

Hi I'm Jaqueline Lu. I am studying Computer Science in University of California, Los Angeles (UCLA) with a focus of software engineering, data science and computer vision.

I fell in love with data as I believe machines and technologies are the future we are pacing towards.With data at hand, I am able to look deeply into our society and human behaviors, and protect or prevent them.

Starting from 2020, since the COVID-19 broke out, I have also joined the 'Wuhan2020' team to collect data from hospitals, resources and "help" messages sent from those in Wuhan, then used data visualization to showcase the severity of the disease.

In my spare time, I like reading books, watching Sci-Fi movies, singing and travelling.

Download CV

Experience

Work Experience

Research Assistant on Data and IT Pytorch|Medical Imaging|Computer Vision 06/2020 - 08/2020

Employer: UCLA BIG Summer with Prof. Daniel Tward

What I do:

  • Developed an algorithm to build a “brain atlas” - a reference neuroimage - to quantify cell distribution patterns in brain
  • Created a PyPI package and Python modules with new scatter transform methods using convolution networks and gaussian downsample
  • Performed 2D and 3D image registration that combine multiple high-resolution images with deformations
  • 87% accuracy in alignment prediction using machine learning techniques like Linear Discriminant Analysis and Random Forest
  • Scattering downsample

Software and Data Science Intern Pytorch|Classification 05/2020 - present

Employer: Redux Recycle

What I do:

  • Developed a software for trash bins to classify trash objects
  • Use Resent50 model in Pytorch with some manipulations to learn massive recycling objects datasets, and predict their classes
  • Performed data mining skills and image augmentation to create thorough datasets with almost real images
  • Designed an algorithm with CNN model using Pytorch with 70+% accuracy of prediction, surpassing almost all existing methods
  • Redux

Data Discovery Specialist Machine Learning 03/2020 - 05/2020

Employer: Rainfoest Connection (RFCx)

What I do:

  • Train the AI model, review alerts from the acoustic monitoring machines put in the ecosystems to improve the ML models and create more new ML models.
  • Perform data cleaning, data selection, feature engineering and model selection on acoustic data collected.
  • Work with other engineers, developers and hackers to tackle problems from the machines and models.
Projects

Projects

Google Landmark Recognition 2020 Tensorflow | Computer Vision | Kaggle

Datasets from: GLDv2

What I do:

  • Using Tensorflow backend to first rank all training images by embedding similarity to test images, then performs geometric verification and re-ranking on most similar training images using KD tree
  • Fine-tune parameters to achieve a bronze medal on leaderboard
  • Landmark

    Flowers Recognition Tensorflow | Pytorch | Computer Vision | Kaggle

    Datasets from: Kaggle

    What I do:

  • Build a deep learning model using densenet169 in Pytorch and the one using efficientnet B7 in Tensorflow to classify 104 classes of flowers
  • Extracted information in tfrec data, performed data augmentation, then trained 70k datasets with TPU to achieve 97% accuracy
  • within 50 epochs in top 8% on leaderboard
  • Flowers

    Obstacles Detection for Self‐Driving Cars Pytorch

    What I do:

  • Built an object detection model for cars that analyzes short videos taken by webcam and detects obstacles like green or red traffic
  • lights, pedestrians, and other vehicles etc.
  • Combined with video analysis to create a pipeline, then examined with YOLOv5 and R-CNN algorithms
  • Trained on 15k real images Datasets with 70% accuracy within 1000 epoches
  • Detection Obstacles

    Machine Learning-based Super-Resolution MRI Tensorflow | Medical Imaging

    Datasets from: RAISR

    What I do:

  • Evaluated the accuracy of Deep Learning and Google RAISR algorithms - a ML technique that produces high-quality versions of low-resolution image - in MRI imaging datasets
  • Explored Enhanced Deep ResNet (EDSR) and Wide Activation Deep ResNet (WDSR) with Tensorflow
  • The result demonstrated RAISR was yet to be a qualified model for MRI analysis compared to EDSR
  • MRI image

    Categorical Feature Encoding Challenge II Tensorflow | Kaggle

    Datasets from: Kaggle

    see code

    What I do:

    • Using Tensorflow backend, I build a neural network model combined from Dense-NN, batch normalization and dropout for this datasets.
    • Since there are already too many columns, I do label encode for the features, and use Kfold validation to evaluate the models.
    • As a comparison, I also perform one hot encode (with more focus on the features themselves) and run catboost model on it. After tuning hyper-parameters, it also does a good job in prediction.

    NYC Taxi Duration Machine Learning | Kaggle

    Datasets from: Kaggle

    see code

    What I do:

    • The large dataset includes about 100k rows. I perform data selection and EDA based on data visiualization drawn to filter out the data that are most related, and do feature engineering (Principal Component Analysis to divide up the map of Manhattan into regions)
    • Perform model selection on different regression models (linear, Lasso, Ridge, SVR etc.) and make better use of the features by putting different weights

    Scrapy Program: House on Rent and on Sale Python | SQL

    What I do:

    • Using Python (requests and beautiful soup) and SQL to capture the housing rent and sale on certain websites, given the input from users specifying the name of the city and number of outputs.
    • It collects the information and transforms to an Excel spreadsheet that is used to create data visualization.

    My Specialty

    My Skills

    Python

    80%

    C/C++

    75%

    SQL

    75%

    Machine Learning Modules

    scikit-learn, pandas, matplotlib, seaborn

    80%

    Tensorflow

    60%

    Pytorch

    70%

    Microsoft Office

    Word, Excel, PowerPoint, Outlook

    90%

    Multilangual

    Mandarin, Cantonese, English

    90%
    Get in Touch

    Contact

    Los Angeles, California, US