Daniel T. Liao

Dallas TX · (469) 388-8097 · dtliaomr@gmail.com

I am a full-stack web developer experienced in HTML5, CSS3, JavaScript, React, and Node.js, and a data enthusiast looking to combine data science and business expertise to solve business problems with advanced analytical techniques.

Education

The University of Texas at Dallas

Master of Science
Information Technology and Management

GPA: 3.52

Coursework:

  • Applied Machine Learning
  • Big Data
  • Programming for Data Science
  • Statistics and Data Analysis
  • Cloud Computing
  • Business Data Warehousing
  • Business Analytics with R
  • System Analysis and Project Management
  • Object-Oriented Programming in Java
  • Predictive Analytics Using SAS
Aug 2018 - Dec 2020

Indiana University Bloomington

Bachelor of Science in Recreation
Tourism, Hospitality, and Event Management

GPA: 3.0

Aug 2011 - Dec 2015

Project Experiences

Covid19 World Data

Azure - Data Factory, Storage Solutions, HDInsight, Databricks, MS Power BI
  • Built a data engineering solution architecture on the Azure technology stack so that data analysts and data scientists can report on Covid-19 trends and predict the spread of the virus
  • Integrated data from HTTP clients, Azure Blob Storage, and Azure Data Lake Gen2 using Azure Data Factory, then built pipelines with control flow activities such as Lookup, Validation, ForEach, Delete, If Condition, and Get Metadata
  • Created and executed transformation logic and copied data from Data Lake Gen2 into Azure SQL Database using Data Flows, Azure HDInsight, and Azure Databricks Notebook activities in the ADF pipelines
  • Implemented orchestration and monitoring for the ADF pipelines using triggers, alerts, and metric reporting
Jan 2021 - May 2021

Truck Fleet Risk Factor Analysis

Hadoop, Hive, Pig, R, Shell, Tableau
  • Created a process flow in the Hadoop ecosystem to calculate risk factors using a Pig script, then imported truck fleet data and loaded it into newly created Hive tables
  • Implemented a Logistic Regression model using R to identify risky drivers in the state of California
  • Created interactive Tableau dashboards that pinpoint the dangerous commercial truck drivers identified by the regression model
Jan 2020 - Mar 2020

King County House Sales Data

Regression | Python - Pandas, NumPy, Matplotlib, Seaborn, scikit-learn, Keras
  • Employed a gradient boosting regression model to predict sale prices of houses sold between May 2014 and May 2015, and improved the R-squared score from 65.12% to 90.29% through feature scaling techniques
  • Applied machine learning algorithms including regularized regression, polynomial regression, KNN, SVM regression, and other ensemble learning techniques using scikit-learn
  • Tuned hyperparameters using grid search and k-fold cross-validation
  • Compared the performance of the model with that of a Keras ANN
Aug 2019 - Dec 2019

Face Recognition App

HTML, CSS, JavaScript, React, Node.js, Express, PostgreSQL
  • Designed a web app that identifies faces in images using React, Node.js, PostgreSQL, and the Clarifai API
  • Designed a login system and enhanced security by hashing passwords and using Knex to prevent SQL injection
Apr 2018 - Jul 2018

University Library Management System

SQL, Oracle APEX, MS Visio
  • Designed and implemented library book-lending software using an Oracle SQL database
  • Developed a client-server application to add, borrow, and remove books and to calculate late fees using Oracle APEX
  • Designed a library system ER Diagram with 8 tables following normalization rules using Microsoft Visio
  • Improved SQL SELECT performance by over 90% by creating a composite index for complex lookup queries
Aug 2018 - Dec 2018

Analysis of Customer Churn in Telecommunication Sector

Classification | SAS
  • Implemented a logistic regression model to predict the likelihood that a customer left the company in the previous month, and reached 85% accuracy through stepwise model selection
  • Analyzed the behavior of telecom customers, and provided marketing suggestions for current and potential customers to maximize the profit of the company
Oct 2020 - Nov 2020

Covid-19 Data and Technology Contest

Tableau
  • Created a Covid-19 business plan with a report covering the dataset analysis, the approach, and the technology chosen
  • Presented mock-ups in the form of visualizations and recommended a technology solution to the company
Aug 2020

Web Development Skills

Programming Languages & Tools
HTML5
CSS
JavaScript
React
Python
Java
Git
Node.js
PostgreSQL
npm
Bootstrap

Awards & Qualifications
  • Google Analytics Certified
  • 2016 Microsoft Office Specialist
  • Tableau Desktop Specialist
  • Covid-19 Data and Technology Contest Finalist

Contact Me

Email: dtliaomr@gmail.com