Curriculum Vitae - Xiao Song

Chinese version
PDF

CONTACT

SOFTWARE & PROGRAMMING SKILLS

  • Python, R, SQL (Including MySQL, SQL Sever, SQLite)
  • SPSS, Stata, Git, MS Office, \(\rm{\LaTeX}\)

EDUCATION

TRAINING EXPERIENCE

WORKS & PROJECTS

  • Spam Message Dectection R Shiny App 
    2019-12~2020-03
    This program uses 5567 pieces of English short message data as a training set and trains algorithms such as Logistic Regression, Naive Bayes, Decision Tree, Random Forest, and Support Vector Machine. The trained model is written as a Shiny App based on the R language. The user can enter a text message and select a classifier to get the classification result of the text. Considering the user’s language habits, a Chinese-English bilingual interface switching function is specially set up.
    urlgithub

ACADEMIC RESEARCH

  • Machine Learning in Social Sciences: Based on China Education Panel Survey 2020
    Bachelor Degree Thesis (PDF)

  • Welfare Effect and Social Inequality of Land Transfer: Empirical Analysis Based on CFPS 2018-2019
    National Innovation Training Program for College Students, Independent author
    The data of China Family Panel Survey (CFPS) were used for data cleaning and econometric analysis through Stata and R. Using Unconditional Quantile Regression and Fixed Effect Model estimate the welfare effect of land transfer behavior and its impact on social inequality. Using R’s ggplot2 software package to visualize geographic information. Output chart and finally form about 12,000 words of research papers. (PDF)

HONORS AND AWARDS

  • 2020 Kaggle M5 Forecasting - Accuracy
    Estimate the unit sales of Walmart retail goods
    103rd/5558 Top2% Silver Medal

  • 2020 Kaggle M5 Forecasting - Uncertainty
    Estimate the uncertainty distribution of Walmart unit sales
    18th/909 Top2% Silver Medal

  • 2019 Third Class Academic Honors (East China Normal University)

  • 2019 Daxia Cup Student Academic Works Competition in ECNU  Third Award

  • 2019 The 2nd National University Data Driven Innovation Research Competition Excellence award

  • 2018 12th Social Science Forum for Undergraduates Highest Award

  • 2018 Daxia Cup Student Academic Works Competition in ECNU  Second Award

  • 2018 Second Class Academic Honors (East China Normal University)

  • 2017 Second Class Academic Honors (East China Normal University)

  • 2017 The 12th Wisdom Cup Philosophical Essay Competition in ECNU First Award

CONFERENCE PRESENTATIONS

WORK & INTERNSHIP EXPERIENCE

  • Zhongnan University of Economics and Law Data Consultant 2020-02~2020-04 Remote internship. Use Xgboost, RandomForest, LightGBM and other algorithms to classify (multiclass) legal text data. The word frequency method is used to construct the feature matrix, and the cross-validation training model (sklearn) is used to obtain the cross-validation accuracy of 0.75. I write a program to make predictions on new data, so that the prediction results can be applied to any new data set.

  • iResearch Data Analyst 2019-07~2019-09

    • Using R and SPSS to analysis profile of cars’ users. Through PCA and Cluster analysis, I catogorized survey data and found cars users’ attitude difference.
    • Using MySQL database to help analyze users’ data.
    • Using Hive SQL to help access Hadoop database.

 

STANDARDIZED TEST

Verbal Quantity Writing
154 167 3.5
Reading Listening Speaking Writing
29 27 21 26

OTHER EXPERIENCE

  • 2018-2019 East China Normal University
    Regression Analysis and Stata Application (Shisong Qing)
    Teaching Assistant