Shih Peng Wen

I am a Python Developer with expertise in Data Science and Natural Language Processing.

In my previous role, I worked at the Communication Data and Network Analytics Lab (CDNA), part of the Research Center for Humanities and Social Sciences (RCHSS) at Academia Sinica.

While my academic background is in Sociology, I have built a comprehensive data science skill set through self-study. I have 2 years of practical experience, including data collection and data cleaning, and my expertise extends to Natural Language Processing (NLP).

I completed my Master’s degree in Sociology at Tunghai University. In my thesis, I used Sequential Analysis, a qualitative approach from Objective Hermeneutics, to understand how individuals in Taiwan perceive entering higher education. My passion for sociology stems from my interest in how people interact with each other and how these interactions shape the structure of society.


personal projects

  1. NDLTD TW Papers Graph
    • This project uses Vue.js for the frontend and FastAPI for the backend, with OpenSearch as the vector database.
    • It uses GitHub Actions to automate the CI/CD process and is hosted on AWS.
    • It uses AWS Lambda as a web scraper to gather data, Sentence-Transformer to capture the meaning of the text, and KNN to find articles that are similar to each other.
    • Inspired by: Keyword Analysis (GroundAI)
    • https://ndltd-tw-papers-graph.wspooong.com
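
The similarity step described above can be illustrated with a minimal sketch. This is not the project's actual code: the toy vectors stand in for Sentence-Transformer embeddings, a brute-force cosine search stands in for the KNN query served by OpenSearch, and every name here (`cosine_similarity`, `nearest_neighbors`, `toy_embeddings`, the paper IDs) is hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # treat a zero vector as dissimilar to everything
    return dot / (norm_a * norm_b)


def nearest_neighbors(query_id, embeddings, k=2):
    """Return the k entries most similar to query_id as (id, score) pairs."""
    query = embeddings[query_id]
    scored = [
        (other_id, cosine_similarity(query, vector))
        for other_id, vector in embeddings.items()
        if other_id != query_id  # skip the query document itself
    ]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]


# Hypothetical 3-dimensional vectors; real sentence embeddings
# typically have hundreds of dimensions.
toy_embeddings = {
    "paper_a": [1.0, 0.0, 0.0],
    "paper_b": [0.9, 0.1, 0.0],
    "paper_c": [0.0, 1.0, 0.0],
    "paper_d": [0.0, 0.0, 1.0],
}

neighbors = nearest_neighbors("paper_a", toy_embeddings, k=2)
print(neighbors)
```

Here `paper_b` ranks first for `paper_a` because its vector points in nearly the same direction. A brute-force scan like this only suits small collections; a vector database handles the same lookup at scale.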

skill-based project

  1. Facebook Group Scraper