ZHANG Chuyue | Data Analyst Portfolio

📁 Projects Portfolio

📝 LSTM-Based Sentiment Analysis on Twitter User Comments (Apr 2024 – Jul 2024)

Problem: Accurately analyze emotional polarity in Twitter user comments to understand public opinion, especially handling simple vocabulary, emojis, and complex symbols.

Data: Twitter user comments dataset containing varied text (simple words, emojis, and complex symbols).

Approach: Built a neural network with embedding layer, Dropout, bidirectional LSTM layer, and fully connected layer; trained using Adam optimizer; performed literature review, model reproduction, and performance improvements.

Outcome/Impact: Achieved 98% accuracy on simple-vocabulary comments, 91% on emoji/complex-symbol comments, and 88% overall test-set accuracy. The model showed strong adaptability across comment types and provided empirical support for LSTM-based sentiment analysis.

Your contribution: Team member – innovated and optimized the model after reproducing baseline, organized literature review, visualized results and training charts, created presentation PPT, proofread/improved project report, wrote main paper content, and refined format/structure for accuracy and logical rigor.

View Acceptance Letter (EEIC 2024)

🗄️ Reverse Engineering and Optimisation Design of Meituan Waimai Business Database (Sep 2025 – Dec 2025)

Problem: The complex multi-sided food delivery platform (Meituan Waimai) required reverse engineering of its database to support user membership, payment processing, and review systems while ensuring data integrity and query efficiency.

Data: Conceptual data derived from core app functionalities (user profiles, addresses, payments, transactions, reviews, restaurants, riders).

Approach: Identified entities, attributes, and primary keys; designed ER diagrams for three modules (membership, payment, review); created relational schema and normalized tables to 3NF; designed denormalized queries for user behavior analysis.

Outcome/Impact: Delivered a coherent, normalized relational database schema that supports high-frequency transactions, real-time status updates, and efficient analytics dashboards; enabled targeted marketing through user segmentation and membership value analysis.

Your contribution: Team member – contributed to entity/relationship modeling, ER diagram creation, relational schema design, normalization process, and user behavior data analysis design.

🕹️ Dynamic Maze Explorer: Maze Navigation Game Based on Deep Q-Learning (Personal Project)

Problem: Create an interactive maze navigation game where an agent must learn optimal paths in a stochastic environment with moving traps, collectible coins, and random maze layouts.

Data: Dynamically generated 10×10 mazes (perimeter + random internal walls) with real-time state representation (agent, goal, traps, coins, walls).

Approach: Implemented Deep Q-Network (DQN) using PyTorch (3-layer NN, replay buffer, target network, ε-greedy); custom dense reward function with distance shaping; Pygame for rendering, animations, UI, and manual/AI modes.

Outcome/Impact: Agent converged to positive rewards (+303.4 max in 4 steps), win rate improved from <5% to ~25%; created an educational tool demonstrating reinforcement learning in dynamic, uncertain environments with real-time visualization.

Your contribution: Individual work (environment design, DQN implementation, reward engineering, UI, training pipeline, and documentation).

▶️ View YouTube Demo 🔗 View GitHub

🖼️ Deep Learning for Image Classification on CIFAR-100 (Jan 2025 – May 2025)

Problem: Build and optimize deep learning models for accurate classification on the challenging CIFAR-100 dataset (100 fine-grained classes, small 32×32 images, limited samples per class).

Data: CIFAR-100 dataset (60,000 color images: 50,000 train, 10,000 test; split 80/20 train/validation + official test).

Approach: Preprocessing (normalization + augmentation: random crop, horizontal flip); baselines (linear model, MLP with ablation on hidden size/activation/dropout/weight decay); advanced residual-style CNN with BatchNorm, dropout, and residual connections; controlled experiments on loss, LR, and batch size.

Outcome/Impact: Best MLP configuration achieved 36.20% test accuracy; comprehensive ablation studies and learning curves demonstrated hyperparameter impact; provided clear insights into transitioning from simple to advanced architectures for real-world computer vision tasks.

Your contribution: Implemented training pipelines, conducted all ablation experiments, generated learning curves, accuracy tables, visualizations, and performance analysis (group project).

🔗 View GitHub

Seeking Position: Data Analyst / Business Intelligence Intern

📌 About Me

⚙️ Skills

Technical Skills

Business Skills

Tools

📁 Projects Portfolio

📝 LSTM-Based Sentiment Analysis on Twitter User Comments (Apr 2024 – Jul 2024)

🗄️ Reverse Engineering and Optimisation Design of Meituan Waimai Business Database (Sep 2025 – Dec 2025)

🕹️ Dynamic Maze Explorer: Maze Navigation Game Based on Deep Q-Learning (Personal Project)

🖼️ Deep Learning for Image Classification on CIFAR-100 (Jan 2025 – May 2025)

📄 My Resume

📫 Contact Me