My name is Szymon Jakubiak

I'm an engineer, which means I really like solving technical problems. I also enjoy modern AI and data-driven solutions that make the whole process even more adorable.
Please, take a minute and look around. I'm convinced you may find something interesting among my projects.

Heating nodes - Demand estimation

This project is focused on data processing and building models for estimation of power demand and flow in customers' heating nodes, depending on time of the year.

Problems:

  • filtering out errors of measuring instruments
  • rapid demand change detecion
  • season classification
  • demand modeling

Technologies:

  • pandas
  • NumPy
  • scikit-learn
  • SQL
  • Git

Designed for internal use with IT infrastructure of Veolia Energia Warszawa S.A.

Airliners - Image classifier

Dockerized web application that allows for classification of airliner model based on uploaded image.

Problems:

  • collecting photos of airliners
  • filtering out irrelevant images
  • training image classifier
  • web application deployment

Technologies:

  • fastai
  • pytorch
  • docker
  • dash
  • plotly

Turbofan engines market - Patents

The main goal of this project is to discover and describe changing trends in the turbofan engines market, based on patent applications.

Problems:

  • collect data from European Patent Office
  • interactive visualization

Technologies:

  • Open Patent Services API
  • R
  • shiny
  • ggplot2

Airplane crashes - Natural language processing

This project contains analysis of reports from airplane crashes. The goal is to find common types of these accidents and recognize the most fatal ones.

Problems:

  • word tokenization
  • tokens preprocessing
  • topic modeling

Technologies:

  • pandas
  • spaCy
  • gensim

r/WorldNews - Analysis of posts

Project aimed for analysis of news, with COVID-19 background. Based on posts from subreddit r/wordnews and Our World in Data virus dataset.

Problems:

  • scraping reddit posts
  • data processing and quality assessment
  • visualisation

Technologies:

  • Pushshift web API
  • pandas
  • Matplotlib
  • Jupyter Notebook

Mixing up - GUI experiment

Application with basic GUI for calculating mixture's components in correct proportions.

Problems:

  • correct division between the view and the model
  • dynamic interface rebuilding
  • validating user input
  • tracking source code changes

Technologies:

  • PyQt5
  • Git

From scratch - neural network in C++

Keras-like neural network library build from scratch in C++. Shipped with MNIST handwritten digits dataset, formatted for my application.

Problems:

  • building full set of features from data loading to stochastic gradient descent
  • enabling future extension of loss/activation functions
  • effective use of STL containers

Technologies:

  • C++
  • make