Skip to content

A set of simple studies and solutions

Predict whether a mammogram mass is benign or malignant

  • Data: The dataset can be found from University of Irvine: Mammographic Mass.
  • Goal: Build a Multi-Layer Perceptron and train it to classify masses as benign or malignant based on its features.
  • Challenges: The data needs to be cleaned; many rows contain missing data, and there may be erroneous data identifiable as outliers as well.
  • Approach:

    • Review data quality, and missing data. Drop if not a lot of records are wrong
    • Transform the data to be usable by sklearn using numpy

See personal notebook in mammogram_mass folder

Computer vision with PyTorch: classify sushi, pizza and steak

  • Data: The food 101 dataset from PyTorch vision
  • Goal: Develop a NN to classify images
  • Challenges: The number of layers
  • Approach: Develop a basic NN and then compare it with existing CNN
Demonstration with pytorch scripts
  • Create a virtual env and install requirements under the pytorch folder; pip install -r requirements.txt
  • Under computer vision, load the data sets locally: python prepare_image_dataset.py --classes sushi,steak,pizza
  • Do a simple classification using a Tiny VGG: python classify_food.py
  • Use transfer learning (see explanations here): python transfer_learning.py

Other use case

  • Data:
  • Goal:
  • Challenges:
  • Approach: