4 April 2018

Feature Structure Goals in Spark

Classification & Regression
End Goal:
  • Column of type Double to represent Label
  • Column of type Vector (Sparse or Dense) to represent Features
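
The label-plus-vector layout above can be sketched in plain Python. This is only an illustration of the shapes involved, not Spark code: the `SparseVector` and `LabeledRow` classes below are hypothetical stand-ins, and in Spark itself the corresponding types come from `pyspark.ml.linalg` (for example `Vectors.dense` and `Vectors.sparse`).

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass(frozen=True)
class SparseVector:
    """Hypothetical stand-in: stores only the non-zero entries."""
    size: int
    values: Dict[int, float]  # index -> non-zero value

    def to_dense(self) -> List[float]:
        # Fill every position that is not stored explicitly with 0.0.
        return [self.values.get(i, 0.0) for i in range(self.size)]


@dataclass(frozen=True)
class LabeledRow:
    """Hypothetical stand-in for one row: a Double label plus a feature vector."""
    label: float
    features: List[float]


# The same four features written densely and sparsely.
dense = [1.0, 0.0, 0.0, 3.0]
sparse = SparseVector(size=4, values={0: 1.0, 3: 3.0})

row = LabeledRow(label=1.0, features=sparse.to_dense())
```

Both forms describe the same vector; the sparse form saves space when most entries are zero.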
Recommendation
End Goal:
  • Column of Users
  • Column of Items
  • Column of Ratings
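
As a rough sketch of the users, items, and ratings columns above: Spark MLlib's ALS expects numeric user and item ids, so string identifiers usually need to be mapped to integers first. The plain-Python snippet below shows one way to do that mapping. The sample data and the `index_of` helper are hypothetical, and the first-seen ordering is an assumption made for illustration.

```python
# Hypothetical raw data: (user, item, rating)
raw = [
    ("alice", "book", 5.0),
    ("bob", "film", 3.0),
    ("alice", "film", 4.0),
]


def index_of(values):
    """Assign each distinct value an integer id in first-seen order."""
    ids = {}
    for v in values:
        ids.setdefault(v, len(ids))
    return ids


user_ids = index_of(u for u, _, _ in raw)
item_ids = index_of(i for _, i, _ in raw)

# Final shape: one integer user column, one integer item column, one rating column.
ratings = [(user_ids[u], item_ids[i], r) for u, i, r in raw]
```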
Unsupervised Learning
End Goal:
  • Column of type Vector (Sparse or Dense) to represent Features
Graph Analytics
End Goal:
  • DataFrame of Vertices
  • DataFrame of Edges
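
The vertices-and-edges layout above can be sketched in plain Python as two lists of records. The column names `id`, `src`, and `dst` follow the convention used by the GraphFrames package. The sample data and the `relationship` attribute are hypothetical, and the dangling-edge check is an illustrative sanity test, not part of any Spark API.

```python
# Hypothetical vertex table: one record per vertex, keyed by "id".
vertices = [
    {"id": "a", "name": "Alice"},
    {"id": "b", "name": "Bob"},
    {"id": "c", "name": "Carol"},
]

# Hypothetical edge table: each record points from "src" to "dst".
edges = [
    {"src": "a", "dst": "b", "relationship": "follows"},
    {"src": "b", "dst": "c", "relationship": "follows"},
]

# Sanity check: every edge endpoint should refer to a known vertex id.
vertex_ids = {v["id"] for v in vertices}
dangling = [
    e for e in edges
    if e["src"] not in vertex_ids or e["dst"] not in vertex_ids
]
```

An empty `dangling` list means the two tables are consistent with each other.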