An Empirical Exploration of Python Machine Learning API Usage

dc.access.option: Open Access
dc.contributor.advisor: Hills, Mark
dc.contributor.author: Vilkomir, Aleksei
dc.contributor.department: Computer Science
dc.date.accessioned: 2020-12-18T15:45:57Z
dc.date.available: 2020-12-18T15:45:57Z
dc.date.created: 2020-12
dc.date.issued: 2020-11-16
dc.date.submitted: December 2020
dc.date.updated: 2020-12-18T14:31:23Z
dc.degree.department: Computer Science
dc.degree.discipline: MS-Software Engineering
dc.degree.grantor: East Carolina University
dc.degree.level: Masters
dc.degree.name: M.S.
dc.description.abstract: Machine learning is becoming an increasingly important part of many domains, both inside and outside of computer science. With this has come an increase in developers learning to write machine learning applications in languages like Python, using application programming interfaces (APIs) such as pandas and scikit-learn. However, given the complexity of these APIs, they can be challenging to learn, especially for new programmers. To create better tools for assisting developers with machine learning APIs, we need to understand how these APIs are currently used. In this thesis, we present a study of machine learning API usage in Python code in a corpus of machine learning projects hosted on Kaggle, a machine learning education and competition community site. We analyzed the most frequently used machine-learning-related libraries and the sub-modules of those libraries. Next, we studied which calls developers use to solve machine learning tasks. We also identified which libraries are used in combination and discovered a number of cases where libraries were imported but never used. We end by discussing potential next steps for further research and development based on our results.
dc.format.mimetype: application/pdf
dc.identifier.uri: http://hdl.handle.net/10342/8796
dc.language.iso: en
dc.publisher: East Carolina University
dc.subject: Machine Learning API
dc.subject: Machine Learning exploratory
dc.subject.lcsh: Machine learning
dc.subject.lcsh: Application program interfaces (Computer software)
dc.subject.lcsh: Python (Computer program language)
dc.title: An Empirical Exploration of Python Machine Learning API Usage
dc.type: Master's Thesis
dc.type.material: text

Files

Original bundle

Name: VILKOMIR-MASTERSTHESIS-2020.pdf
Size: 285.89 KB
Format: Adobe Portable Document Format