Course
Introduction to Data Science (DAT540)
The course will provide a knowledge and experience in data engineering tasks and will accustom students with data science project lifecycle.
Dette er emnebeskrivelsen for studieåret 2019-2020. Merk at det kan komme endringer.
Semesters
Fakta
Emnekode
DAT540
Vekting (stp)
10
Semester undervisningsstart
Autumn
Undervisningsspråk
English
Antall semestre
1
Vurderingssemester
Autumn
Content
Learning outcome
Knowledge:
- Execute tools to load, parse, clean, transform, merge, reshape, and store data.
- Compare regular Python, NumPy, and Pandas data structures and choose one for the given problem. Use the IPython shell and Jupyter notebook for exploratory computing.
- Execute simple machine learning or data mining algorithms.
Skills:
- Organize data analysis following CRiSP-DM and Data Science Process
- Build engaging visualizations of data analysis using matplotlib
- Optimize data analysis applying available structure and methods
- Evalute, communicate and defend results of data analysis
General qualifications:
- Solve real-world data analysis problems following a well-structured process
Forkunnskapskrav
Exam
Form of assessment | Weight | Duration | Marks | Aid | Exam system | Withdrawal deadline | Exam date |
---|---|---|---|---|---|---|---|
Project work and oral presentation | 1/1 | Letter grades | — | — | — |
Project is completed in groups. Project work is to be performed at the times and in the groups that are assigned and published. Absence due to illness or for other reasons must be communicated as soon as possible to the lecturer.
Both project and oral examination must be done before final grade is given. Each group member can receive a different grade based on their performance during the oral examination.
If a student fails the projectwork , he/she have to take this part again next time the subject is lectured.
Vilkår for å gå opp til eksamen/vurdering
Fagperson(er)
Head of Department:
Tom RyenCourse coordinator:
Antorweep ChakravortyMethod of work
Åpent for
Emneevaluering
Litteratur
(Required) Python for Data Analysis 2nd Edition, McKinney, O'Reilly ISBN: 978-1491957660
(Optional) Python Data Science Handbook, VanderPlas, O'Reilly ISBN: 978-1491912058
(Optional) Building Machine Learning Systems with Python 2nd Edition, Coelho and Richert, PACKT ISBN: 978-1784392772