Research interests

  • I am interested in developing application-driven machine learning innovations, particularly for geospatial, geophysical, and Antarctic science.
  • Geospatial modelling
  • Physics-informed ML
  • Climate, Earth Science, and Geophysics applications
  • Antarctica and the Cryosphere
  • Earth observation (EO)
  • EO foundation models
  • Active learning with Bayesian Optimisation (BO)
  • Causal inference

About

Headshot of Kim Bente

My name is Kim Bente and I am a final year Machine Learning PhD student at the School of Computer Science, University of Sydney ,supervised by Prof Fabio Ramos (NVIDIA, University of Sydney), and A/Prof Roman Marchant (Human Technology Institute, UTS). My research interests are centred around using probabilistic machine learning and statistical computing methods to address Climate Science problems. I am particularly interested in the quantification of uncertainty to inform high-stakes decision in the climate domain. I am currently working on the application Bayesian Optimisation to Antarctic research problems, including sensor network design, data fusion and ice core drilling site determination. I am a member of the DARE (Data Analytics for Resources and Environments) ARC Training Centre, an Industrial Transformation training centre, led by the University of Sydney. I have gained extensive experience on interdisciplinary research projects, most notably collaborating with Nutrition and Dietetics research, and also working on educational data, as well as criminology data.

I have completed the Master of Data Science [with high distinction] from the University of Sydney in 2020 and I hold a Bachelor of Science from the Technical University Munich, in Management & Technology, specialising in Chemical Engineering and Finance.

Please contact me via kim.bente@sydney.edu.au

Check out my GitHub repositories for preprocessing pipelines of Antarctic datasets (e.g. Bedmap3 points, Bedmap maps, BedMachine, RACMO 2.4, GRACE-FO, MODIS MOA, NASA's MEaSUREs datasets), Physics-Informed ML models built in PyTorch, Uncertainty Quatification using GPyTorch and BOTorch for example, and much more.

ML4Cryo news

ML reading group

I organise our weekly Machine Learning & Robotics reading group at the University of Sydney, led by Prof Fabio Ramos. If you're interested in joining you can sign up to the USYD ML reading group mailing list.

Publications

Conferences

Seminars & talks

Reviewing

  • NeurIPS 2025 Tackling Climate Change with Machine Learning workshop
  • Conference on Uncertainty in Artificial Intelligence (UAI) 2025
  • ICLR 2025 Tackling Climate Change with Machine Learning workshop
  • Conference on Uncertainty in Artificial Intelligence (UAI) 2024
  • ICLR 2024 Tackling Climate Change with Machine Learning workshop

Education

  • Machine Learning PhD student (current), The University of Sydney, Australia
    'Uncertainty Quantification and Physical Priors for Geospatial Models of Antarctica', supervised by Professor Fabio Ramos and A/Prof Roman Marchant
    member of the DARE (Data Analytics for Resources and Environments) ARC Training Centre
  • Master of Data Science, The University of Sydney, Australia
    completed with High Distinction in 2020
    Final research project (Capstone) 'New Perspectives on the Validation of a Mobile App for Dietary Assessment using Data Mining', supervised by Professor Judy Kay
    Faculty of Engineering Summer Research Program Scholarship 2019/2020 for the project 'Data mining of dietary patterns', supervised by Prof Judy Kay, A/Prof Irena Koprinska and A/Prof Kalina Yacef
  • Graduate Certificate in Data Science, The University of Sydney, Australia
    completed with High Distinction in 2019
  • Bachelor of Science in Management & Technology, Technical University of Munich, Germany
    Specialisation in Chemical Engineering
    Bachelor's Thesis about 'Knowledge Management in a Dynamic Organisation'

Teaching

  • Tutor for Computer Science Research Methods [INFO5993], School of Computer Science
    Semester 2 2024, postgraduate research education
  • Tutor for Computer Science Research Methods [INFO5993], School of Computer Science
    Semester 1 2023, postgraduate research education
  • Tutor for Human-in-the-Loop Data Analytics [DATA3406], School of Computer Science
    Semester 2 2022, undergraduate level tutoring
  • Tutor for Data Analysis in the Social Sciences [DATA5207], United States Study Centre
    Intensive February 2021, postgraduate level tutoring
  • Tutor for Algorithms [COMP9007], School of Computer Science
    Semester 2 2019, postgraduate level tutoring
  • Course analytics for Human-in-the-Loop Data Analytics [DATA3406], School of Computer Science
    Educational data analytics, student performance feedback with SRES, development of a computer-based marking program for Semester 2 2020

Professional experience

  • Customer Analytics and Customer Relationship Management at FOURSOURCE, a tech start-up in apparel sourcing, Berlin, Germany
  • Innovation Consulting and Ideation, Junior Innovation consultant, HYVE, Munich, Germany
  • Management Consulting, media sector, working student, goetzpartners advisory, Munich, Germany
  • Research & Development (R&D) and International Marketing, internships, consumer goods sector, Schwarzkopf (Henkel), Hamburg, Germany
  • Product development and product management, internship, consumer goods sector, Kneipp, Würzburg, Germany

Outside of research...

  • Running, Cycling, Tennis, Hiking, Swimming, Skiing
  • Designing and sewing timeless garments and other textile creations
  • Cooking healthy food, travelling, exploring art

Daily Data Dose

The Daily Data Dose [DDD] is a compilation of my favourite data-related resources, both informative & entertaining. My shortlist includes podcasts, websites, talks and other useful things that I came across. Most of these I have recommended to a plethora of data-enthusiasts, so it was time to put a list on the web. I thank everyone who has introduced me to any of these gems. Links are included below.

  • PolarGlobe 4D climate data visualisation


    Amazing nowcasts as well as 12 day forecasts of global wind and ocean currents for the whole globe.

  • The Vanishing Gradients podcast


    This is my new favourite Data Science podcasts!

  • The Dataframed podcast


    This is my previous favourite Data Science podcasts! I have listened to every of the 60 episodes at least once, soaking up insights from industry to academia. I can't recommend this enough, and if you know me I will have likely already recommended it to you.

  • My compilation of Antarctic data sets


    A repository on my GitHub with an overview of icy data sets, frosty models and other high-latitude data collections. I focus on the Antarctic ice sheet.

  • 'AI for Earth' projects by Microsoft


    I learnt about some of the projects from 'AI for Earth' at a very inspiring talk by Jennifer Marsman at the Microsoft reactor in Sydney. I was very fascinated by the creative use of AI, especially by the World Mosquito Program and the Whalebook. I recommend checking out the projects.

  • Process Mining Coursera course by Professor Wil van der Aalst


    I took this course on Process Mining which is not only interesting from an algorithmic point of view but very relevant for any temporal or procedural data set. Great course!

  • "Environment Variables: how software impacts climate change" - Merrin Macleod (PyCon AU 2019)


    This is an outstanding talk which was recommended to me by a colleague (Thanks Ben!). If you care about the environment and you work in tech, you will surely learn something new here.

  • The Pudding


    The Pudding is website which publishes visual essays about societal and cultural topics. They are pushing the boundaries of data journalism and therefore always worth a digital visit.

  • Gapminder


    Gapminder is a great organisation that promotes a more fact-based worldview. They also have an extensive collection of open access data sets. Furthermore, I highly recommend reading the book 'Factfulness' by Hans Rosling.

  • Information is Beautiful


    A collection of informative and visually appealing data visualisations.

  • Google dataset search tool


    A search engine for data sets. A very useful tool to find openly available data sets.

  • Data Science by John D. Kelleher and Brendan Tierney


    I recommend this book to family and friends who want to understand what I do, but I also like going back to this pocket book by MIT Press to reset my own perspective for the big picture.

  • Papers with code


    Professor Caren Han's NLP course [COMP5046] introduced me to this site, which is a great resource to stay on top of the latest ML research in different fields.

  • Datasaurus, the new Anscombe's quartet


    An updated and more creative version of the Anscombe's quartet, including animated visualisations. This demonstrates once again that summary statistics are not enough to describe data.

  • Spurious correlations blog by Tyler Vigen


    A good reminder that correlation ≠ causation.

  • The True size of ...


    This is a fun web tool to understand the distortion of the size of countries, as they are displayed on the world map.

Contact

built by Kim Bente