Numerical Computing using Julia
This workshop will illustrate the use of the Julia language in data-driven modeling and computations. A more detailed description will be posted here soon.
As machine learning has gained popularity in science and technology in recent years, scikit-learn has become one of the must-have libraries in the general machine learning toolbox. In this lecture we will present an in-depth introduction to the scikit-learn Python library. Prerequisite knowledge for the workshop includes Python programming and basic machine learning concepts.
This workshop will discuss the basics of MPI (Message Passing Interface) and present several distributed-memory parallel numerical computing examples using MPI. The Julia language will be used in the examples. It would be straightforward to translate the Julia examples into other languages, such as C and Fortran, if desired.
Please REGISTER in advance here. The workshop will be conducted on Zoom in Pacific Standard Time (PST). Invites are sent out the day before the workshop. 3D models can be formed in a number of ways. Two common surveying methods that result in 3D models are photogrammetry and light detection and ranging, or lidar for short. A...
Please REGISTER in advance here. The workshop will be conducted on Zoom in Pacific Standard Time (PST). Invites are sent out the day before the workshop. The purpose of this workshop is to discuss commonly used techniques for handling missing data and common issues that could arise when these techniques are used. In particular, we will focus...
Python is a very popular language for computational and data science, and it has many powerful capabilities for visualizing data. Data visualization plays an essential role in representing data so that its underlying patterns might be conveyed and understood. The first part of this series will showcase various ways that Python can be used for...
Registration link: https://ucla.zoom.us/meeting/register/tJwkdu6hqj4sG9ZioB0tM8VuXxA44AWGc6W- Visual interpretations are meaningful ways to determine spatial trends in our data. However, underlying factors—such as inconsistent geographies, scale, data gaps, overlapping data—have the potential to produce incorrect assumptions, as valuable information may be conveniently hidden from the visual output. One way to address this issue is to amend your visual output...
Python is a very popular language for computational and data science, and it has many powerful capabilities for visualizing data. Data visualization plays an essential role in representing data so that its underlying patterns might be conveyed and understood. The second part of this series will explore various ways that Python can be used to...
Singularity is free, cross-platform, open-source software for operating-system virtualization, also known as containerization. This type of virtualization allows you to run an operating system within a host operating system. Hoffman2 can run containerized software with Singularity. This presentation will demonstrate using Singularity on Hoffman2. This presentation will...
Zoom registration required: https://ucla.zoom.us/meeting/register/tJAucO-orzkvHtGtJFXfN76vvx75TY-1QxIG This seminar teaches the “grammar” of graphics that underlies the ggplot2 package, allowing the user to build eye-catching, publication-quality graphics layer-by-layer. We cover the basic elements of the grammar of graphics, including aesthetics, geoms, scales, and themes, and we will show you how easy ggplot2 makes it to integrate these elements...
The Julia language has a rich and fast-growing ecosystem in the form of open-source packages. We will cover the fundamentals of Julia's package management system, using either the REPL (interactive command prompt) or the Pkg package, and show how to manage multiple sets of packages in the user environment. We will also discuss how to organize your Julia code into a package and share it with others. Registration required.
This workshop series will present an extensive discussion of how to improve the performance of Python in data science by looking under the hood of the language and its libraries, and by using technologies that make Python a practical solution for high-performance big data analytics. In the first session, we will focus on how to boost the speed of Python code at the interpreter level by explaining the relevant concepts (e.g. GIL, JIT) and introducing packages such as PyPy, Numba, Pythran, and Cython. Although there is no specific prerequisite for attending the talk, programming experience in Python will be helpful for fully understanding the lecture content.
** THIS WORKSHOP IS BEING RESCHEDULED TO MAY 10, 2021 ** The Hoffman2 Cluster is a powerful computational resource for the UCLA research community. This workshop is part I of a three-seminar series designed to introduce users to the Hoffman2 Cluster environment and to clarify the process of porting applications or using applications already available...
Zoom registration required: https://ucla.zoom.us/meeting/register/tJAocumvrTkoHdVjqIGxWTdErUZLo_Z5hccE Regression analysis is one of the most powerful statistical techniques that is used to explain variability in a response (dependent) variable as a function of one or more predictor (explanatory or independent) variables. The aim of this seminar is to help participants increase their skills in using regression analysis with R....
This workshop series will present an extensive discussion of how to improve the performance of Python in data science by looking under the hood of the language and its libraries, and by using technologies that make Python a practical solution for high-performance big data analytics. In the second session, we will focus on how to load and process very large datasets in Python on a single machine, and compare the dataframe implementations of pandas, Modin, Pandarallel, Dask, Vaex, and others. Although there is no specific prerequisite for attending the talk, programming experience with Python's NumPy and pandas packages will be helpful for fully understanding the lecture content.
** THIS WORKSHOP IS BEING RESCHEDULED TO MAY 17, 2021 ** The Hoffman2 Cluster is a powerful computational resource for the UCLA research community. This workshop is part II of a three-seminar series designed to introduce users to the Hoffman2 Cluster environment and to clarify the process of porting applications or using applications already available...
Zoom registration required: https://ucla.zoom.us/meeting/register/tJwkcuiqrjovGdFdq0u6EebmpiiX_KW00ohH This workshop will cover basic concepts of confirmatory factor analysis by introducing the CFA model and looking at examples of a one-factor, two-factor and second-order CFA. Concepts such as model identification, standardized solutions, and model fit statistics such as the chi-square statistic, CFI, TLI and RMSEA will be covered. The focus...
The Julia language has a unified interface to its plotting functionalities, supported by multiple backends such as plotly and pyplot (matplotlib). We will cover the fundamentals of Julia's plotting capability, and show the process of creating data visualization from select examples, including interactive and animated plots. Registration required.
A virtual machine is an isolated operating system environment running on top of the host computer’s original operating system. This hypervisor-based technology enables us to run one or multiple operating systems on the same host computer (e.g. a Linux virtual machine on top of a Windows host computer). However, a virtual machine can be resource-heavy...
** THIS WORKSHOP IS BEING RESCHEDULED TO MAY 24, 2021 ** The Hoffman2 Cluster is a powerful computational resource for the UCLA research community. This workshop is part III of a three-seminar series designed to introduce users to the Hoffman2 Cluster environment and to clarify the process of porting applications or using applications already available...
Zoom registration required: https://ucla.zoom.us/meeting/register/tJUrcO2qpjopHdZ1iMHcg9ksIr0czZ3ixKiL This workshop will discuss the following topics: • how complex survey data are different from other types of data • how to do basic descriptive statistics with continuous and categorical variables • how to make descriptive graphs with complex survey data • how to run a variety of regression models,...
As machine learning has gained popularity in science and technology in recent years, scikit-learn has become one of the must-have libraries in the general machine learning toolbox. In this lecture we will present an introduction to the basics of the scikit-learn Python library. Prerequisite knowledge for the workshop includes Python programming and basic machine learning concepts.
As machine learning has gained popularity in science and technology in recent years, scikit-learn has become one of the must-have libraries in the general machine learning toolbox. In this lecture we will discuss some advanced topics on using the scikit-learn Python library for high-performance machine learning, specifically speeding up modeling with multicore processors and accelerators, and out-of-core learning. The session assumes knowledge of the topics covered in the first session. Working experience with Python programming, basic machine learning concepts, and scikit-learn will be helpful.
Augmented reality (AR) is the layering of computer-generated perceptual information onto real environments. The experience requires a level of interactivity and sensory engagement. AR is being adopted across the fields of architecture and engineering, and by artists to make creative connections between our shared reality and that of innumerable virtual worlds. This...
Research projects often need to publish results to communicate their findings to a general audience. Publishing research on an interactive web-based map is an effective and visually compelling way to do so. In this workshop, participants will learn the basics of HTML, CSS and JavaScript. We will use the Leaflet library...
This seminar is a continuation of the first seminar on Confirmatory Factor Analysis (CFA) in R with lavaan. Topics include multiple group CFA, measurement invariance, and latent growth modeling. Some time will be given at the end for interactive coding exercises. Instructor: Johnny Lin, Ph.D., OARC/IDRE Statistical Consulting. Time: Aug 16, 2021 01:00 PM Pacific Time...
Workshop description and Zoom registration link coming soon!
This workshop is designed to give an overview of generalized linear models. The workshop introduces the basic theory of generalized linear models and their implementation in R. We will talk about a broad range of regression models such as logistic regression, Poisson regression, negative binomial, zero-inflated Poisson, and zero-inflated negative binomial and how to run...
Video Link: https://youtu.be/RVdWM1SbB4s The IDRE Early Career Researchers group is excited to restart its monthly meetings. This first meeting will introduce five IDRE scholars selected from a large pool of applicants, followed by a panel discussion on interdisciplinary research and collaboration. The following eminent UCLA researchers will be the panelists: Karen McKinnon, Institute of the Environment...
Speaker: Ricky Savjani, Ph.D. IDRE Scholar, Department of Radiation Oncology, University of California Los Angeles Abstract: Several technological advances in radiotherapy have enabled the use of focused radiation to treat solid tumors within the thoracic cavity. Stereotactic Body Radiation Therapy (SBRT) offers a way to treat patients with high doses of...
The IDRE ECR Group is excited to announce the Machine Learning for Oceanic & Atmospheric Sciences workshop, with the following details: Title: Machine Learning for Oceanic & Atmospheric Sciences Date and Time: Friday, January 7, 2022 @10 AM (PST) Registration: https://ucla.zoom.us/meeting/register/tJclde2opjsiGdU0sq31tMP5YUXd8FbvCbCm Abstract: Machine learning (ML) denotes a host of computational methods for inferring meaningful patterns in data....
Git is a software tool that helps users manage changes to their software over time. Git will allow you to maintain a complete change history of every file, create branches for concurrent streams of changes, trace changes with annotations, and collaborate and share work with others. This interactive introduction will demonstrate how to use Git...
MPI (Message Passing Interface) is a standardized interface for portable distributed-memory scientific parallel computing. The portability ensures that a properly-written, standard-conforming MPI program can work the same way on different platforms ranging from laptop computers to massively parallel supercomputers. MPI has been widely used in advanced simulations, data analysis and visualization in the last two...
Description coming soon. Register in advance: Please REGISTER using the Zoom Meeting Link before joining! If you have any questions regarding this event, please contact Ben Winjum.
In this workshop we will focus on methods for running Matlab interactively and for running Matlab programs via the job scheduler on the Hoffman2 Cluster. (We will not describe how to write Matlab scripts or how to use its internal/external functions.) Several methods to launch the Hoffman2 Matlab GUI from different operating systems will be discussed, including the...
Speaker: Casey Youngflesh IDRE Scholar, Department of Ecology and Evolutionary Biology, University of California Los Angeles Time: 11:30 AM – 12:30 PM (PST) Date: Jan 28, 2022 Location: Zoom (RSVP here for the link) Abstract: Rapid abiotic environmental change is driving a multitude of shifts in natural systems across the Earth. One of...
MPI (Message Passing Interface) is a standardized interface for portable distributed-memory scientific parallel computing. The portability ensures that a properly-written, standard-conforming MPI program can work the same way on different platforms ranging from laptop computers to massively parallel supercomputers. MPI has been widely used in advanced simulations, data analysis and visualization in the last two...
This workshop will introduce how to use Python/matplotlib and other packages to generate data visualizations and animations for publications or presentations. We will discuss plotting functions and techniques available in both Matlab and Python to handle and customize your graphics in practice. The materials include simple 2D/3D plots to present scalars and vectors, filled plots...
Image: Landsat imagery of the 2020 Bobcat wildfire, generated using Python. Register here. Satellites are circling our planet, allowing us to "sense" things about the Earth. Remote sensing is the art and science of making measurements using sensors. It has thus become a valuable tool in research and applications in a wide range of disciplines, such...
Registration link: https://ucla.zoom.us/meeting/register/tJYqc-6rqz0qGdCvJuus9bUKV2xVtQHZQnHu The Hoffman2 Cluster is a powerful computational resource for the UCLA research community. This workshop is part of a three-seminar series designed to introduce users to the Hoffman2 Cluster environment and to clarify the process of porting applications or using applications already available on the cluster. It also addresses how to port...
The workshop will be conducted on Zoom in Pacific Standard Time (PST). Please Register here in advance for this lecture. This workshop will be an introduction to convolutional neural networks. We will start by learning how to use PyTorch for image processing on the classic Dogs-vs-Cats problem. The knowledge of topics covered in the previous sessions is...
MPI (Message Passing Interface) is a standardized interface for portable distributed-memory scientific parallel computing. The portability ensures that a properly-written, standard-conforming MPI program can work the same way on different platforms ranging from laptop computers to massively parallel supercomputers. MPI has been widely used in advanced simulations, data analysis and visualization in the last two...
Registration: https://ucla.zoom.us/meeting/register/tJwpd-uoqDIvG91UKYByaA-7Xcbn4V38rA1k R Markdown files integrate text, Markdown, and R code into dynamic documents that weave together plain text, formatted text, and the output of the R code. The resulting dynamic reports can be produced in many formats, including HTML documents, HTML slideshows, LaTeX pdf, Beamer slideshows, MS Word doc, books, scientific articles, and websites. ...
Registration link: https://ucla.zoom.us/meeting/register/tJIvd-igpjMiGNVkod90GFNhBTuT7pwRfj5- The Hoffman2 Cluster is a powerful computational resource for the UCLA research community. This workshop is part of a three-seminar series designed to introduce users to the Hoffman2 Cluster environment and to clarify the process of porting applications or using applications already available on the cluster. It also addresses how to port...
The workshop will be conducted on Zoom in Pacific Standard Time (PST). Please Register here in advance for this lecture. This workshop will be the second lecture introducing convolutional neural networks. We will continue by learning how to apply data augmentation and transfer learning techniques to get a better solution for the classic Dogs-vs-Cats...
Registration: https://ucla.zoom.us/meeting/register/tJIpf--rrj4vE9DWH23IkyLvjjrk-md-hwYe This workshop covers Stata commands and methods for common data management tasks, such as identifying data errors, identifying duplicated data, specifying missing values, working with string variables, labeling variables, creating new variables, merging datasets, processing data by groups, and using loops for repetitive tasks. The workshop is focused on preparing data for statistical...