Try RAG with LlamaIndex
Motivation
In this post where I tested Chatbot UI, I mentioned that one of my future challenges is to work with RAG (Retrieval Augmented Generation). In this post, I summarized how to achieve RAG using LlamaIndex.
Actually, I tried RAG using Langchain late last year. Since then, I have heard a lot of keywords with LlamaIndex, so I decided to realize RAG using LlamaIndex this time.
[Read More]Try the Chatbot UI
Introduction
In a recent post, I ran the ELYZA 7B model in a local environment using llama-cpp-python. In that post, I mentioned that “about the future” I would like to try to build a system that can chat like ChatGPT.
This time, I built a system that can chat like ChatGPT on a docker container, and I summarize its contents here.
[Read More]Running Elyza models on GPU using llama-cpp-python
Motivation
Quantization is essential to run LLM on the local workstation (12-16 GB of GPU memory). In this post, I summarize my attempt to maximize GPU resources using llama-cpp-python.
The content includes some of my mistakes, as I got into some areas due to my lack of understanding.
[Read More]Measuring OpenMPI performance again using the HIMENO benchmark
Introduction
I have changed the hostfile that determines the order of OpenMPI execution nodes and re-measured OpenMPI performance on the Himeno benchmark as this article I posted it. After posting, I thought about it again and decided to use objective figures instead of my own judgments based on CPU and clock performance.
So this time, I decided to measure the performance of each individual workstation (node), and then decide the order of hostfile according to the results, and measure them again.
[Read More]Re-measure OpenMPI performance using the HIMENO benchmark
Introduction
A month ago in this post, I measured the performance of OpenMPI with the HIMENO benchmark. My friend who saw that post pointed out some improvements regarding the order of the hostfile. In this post, I summarized the results of the performance measurement again after modifying the hostfile.
[Read More]Rayleigh-Taylor Instability - Athena++ Tutorial 4 Additional Assignment
Introduction
In this article, I posted about my work on tutorial 4 of Athena++. Here, I post my work on the Rayleigh-Taylor instability challenge in 3D.
[Read More]Visualization of Simulation Results - Athena++ Tutorial 4
Introduction.
Visualization of 3D magnetohydrodynamic simulation results computed in parallel in this post. The visualization is done with VisIt 3.3.3 running on a Mac.
[Read More]3D Magnetohydrodynamic Simulation and Parallel Computing - Athena++ Tutorial 4
Introduction
Up to this article posted last month, I have confirmed that OpenMPI can be embedded in a Docker container and used for parallel computing on multiple nodes. In this post, I will use the Docker container created above to run tutorial 4 “Running 3D MHD with OpenMP and MPI” of Athena++ on multiple nodes.
[Read More]easuring OpenMPI performance using the HIMENO benchmark
Motivation
As I stated in this post yesterday, I was able to run a program using OpenMPI on a Docker container running on multiple nodes. I wanted to find out how much performance I could improve by using OpenMPI, so I decided to benchmark it. Actually, I had some difficulties this time as well, and I would be happy if that part is helpful for others.
[Read More]