Software -

Create and evaluate databases for RAG

Posted on August 14, 2024 | • Other languages: ja

Introduction.

Create a databases that can be used by RAG from the text data created yesterday, prepare a few specific strings, and search and evaluate them.

[Read More]

Creating text data for RAG from Wikipedia dump data

Posted on August 13, 2024 | • Other languages: ja

Motivation

I am experimenting with RAG using LangChain and was thinking about what to use for data for checking and decided to use wikipedia dump data. Since the volume of the whole is large, I decided to use data from the astronomy-related categories that I am interested in.

Here, I summarized a series of steps to extract only specific categories of data from the wikipedia dump data.

[Read More]

llama-cpp-python - impact of numpy version upgrade

Posted on July 4, 2024 | • Other languages: ja

Introduction.

NumPy 2.0.0 was released on June 16. I first noticed it the other day when I tried RAG with using langchain and got an error when building the docker container. Later, I encountered another error in CMake when trying to incorporate llama-cpp-python.

This article summarizes my responses to the two errors I recently experienced.

Background

I recently decided to learn RAG properly, I purchased a japanese book called LLM fine tuning and RAG. The book uses langchain, so I decided to create a docker container for jupyterlab that incorporates the langchain library.

[Read More]

Try RAG with LlamaIndex

Posted on May 25, 2024 | • Other languages: ja

Motivation

In this post where I tested Chatbot UI, I mentioned that one of my future challenges is to work with RAG (Retrieval Augmented Generation). In this post, I summarized how to achieve RAG using LlamaIndex.

Actually, I tried RAG using Langchain late last year. Since then, I have heard a lot of keywords with LlamaIndex, so I decided to realize RAG using LlamaIndex this time.

[Read More]

Try the Chatbot UI

Posted on May 6, 2024 | • Other languages: ja

Introduction

In a recent post, I ran the ELYZA 7B model in a local environment using llama-cpp-python. In that post, I mentioned that “about the future” I would like to try to build a system that can chat like ChatGPT.

This time, I built a system that can chat like ChatGPT on a docker container, and I summarize its contents here.

[Read More]

Running Elyza models on GPU using llama-cpp-python

Posted on May 3, 2024 | • Other languages: ja

Motivation

Quantization is essential to run LLM on the local workstation (12-16 GB of GPU memory). In this post, I summarize my attempt to maximize GPU resources using llama-cpp-python.

The content includes some of my mistakes, as I got into some areas due to my lack of understanding.

[Read More]

Measuring OpenMPI performance again using the HIMENO benchmark

Posted on March 20, 2024 | • Other languages: ja

Introduction

I have changed the hostfile that determines the order of OpenMPI execution nodes and re-measured OpenMPI performance on the Himeno benchmark as this article I posted it. After posting, I thought about it again and decided to use objective figures instead of my own judgments based on CPU and clock performance.

So this time, I decided to measure the performance of each individual workstation (node), and then decide the order of hostfile according to the results, and measure them again.

[Read More]

Re-measure OpenMPI performance using the HIMENO benchmark

Posted on March 17, 2024 | • Other languages: ja

Introduction

A month ago in this post, I measured the performance of OpenMPI with the HIMENO benchmark. My friend who saw that post pointed out some improvements regarding the order of the hostfile. In this post, I summarized the results of the performance measurement again after modifying the hostfile.

[Read More]

Rayleigh-Taylor Instability - Athena++ Tutorial 4 Additional Assignment

Posted on March 15, 2024 | • Other languages: ja

Introduction

In this article, I posted about my work on tutorial 4 of Athena++. Here, I post my work on the Rayleigh-Taylor instability challenge in 3D.

[Read More]

Visualization of Simulation Results - Athena++ Tutorial 4

Posted on March 10, 2024 | • Other languages: ja

Introduction.

Visualization of 3D magnetohydrodynamic simulation results computed in parallel in this post. The visualization is done with VisIt 3.3.3 running on a Mac.

[Read More]

Create and evaluate databases for RAG

Introduction.

Creating text data for RAG from Wikipedia dump data

Motivation

llama-cpp-python - impact of numpy version upgrade

Introduction.

Dealing with errors related to NumPy 2.0.0

Background

Try RAG with LlamaIndex

Motivation

Try the Chatbot UI

Introduction

Running Elyza models on GPU using llama-cpp-python

Motivation

Measuring OpenMPI performance again using the HIMENO benchmark

Introduction

Re-measure OpenMPI performance using the HIMENO benchmark

Introduction

Rayleigh-Taylor Instability - Athena++ Tutorial 4 Additional Assignment

Introduction

Visualization of Simulation Results - Athena++ Tutorial 4

Introduction.