Machine Learning Archives - DEEPLINK CORE Lab

How to split your dataset? train_test_split, KFold, StratifiedKFold

Splitting the dataset is one of the crucial steps in training machine learning models. It helps in preventing overfitting to

Retrieval-Augmented Generation (RAG) is an innovative AI technology that combines information retrieval and text generation. In this post, we will

Retrieval-Augmented Generation(RAG) 모델은 정보 검색과 텍스트 생성을 결합한 혁신적인 AI 기술이다. 이번 포스트에서는 Python을 사용하여 간단한 RAG 모델을 구현하고, 그

Structured Data What is Structured Data? Structured data is data that follows a standardized format and is well-defined in structure.

NumPy는 파이썬에서 과학 계산을 위해 널리 사용되는 라이브러리이다. 그러나 가끔 NumPy를 사용할 때 ValueError: operands could not be broadcast together

컴퓨터 비전, 이미지 인식은 컴퓨터가 이미지나 영상에서 패턴을 인식하게 하는 기술로, 의료, 자율주행, 보안 등 다양한 분야에서 활용되고 있다. 이번

데이터 보강(Data Augmentation) 이란? 기존 데이터 세트를 변형하여 새로운 데이터를 생성하는 기술로 실질적인 데이터 세트의 규모를 키울 수 있는 방법이다.이를

Mixed Data Learning은 다양한 유형의 데이터를 사용하여 학습하는 기술이다. 이러한 데이터는 텍스트, 이미지, 오디오, 비디오 등 여러 형식일 수 있다.

데이터세트 분할은 머신 러닝 모델을 학습시키기 위한 중요한 단계 중 하나이다. 머신러닝 모델 학습에 있어서 데이터세트를 학습용(train), 검증용(validation), 테스트용(test)으로 나누는

2023년 1월 3일, 언론매체 The Information은 마이크로소프트가 Bing 검색엔진에 OpenAI의 ChatGPT 기술을 적용할 것이라 보도했다. ChatGPT는 대화수준이 실제 인간의 것도