Types of Data and their References

from sklearn.model_selection import train_test_split

# Splitting data into training and testing sets
train, test = train_test_split(cdf, test_size=0.2, random_state=42)

2. Model Training

When training machine learning models, you can set a random seed to ensure reproducibility. For example, with scikit-learn models:

from sklearn.ensemble import RandomForestRegressor

# Initializing a Random Forest model with a random seed
model = RandomForestRegressor(random_state=42)
model.fit(train.drop('target', axis=1), train['target'])

3. Setting Random Seed for Libraries

If you are using libraries that involve random number generation, you can set the seed globally:

NumPy

import numpy as np

np.random.seed(42)

TensorFlow

import tensorflow as tf

tf.random.set_seed(42)

PyTorch

import torch

torch.manual_seed(42)
if torch.cuda.is_available():
    torch.cuda.manual_seed(42)

픽셀펭귄의 실험실

Types of Data and their References

1. Tabular Data

2. Text Data

3. Image Data

4. Tabular Text Image

** Random Seed

1. Data Splitting

2. Model Training

3. Setting Random Seed for Libraries

티스토리툴바