Welcome to OGeek Q&A Community for programmer and developer-Open, Learning and Share
Welcome To Ask or Share your Answers For Others

Categories

0 votes
260 views
in Technique[技术] by (71.8m points)

python - Normalize Pandas Column with skewed values

I have a pandas dataframe univariable time series data. The values of the column range between -1 and 1. However, it is left skewed with some outlier values greater than 0.5.

I want to do time series forecasting on this dataset but the predict result has been poor irrespective of the parameter tuning. I believe one issue might be with raw data - although it is between -1 and 1, the outliers might be making it difficult to predict (note these 'outliers' are valid occurrences of the event).

My question: is there a way to normalise the data such that the extreme values will not be pronounced? I tried log-normalization but wouldn't work with negative values and other scaling methods tend to keep the distribution.

question from:https://stackoverflow.com/questions/65841490/normalize-pandas-column-with-skewed-values

与恶龙缠斗过久,自身亦成为恶龙;凝视深渊过久,深渊将回以凝视…
Welcome To Ask or Share your Answers For Others

1 Reply

0 votes
by (71.8m points)
Waitting for answers

与恶龙缠斗过久,自身亦成为恶龙;凝视深渊过久,深渊将回以凝视…
OGeek|极客中国-欢迎来到极客的世界,一个免费开放的程序员编程交流平台!开放,进步,分享!让技术改变生活,让极客改变未来! Welcome to OGeek Q&A Community for programmer and developer-Open, Learning and Share
Click Here to Ask a Question

...