Hypothesis

29 Matching Annotations

Dec 2023
pythonspeed.com pythonspeed.com

Reducing Pandas memory usage #2: lossy compression

2
1. GadjiMurad 15 Dec 2023
  
  in Public
  
  Technique #2: Sampling
  
  How do you load only a subset of the rows?
  
  When you load your data, you can specify a skiprows function that will randomly decide whether to load that row or not:
  
```

from random import random

def sample(row_number): ... if row_number == 0: ... # Never drop the row with column names: ... return False ... # random() returns uniform numbers between 0 and 1: ... return random() > 0.001 ... sampled = pd.read_csv("/tmp/voting.csv", skiprows=sample) len(sampled) 973 ```

sampling pandas python
2. GadjiMurad 15 Dec 2023
  
  in Public
  
  lossy compression: drop some of your data in a way that doesn’t impact your final results too much.
  
  If parts of your data don’t impact your analysis, no need to waste memory keeping extraneous details around.
  
  python pandas optimization loosy compression
Visit annotations in context

Tags

loosy compression

python

pandas

optimization

sampling

Annotators

GadjiMurad

URL

pythonspeed.com/articles/pandas-reduce-memory-lossy/
Jul 2023
app.datawars.io app.datawars.io

DataWars - Become an expert Data Scientist

1
1. hadezb 29 Jul 2023
  
  in Public
  
  The parameter by specifies the columns, and ascending takes a list to define the sorting direction per each column. In this case, we're sorting by Country name in descending order first (in lexicographical order), and by number of Employees in ascending order second.
  
  Pandas DataFrame allows for multiple sorting
  
  pandas
Visit annotations in context

Tags

pandas

Annotators

hadezb

URL

app.datawars.io/project/f27bda00-8447-4595-84a6-c678e18e6c46
www.geeksforgeeks.org www.geeksforgeeks.org

How To Do Train Test Split Using Sklearn In Python - GeeksforGeeks

3
1. borhani_matin 18 Jul 2023
  
  in Public
  
  pd.read_csv
  
  با پانداس میاد میخونه
  
  Pandas
2. borhani_matin 18 Jul 2023
  
  in Public
  
  df.head()
  
  خیلی راحت head می خونه.
  
  Pandas
3. borhani_matin 18 Jul 2023
  
  in Public
  
  X= df['Head Size(cm^3)']y=df['Brain Weight(grams)']
  
  معرفی کرد Feature و Label خودشا
  
  Pandas
Visit annotations in context

Tags

Pandas

Annotators

borhani_matin

URL

geeksforgeeks.org/how-to-do-train-test-split-using-sklearn-in-python/
app.datawars.io app.datawars.io

DataWars - Become an expert Data Scientist

1
1. hadezb 11 Jul 2023
  
  in Public
  
  Bollinger bands are just a simple visualization/analysis technique that creates two bands, one "roof" and one "floor" of some "support" for a given time series. The reasoning is that, if the time series is "below" the "floor", it's a historic low, and if it's "above" the "roof", it's a historic high. In terms of stock prices and other financial instruments, when the price crosses a band, it's said to be too cheap or too expensive.
  
  How to display Bollinger bands with Pandas.
  
  pandas finance
Visit annotations in context

Tags

pandas

finance

Annotators

hadezb

URL

app.datawars.io/project/93af5053-337b-4d16-bc65-faeb1349a6fd
May 2023
www.w3schools.com www.w3schools.com

Pandas DataFrame describe() Method

1
1. borhani_matin 27 May 2023
  
  in Public
  
  Panda
  
  با استفاده از این تابع میشه برای ستون های عددی مقدار Count و Avg و غیره را بدست آورد
  
  Pandas
Visit annotations in context

Tags

Pandas

Annotators

borhani_matin

URL

w3schools.com/python/pandas/ref_df_describe.asp
www.w3schools.com www.w3schools.com

Pandas DataFrame shape Property

1
1. borhani_matin 27 May 2023
  
  in Public
  
  Panda
  
  تعداد ردیف و ستون اون Data Frame را برمیگردونه.
  
  Pandas
Visit annotations in context

Tags

Pandas

Annotators

borhani_matin

URL

w3schools.com/python/pandas/ref_df_shape.asp
www.w3schools.com www.w3schools.com

Pandas DataFrame head() Method

1
1. borhani_matin 27 May 2023
  
  in Public
  
  Return the first 5 rows of the DataFrame
  
  5 تا ردیف اول را برات بر میگردونه. یه ورودی هم شاید بگیره که در واقع تعداد ردیف هایی است که میخواد برگردونه
  
  Pandas
Visit annotations in context

Tags

Pandas

Annotators

borhani_matin

URL

w3schools.com/python/pandas/ref_df_head.asp
www.w3schools.com www.w3schools.com

Pandas Tutorial

1
1. borhani_matin 27 May 2023
  
  in Public
  
  Pandas is a Python library.
  
  یکی از کتاب خونه های خوبه Python.
  
  Pandas
Visit annotations in context

Tags

Pandas

Annotators

borhani_matin

URL

w3schools.com/python/pandas/default.asp
Apr 2023
codeberg.org codeberg.org

AGENTE_DE_TECNOLOGIA_-_Microrregião_158_-_TI_-_GABARITO_1_1682348326404_0.pdf

1
1. giobon 26 Apr 2023
  
  in Public
  
  ff = ef['x','y']
  
  Máscaras em Pandas são uma maneira de selecionar um subconjunto de dados de um DataFrame, Series ou outro objeto de dados baseado em uma condição booleana.
  
  O código que deve ser adicionado no lugar de # a fazer é:
  
  ff = ef[['x', 'y']]
  
  Isso irá selecionar apenas as colunas 'x' e 'y' do DataFrame ef, que é o resultado da máscara m. A máscara m seleciona apenas as linhas onde o valor da coluna 'z' é False, e então, ef contém apenas essas linhas. Finalmente, ff é criado selecionando as colunas 'x' e 'y' do DataFrame ef.
  
  Pandas
Visit annotations in context

Tags

Pandas

Annotators

giobon

URL

codeberg.org/giobon/pages/raw/branch/pages/assets/AGENTE_DE_TECNOLOGIA_-_Microrregião_158_-_TI_-_GABARITO_1_1682348326404_0.pdf
Dec 2021
foresttechnology.blog foresttechnology.blog

Comparing Financial Analysis with Excel and Python/Pandas

1
1. SamRose 09 Dec 2021
  
  in Public
  
  python pandas financial model cash flow
Visit annotations in context

Tags

financial model

python

cash flow

pandas

Annotators

SamRose

URL

foresttechnology.blog/2021/05/11/comparing-financial-analysis-with-excel-and-python-pandas/
Nov 2021
www.tensorflow.org www.tensorflow.org

Tutorials | TensorFlow

2
1. aries1988 26 Nov 2021
  
  in Public
  
  date_time = pd.to_datetime(df.pop('Date Time'), format='%d.%m.%Y %H:%M:%S')
  
  pandas time
2. aries1988 26 Nov 2021
  
  in Public
  
  df.describe().transpose()
  
  GP pandas eda
Visit annotations in context

Tags

eda

pandas

time

GP

Annotators

aries1988

URL

tensorflow.org/guide/data
Sep 2021
stackoverflow.com stackoverflow.com

Extrapolate values in Pandas DataFrame

1
1. SamRose 09 Sep 2021
  
  in Public
  
  pandas dataframe extrapolate curve fitting
Visit annotations in context

Tags

extrapolate

pandas

dataframe

curve fitting

Annotators

SamRose

URL

stackoverflow.com/questions/22491628/extrapolate-values-in-pandas-dataframe
arrow.apache.org arrow.apache.org

Connecting Relational Databases to the Apache Arrow World with turbodbc

1
1. SamRose 07 Sep 2021
  
  in Public
  
  python turbodbc apache arrow pandas
Visit annotations in context

Tags

pandas

python

turbodbc

apache arrow

Annotators

SamRose

URL

arrow.apache.org/blog/2017/06/16/turbodbc-arrow/
Aug 2020
nextjournal.com nextjournal.com

Data science intro with panthera

1
1. SamRose 07 Aug 2020
  
  in Public
  
  clojure python pandas
Visit annotations in context

Tags

python

clojure

pandas

Annotators

SamRose

URL

nextjournal.com/schmudde/data-science-intro-with-panthera
Mar 2020
jvns.ca jvns.ca

SQL queries don't start with SELECT - Julia Evans

1
1. pyxelr 02 Mar 2020
  
  in Public
  
  It’s just that it often makes sense to write code in the order JOIN / WHERE / GROUP BY / HAVING. (I’ll often put a WHERE first to improve performance though, and I think most database engines will also do a WHERE first in practice)
  
  Pandas usually writes code in this syntax:
  
  JOIN
  
  WHERE
  
  GROUP BY
  
  HAVING
  
  Example:
  
  df = thing1.join(thing2) # like a JOIN
  
  df = df[df.created_at > 1000] # like a WHERE
  
  df = df.groupby('something', num_yes = ('yes', 'sum')) # like a GROUP BY
  
  df = df[df.num_yes > 2] # like a HAVING, filtering on the result of a GROUP BY
  
  df = df[['num_yes', 'something1', 'something']] # pick the columns I want to display, like a SELECT
  
  df.sort_values('sometthing', ascending=True)[:30] # ORDER BY and LIMIT
  
  df[:30]
  
  pandas Python
Visit annotations in context

Tags

pandas

Python

Annotators

pyxelr

URL

jvns.ca/blog/2019/10/03/sql-queries-don-t-start-with-select/
Nov 2019
github.com github.com

pandas-dev/pandas

1
1. bourbakis 18 Nov 2019
  
  in Public
  
  data-analysis pandas flexible alignment python
Visit annotations in context

Tags

pandas

python

data-analysis

flexible

alignment

Annotators

bourbakis

URL

github.com/pandas-dev/pandas
Oct 2019
pandas.pydata.org pandas.pydata.org

pandas.read_csv — pandas 0.24.2 documentation

1
1. hmstepanek 20 Oct 2019
  
  in Public
  
  Indicate number of NA values placed in non-numeric columns.
  
  This is only true when using the Python parsing engine.
  
  Filled 3 NA values in column name
  
  If using the C parsing engine you get something like the following output:
  
  Tokenization took: 0.01 ms Type conversion took: 0.70 ms Parser memory cleanup took: 0.01 ms
  
  pandas python
Visit annotations in context

Tags

python

pandas

Annotators

hmstepanek

URL

pandas.pydata.org/pandas-docs/stable/reference/api/pandas.read_csv.html
Feb 2019
stackoverflow.com stackoverflow.com

Efficient way to loop over Pandas Dataframe to make dummy variables (1 or 0 input)

1
1. haiy 20 Feb 2019
  
  in Public
  
  Efficient way to loop over Pandas Dataframe to make dummy variables (1 or 0 input)
  
  dummy encoding
  
  pandas machine-learning
Visit annotations in context

Tags

pandas

machine-learning

Annotators

haiy

URL

stackoverflow.com/questions/33977673/efficient-way-to-loop-over-pandas-dataframe-to-make-dummy-variables-1-or-0-inpu
Jun 2018
stackoverflow.com stackoverflow.com

How to check if any value is NaN in a Pandas DataFrame

1
1. rschulz 27 Jun 2018
  
  in Public
  
  if you need to pull out these rows and examine them
  
  python pandas nan null
Visit annotations in context

Tags

python

nan

null

pandas

Annotators

rschulz

URL

stackoverflow.com/questions/29530232/how-to-check-if-any-value-is-nan-in-a-pandas-dataframe
May 2018
github.com github.com

conversion pandas.DataFrame to result and vice versa in python query (code attached) · Issue #2078 · getredash/redash

1
1. SamRose 21 May 2018
  
  in Public
  
  redash pandas
Visit annotations in context

Tags

pandas

redash

Annotators

SamRose

URL

github.com/getredash/redash/issues/2078
Apr 2018
geopandas.org geopandas.org

GeoPandas 0.3.0 — GeoPandas 0.3.0 documentation

1
1. rschulz 12 Apr 2018
  
  in Public
  
  GeoPandas
  
  python pandas GIS map
Visit annotations in context

Tags

python

GIS

map

pandas

Annotators

rschulz

URL

geopandas.org/index.html
Mar 2018
simplistic.me simplistic.me

Playing with GTFS IV - Cleanup Diaries | cjer

1
1. cjer 03 Mar 2018
  
  in Public
  
  I'll skip the inefficient method I used before with the custom groupby aggregationm, and go for some neat trick using the mighty transform method.
  
  a more constrained. and thus more efficient way to do transformations on groupbys than the apply method. You can do very cool stuff with it. For those of you who know splunk - this has the neat "streamstats" and "eventstats" capabilities
  
  pandas splunk transform evenstats streamstats
Visit annotations in context

Tags

pandas

evenstats

splunk

transform

streamstats

Annotators

cjer

URL

simplistic.me/playing-with-gtfs-iv-cleanup-diaries.html
Dec 2017
tomaugspurger.github.io tomaugspurger.github.io

datas-frame – Modern Pandas (Part 7): Timeseries

1
1. shantanuo 03 Dec 2017
  
  in Public
  
  gs.resample("5d").mean().head()
  
  pandas
Visit annotations in context

Tags

pandas

Annotators

shantanuo

URL

tomaugspurger.github.io/modern-7-timeseries
tselai.com tselai.com

Analyzing 1000+ Greek Wines With Python | Florents Tselai

2
1. shantanuo 02 Dec 2017
  
  in Public
  
  df['n_votes'] = df.n_votes.astype(int, errors='ignore')
  
  pandas
2. shantanuo 02 Dec 2017
  
  in Public
  
  ax = df['color'].value_counts().plot('bar')
  
  pandas
Visit annotations in context

Tags

pandas

Annotators

shantanuo

URL

tselai.com/greek-wines-analysis.html

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL