5 New Rules for Clean Code with the Pandas Library

jean.jimbo · October 17, 2023, 3:46pm

Hello, Data Scientists & Pythonistas,

We’re happy to announce the following 5 rules to help you write Clean Code with the popular Pandas library in Python:

S6734 “inplace=True” should not be used when modifying a Pandas DataFrame
S6735 Using the required parameters for “pandas.merge” or “pandas.join”
S6740 Using the required parameters for “pandas.read_csv” or “pandas.read_table”
S6741 The “pandas.DataFrame.to_numpy()” method should be preferred to the “pandas.DataFrame.values” attribute
S6742 The “pandas.pipe” method should be preferred over long chains of instructions

These rules will be available in SonarCloud shortly and will be available in SonarQube 10.3. SonarLint users can also enjoy quick fixes for rules S6741 and S6735 in the next SonarLint release.

We welcome your feedback on these rules. Do take a look at what’s coming up for Python in SonarLint, SonarQube and SonarCloud .

Jean

MaicoTimmerman · February 20, 2024, 10:26am

Hi Jean,

I’ve tried to create an issue in the issue tracker, but it seems to be closed down.

There is a false-positive in S6742 when using pyspark.sql.DataFrame:

    df = create_spark_df()  # signature def create_spark_df() -> DataFrame:
    df2 = (
        df.where(
           ... 
        )
        .withColumn(
            "name",
            ...
        )
        .where(...)
        .withColumn("count", F.lit(1))
        .transform(
            lambda df: ...
        )
        .transform(lambda df: ...)
        .withColumn(...)
    )

Due to a simple comparison with DataFrame at this line.

maksim.grebeniuk · February 29, 2024, 9:43am

Hello @MaicoTimmerman

Big thanks for the reporting.
We created the ticket to fix the issue.

Thanks,
Maksim Grebeniuk

Topic		Replies	Views
Write High Quality PySpark Python Code with SonarQube Sonar Updates python , jupyter-notebooks , pyspark , data-engineering	0	61	March 11, 2025
Rules for Clean Code with Numpy and Python Computations Sonar Updates python , numpy , data-science	0	1168	September 28, 2023
9 New Python Rules & Support for Ruff Reports Sonar Updates sonarqube , python , sonarqube-cloud	0	1642	August 25, 2023
Python 3.12 support and new rules Sonar Updates sonarqube , python , sonarqube-ide , sonarqube-cloud	0	822	November 2, 2023
False-positive on python:S4143 Report False-positive / False-negative...	1	434	October 18, 2023

5 New Rules for Clean Code with the Pandas Library

Related topics