Currently viewing the human version
Switch to AI version

What is Salesforce Data Loader?

Salesforce Data Loader Interface

Salesforce Official Logo

Data Loader is what you use when Salesforce's web importer shits the bed for the third time this week. Version 64.1.0 is the current release (Summer '25) and actually works, which is more than I can say for the previous versions.

Salesforce's built-in Data Import Wizard craps out at 50,000 records. That's cute. Data Loader handles up to 150 million records through Bulk API 2.0. I've personally pushed 2 million contact records through this thing in one go, and it didn't break a sweat - which honestly surprised the hell out of me.

How This Thing Actually Works

Data Loader GUI Screenshot

You get two ways to use Data Loader: a GUI for clicking around when you're doing one-off imports, and a CLI for automation. Here's the kick in the teeth - CLI only works on Windows. If you're on Mac and want automation, you're screwed harder than a one-legged cat in a sandbox. It's 2025 and they still haven't figured out cross-platform scripting.

The thing talks to Salesforce through their Bulk API, which is actually decent when it's not having one of its famous afternoon naps. You can tweak batch sizes, but the default 200 records per batch works fine unless you're dealing with some weird edge case that makes you question your career choices. You need Java 17+ installed, and it works on Windows 10/11 and macOS 13-15. ARM Macs finally work, thank fuck.

The data processing is actually pretty straightforward for once - your CSV gets chunked into batches, each batch gets processed asynchronously, and you get detailed success and error logs for when shit inevitably goes sideways. Unlike most Salesforce tools, the error messages actually make sense instead of just saying "ERROR: Something bad happened."

What You Can Actually Do With It

Data Loader does five basic operations:

Works with all Salesforce objects - standard stuff like Accounts and Contacts, plus whatever custom objects your admin dreamed up. Field mapping is drag-and-drop, which is nice. When things break (and they will), you get detailed CSV error logs that actually tell you what went wrong. Pro tip: 100K records takes about 20-30 minutes if everything goes right, 2 hours if it doesn't.

Security Stuff You Need to Know

OAuth 2.0 Authentication Flow Diagram

Version 64.1.0 finally ditched security tokens for OAuth 2.0 with PKCE, which means no more copy-pasting that stupid token from your email every time you reset your password. Thank god.

Data transfer is encrypted over HTTPS, and CLI credentials get encrypted locally. But here's the gotcha that'll bite you in the ass - your exported CSV files just sit there unencrypted on your hard drive like a security team's worst nightmare. If you're dealing with sensitive data, encrypt that shit yourself or store it somewhere secure. I learned this the hard way when our compliance team had a complete meltdown over exported customer data sitting in plain text in my Downloads folder for three weeks.

Set up a dedicated 'data exports' folder with encryption enabled, or your security team will make your life miserable during the next audit - and trust me, they remember everything.

The tool also supports IP restrictions and respects your org's login hours, which is actually useful for limiting when automated jobs can run.

Salesforce Data Loader vs Alternative Tools

Feature

Salesforce Data Loader

Skyvia

Dataloader.io

Data Import Wizard

Workbench

Installation

Desktop Application

Cloud-Based

Web-Based

Built-in Salesforce

Web-Based

Pricing

Free

Free plan: 10,000 records/month
Paid: $19-$99/month

Free: 10,000 records/month
Paid: $99-$299/month

Free

Free

Max Records

150 million (Bulk API 2.0)
5 million (Bulk API)

10,000/month (Free)
Unlimited (Paid)

10,000/month (Free)
Unlimited (Paid)

50,000 per import

API-limited

Supported Operations

Insert, Update, Upsert, Delete, Export

Insert, Update, Upsert, Delete, Export, Sync

Insert, Update, Upsert, Delete, Export

Insert, Update, Upsert

Insert, Update, Upsert, Delete, Export

Automation

CLI (Windows only)
Manual scheduling

Built-in scheduler
Recurring jobs

Scheduled tasks
Cloud automation

None

API-based scripts

Data Sources

CSV files only

200+ connectors
Databases, APIs, Cloud storage

CSV files
Cloud storage (Dropbox, Box)

CSV files only

CSV files
SOQL queries

Field Mapping

Drag-and-drop
Save/reuse mappings

Visual mapping
Transformations

Simple field mapping

Auto-mapping
Manual override

Manual field selection

Error Handling

Actually useful CSV logs
Shows what broke

Automated retry
Decent logging

Error reports
GUI-based review

Shows basically nothing

You're on your own

Platform Support

Windows 10/11, macOS 13-15

Any browser

Any browser

Any browser

Any browser

API Usage

Bulk API, Bulk API 2.0

REST API, Bulk API

Bulk API, REST API

SOAP API

REST API, Bulk API

Learning Curve

Moderate

  • just don't fuck up your mappings

Pretty easy

Brain-dead simple

Idiot-proof

Steep as hell

  • you need SOQL

Best For

Large-scale operations
Automated workflows

Multi-system integration
Scheduled sync

Small to medium imports
Simple automation

Occasional imports
Business users

Developers
API testing

Setup and Configuration Reality Check

Here's how you actually get Data Loader working without wanting to throw your laptop out the window.

Getting This Thing Installed

Java Installation Requirements

Download it here and install like any other desktop app. You need Java 17+ - if you don't have it, the installer will crash faster than my motivation on Monday morning. Download Java from Oracle, install it, then try again.

The "minimum" requirements are 120MB disk space and 256MB RAM, which is complete bullshit. In reality, if you're processing anything serious, give it at least 2GB RAM or it'll be slow as hell. I learned this when trying to import 500K records on a laptop with 4GB total RAM - took 6 hours and crashed twice, nearly giving my manager an aneurysm.

Check the installation troubleshooting guide if things go sideways. The macOS installation is usually smooth, but Windows can be a pain with permission issues.

Permission Bullshit You'll Deal With

Your admin needs to enable "API Enabled" permission in your user profile, which they'll probably forget to do. You'll get "INVALID_LOGIN: Invalid username, password, security token; or user locked out" and waste an hour wondering why Data Loader won't connect before realizing you need to bug them again.

You also need read/write access to whatever objects you're importing to, plus "Bulk API" permission for the good stuff. If your import fails with some cryptic "INSUFFICIENT_ACCESS" error, this is probably why.

Check the user permission checklist to make sure you have everything needed. The permission set configuration is usually cleaner than modifying profiles directly.

Batch Size: Don't Overthink It

Default batch size is 200 records, which works fine 90% of the time. You can crank it up to 2000 if you're feeling brave, but you'll probably hit some API limit or timeout. I stick with the default unless I'm in a hurry.

Bulk API 2.0 bumped the record limit from 5 million to 150 million, which is actually impressive. I've never hit either limit, but it's nice to know I probably won't.

Batch processing is straightforward - Data Loader chunks your CSV into batches, submits them asynchronously, and polls for job status updates. Monitor API usage because each batch counts against your daily limits.

Command Line Setup (Windows Only, Because Screw Mac Users)

Windows Command Line Interface

CLI automation only works on Windows, which is annoying if you're on Mac. You'll need to set up a bunch of config files:

Most people use Windows Task Scheduler to run this crap nightly. Works fine until Windows decides to randomly not run your scheduled task for no apparent reason - usually around 2am when your manager gets the alert that the nightly sync didn't run and suddenly you're debugging on your phone from bed. Check the CLI troubleshooting guide when (not if) it breaks.

Enable task history in Task Scheduler or you'll be debugging blind when it fails - learned this during a 3am troubleshooting session that lasted until 6am. When you see "Task Scheduler service is not available", the service probably crashed - restart it and cross your fingers. Also, if you get "OutOfMemoryError: Java heap space" during large imports, increase the JVM heap size in the config - this one cost me an entire weekend once.

When Shit Breaks (And It Will)

Data Loader spits out CSV error logs that actually tell you what went wrong. This is refreshing compared to most Salesforce error messages that just say "UNKNOWN ERROR" and leave you to figure it out.

Clean your data first or you'll get thousands of "invalid email format" errors that could've been caught with basic validation. Always test in sandbox first - I once accidentally updated 50,000 account names in production because I forgot to change my target URL. That was a resume-updating conversation with my manager who asked very pointedly why the board of directors was suddenly listed as "Test Account 1" through "Test Account 50,000".

When imports inevitably fail, check the error troubleshooting guide and common error solutions. The data quality checklist could've prevented most of these disasters if I'd bothered to read it first.

API Limits: The Thing That'll Ruin Your Day

Data Loader uses SOAP API for figuring out your org's structure and Bulk API for actually moving data. It's smart enough to pick the right API version automatically, which is nice.

The real pain is API limits. If you're on Professional Edition, you get like 1,000 API calls per day, which sounds like a lot until you realize each batch uses an API call. Suddenly your 200,000 record import becomes a 1,000 batch operation and you're fucked. Monitor this in Setup → System Overview → API Usage or you'll get locked out halfway through.

Use Bulk API 2.0 when possible - it's more efficient and uses fewer API calls. The API limit calculator helps plan large imports.

Frequently Asked Questions

Q

What is the maximum number of records Salesforce Data Loader can process?

A

Technically, 150 million records with Bulk API 2.0, but in reality, you'll hit API limits or suffer a mental breakdown long before that. I've pushed 5 million records through it and it took 8 hours while I slowly questioned my life choices. Your mileage will vary based on how much of a disaster your data is and whether Salesforce decided to take a coffee break that day.

Q

Does Salesforce Data Loader work on Mac computers?

A

Yeah, it works on Mac, including the new ARM chips. But here's the catch

  • CLI automation is Windows-only, because apparently Salesforce thinks it's still 2005. So if you're on Mac and want to automate imports, you're stuck with the GUI or need to spin up a Windows VM.
Q

How do I automate Salesforce Data Loader operations?

A

CLI automation requires Windows and about 2 hours of setup that should take 30 minutes if you believe in fairy tales. You'll need to encrypt your password, write XML config files that hate you personally, create field mappings, and sacrifice a goat to the Windows Task Scheduler gods. Plan for it to break randomly and require debugging during dinner when your nightly sync fails spectacularly.

Q

What file formats does Data Loader support besides CSV?

A

Just CSV. That's it. No Excel, no JSON, no XML

  • just comma-separated values like we're still using floppy disks. You'll spend half your time converting Excel files to CSV and dealing with encoding issues that make your customer names look like ancient hieroglyphics. Use UTF-8 encoding or you'll get "Invalid UTF-8 character" errors and have to explain to legal why Mr. Smith became Mr. Smth.
Q

Can I undo or rollback Data Loader operations?

A

Nope, no undo button. Export your data first if you give a shit about being able to fix things later. Deleted records go to Recycle Bin for 15 days, but updated records? You're screwed if you don't have a backup. Always test in sandbox unless you enjoy explaining to your boss why 100,000 accounts now have the wrong phone numbers.

Q

What permissions do I need to use Salesforce Data Loader?

A

"API Enabled" permission (which your admin will forget to give you), read/write access to whatever objects you're importing to, and "Bulk API" permission for the good stuff. Standard users get jack shit by default, so you'll need to bug your admin. When Data Loader gives you "INSUFFICIENT_ACCESS" errors, this is probably why.

Q

How does Data Loader handle duplicate records?

A

Use the Upsert operation with external IDs

  • it updates existing records and creates new ones. Works great if your external IDs are actually unique. If they're not, you'll get duplicate records anyway and wonder why you didn't just clean your data first. Salesforce duplicate rules can block this if they're set up, which is both helpful and infuriating.
Q

Can Data Loader connect to multiple Salesforce organizations simultaneously?

A

Nope, one org at a time like we're back in the stone age. You can run multiple instances if you hate yourself, but you'll spend more time managing connections than actually importing data. If you're dealing with multiple orgs regularly, just use a cloud tool instead of this desktop dinosaur.

Q

What are the main limitations of using Data Loader?

A

CSV-only like it's the stone age, Windows-only automation (Mac users are screwed), no scheduling in the GUI, and error handling that requires actual human intervention. Plus API limits will block you if you're not careful. It's free though, so you get what you pay for.

Q

How do I troubleshoot Data Loader connection issues?

A

First thing: is your internet actually working?

Then check if your admin gave you API permissions (they probably didn't). If you see "Unable to find a valid IP range", your firewall is blocking it. Sandbox connections need the right My Domain URL or they'll fail with "INVALID_LOGIN: invalid username or password". Corporate firewalls love blocking this too. When in doubt, try the latest version

  • they fix connection bugs constantly.
Q

Does Data Loader consume Salesforce API limits?

A

Hell yes it does. Each batch burns an API call, so 1000 batches = 1000 API calls. Professional Edition gives you like 1000/day total, so do the math. When you hit the limit, you'll get "REQUEST_LIMIT_EXCEEDED: TotalRequests Limit exceeded" and have to wait until tomorrow. Monitor this in Setup  System Overview  API Usage or you'll get locked out halfway through your import.

Q

Can I export data with complex SOQL queries in Data Loader?

A

Yeah, you can write SOQL queries for exports. Basic WHERE clauses work fine, but if you need complex joins or subqueries, just use Workbench instead. Data Loader's SOQL editor is pretty basic and will make you want to pull your hair out for anything fancy.

Q

What happens if my Data Loader operation fails midway?

A

Good news: it processes in batches, so successful batches are already committed.

Bad news: you'll need to fix the error and reprocess the failures manually. The error logs actually tell you what went wrong, which is more than most Salesforce tools give you. Clean up the bad data and run just the failures

  • don't restart the whole thing.
Q

How secure is data transmission through Data Loader?

A

Data transfer to Salesforce is encrypted via HTTPS, and OAuth 2.0 handles authentication (no more security tokens, thank god). But your CSV files sit unencrypted on your hard drive like a security team's nightmare. Encrypt that shit yourself if you're dealing with sensitive data, or use encrypted folders.

Q

Can Data Loader handle custom objects and fields?

A

Yep, custom objects work fine. Data Loader pulls the metadata automatically so you'll see all your custom fields in the field mapping. Same functionality as standard objects

  • it doesn't give a shit if it's Account or some weird custom object your admin created called "Elephant_Tracking__c".

Essential Resources and Documentation

Related Tools & Recommendations

tool
Recommended

Skyvia - Unfucks Your Data Pipeline When Everything Else Dies

competes with Skyvia

Skyvia
/tool/skyvia/overview
64%
tool
Popular choice

jQuery - The Library That Won't Die

Explore jQuery's enduring legacy, its impact on web development, and the key changes in jQuery 4.0. Understand its relevance for new projects in 2025.

jQuery
/tool/jquery/overview
60%
tool
Popular choice

Hoppscotch - Open Source API Development Ecosystem

Fast API testing that won't crash every 20 minutes or eat half your RAM sending a GET request.

Hoppscotch
/tool/hoppscotch/overview
57%
tool
Popular choice

Stop Jira from Sucking: Performance Troubleshooting That Works

Frustrated with slow Jira Software? Learn step-by-step performance troubleshooting techniques to identify and fix common issues, optimize your instance, and boo

Jira Software
/tool/jira-software/performance-troubleshooting
55%
tool
Popular choice

Northflank - Deploy Stuff Without Kubernetes Nightmares

Discover Northflank, the deployment platform designed to simplify app hosting and development. Learn how it streamlines deployments, avoids Kubernetes complexit

Northflank
/tool/northflank/overview
52%
tool
Recommended

MuleSoft Anypoint Platform - Integration Tool That Costs More Than Your Car

Salesforce's enterprise integration platform that actually works once you figure out DataWeave and survive the licensing costs

MuleSoft Anypoint Platform
/tool/mulesoft/overview
52%
review
Recommended

MuleSoft Review - Is It Worth the Insane Price Tag?

After 18 months of production pain, here's what MuleSoft actually costs you

MuleSoft Anypoint Platform
/review/mulesoft-anypoint-platform/comprehensive-review
52%
tool
Popular choice

LM Studio MCP Integration - Connect Your Local AI to Real Tools

Turn your offline model into an actual assistant that can do shit

LM Studio
/tool/lm-studio/mcp-integration
50%
tool
Recommended

PowerCenter - Expensive ETL That Actually Works

similar to Informatica PowerCenter

Informatica PowerCenter
/tool/informatica-powercenter/overview
49%
tool
Popular choice

CUDA Development Toolkit 13.0 - Still Breaking Builds Since 2007

NVIDIA's parallel programming platform that makes GPU computing possible but not painless

CUDA Development Toolkit
/tool/cuda/overview
47%
pricing
Recommended

My Hosting Bill Hit Like $2,500 Last Month Because I Thought I Was Smart

Three months of "optimization" that cost me more than a fucking MacBook Pro

Deno
/pricing/javascript-runtime-comparison-2025/total-cost-analysis
45%
tool
Recommended

JavaScript - The Language That Runs Everything

JavaScript runs everywhere - browsers, servers, mobile apps, even your fucking toaster if you're brave enough

JavaScript
/tool/javascript/overview
45%
pricing
Recommended

Should You Use TypeScript? Here's What It Actually Costs

TypeScript devs cost 30% more, builds take forever, and your junior devs will hate you for 3 months. But here's exactly when the math works in your favor.

TypeScript
/pricing/typescript-vs-javascript-development-costs/development-cost-analysis
45%
news
Popular choice

Taco Bell's AI Drive-Through Crashes on Day One

CTO: "AI Cannot Work Everywhere" (No Shit, Sherlock)

Samsung Galaxy Devices
/news/2025-08-31/taco-bell-ai-failures
45%
tool
Recommended

Fivetran: Expensive Data Plumbing That Actually Works

Data integration for teams who'd rather pay than debug pipelines at 3am

Fivetran
/tool/fivetran/overview
44%
news
Popular choice

AI Agent Market Projected to Reach $42.7 Billion by 2030

North America leads explosive growth with 41.5% CAGR as enterprises embrace autonomous digital workers

OpenAI/ChatGPT
/news/2025-09-05/ai-agent-market-forecast
42%
news
Popular choice

Builder.ai's $1.5B AI Fraud Exposed: "AI" Was 700 Human Engineers

Microsoft-backed startup collapses after investigators discover the "revolutionary AI" was just outsourced developers in India

OpenAI ChatGPT/GPT Models
/news/2025-09-01/builder-ai-collapse
40%
news
Popular choice

Docker Compose 2.39.2 and Buildx 0.27.0 Released with Major Updates

Latest versions bring improved multi-platform builds and security fixes for containerized applications

Docker
/news/2025-09-05/docker-compose-buildx-updates
40%
news
Popular choice

Anthropic Catches Hackers Using Claude for Cybercrime - August 31, 2025

"Vibe Hacking" and AI-Generated Ransomware Are Actually Happening Now

Samsung Galaxy Devices
/news/2025-08-31/ai-weaponization-security-alert
40%
news
Popular choice

China Promises BCI Breakthroughs by 2027 - Good Luck With That

Seven government departments coordinate to achieve brain-computer interface leadership by the same deadline they missed for semiconductors

OpenAI ChatGPT/GPT Models
/news/2025-09-01/china-bci-competition
40%

Recommendations combine user behavior, content similarity, research intelligence, and SEO optimization