32 lines
		
	
	
		
			818 B
		
	
	
	
		
			Markdown
		
	
	
	
	
	
			
		
		
	
	
			32 lines
		
	
	
		
			818 B
		
	
	
	
		
			Markdown
		
	
	
	
	
	
| ---
 | |
| title: smart_open 1.7.1
 | |
| ---
 | |
| 
 | |
| ## Python smart_open 1.7.1
 | |
| 
 | |
| smart_open is a python library for efficient streaming of large files from/to HDFS, S3, HTTP, or local files.
 | |
| The simplest way to install `smart_open` is using `pip`
 | |
| 
 | |
| ``` pip install smart_open ```
 | |
| 
 | |
| It is a good replacement for built-in open().
 | |
| It does everything open() can, and falls back to native open whenever possible.
 | |
| 
 | |
| #### Why?
 | |
| 
 | |
| It can be a pain while working with large S3 files using boto and boto3 libraries.
 | |
| `smart_open` builds on boto3 and is very simple to work with. 
 | |
| 
 | |
| 
 | |
| ``` python
 | |
| import smart_open
 | |
| 
 | |
| with smart_open.smart_open('s3://tmp-bucket/myfile.txt') as f:
 | |
|   lines = f.readlines()
 | |
|   for line in lines:
 | |
|     print line
 | |
| 
 | |
| ```
 | |
| 
 | |
| `smart_open` is an open source software, maintained on [GitHub](https://github.com/RaRe-Technologies/smart_open)
 |