---
title: smart_open 1.7.1
---

## Python smart_open 1.7.1

`smart_open` is a Python library for efficiently streaming large files to and from HDFS, S3, HTTP, or the local file system.
The simplest way to install `smart_open` is with `pip`:

```
pip install smart_open
```

It is a good replacement for Python's built-in `open()`.
It does everything `open()` can, and falls back to the native `open()` whenever possible.

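
The fallback idea can be illustrated with a small standard-library sketch. This is not `smart_open`'s actual code: `open_uri` is a hypothetical helper, and the sketch assumes POSIX-style paths. It inspects the URI scheme and hands plain local paths straight to the built-in `open()`.

```python
import os
import tempfile
from urllib.parse import urlparse


def open_uri(uri, mode="r"):
    """Hypothetical dispatcher: use the built-in open() for local paths."""
    parsed = urlparse(uri)
    if parsed.scheme in ("", "file"):
        # Local path or file:// URI: fall back to the native open().
        path = parsed.path if parsed.scheme == "file" else uri
        return open(path, mode)
    # A real library would hand remote schemes (s3, hdfs, http) to a
    # streaming reader here; this sketch deliberately stops short of that.
    raise NotImplementedError("no handler for scheme %r" % parsed.scheme)


# Demo: write a temporary local file, then read it back through the helper.
with tempfile.NamedTemporaryFile("w", suffix=".txt", delete=False) as tmp:
    tmp.write("hello\n")

with open_uri(tmp.name) as f:
    print(f.read())

os.remove(tmp.name)
```

Because the local branch returns an ordinary file object, code written against `open()` keeps working unchanged; only the remote schemes need special handling.
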
#### Why?

Working with large S3 files through the boto and boto3 libraries can be a pain.
`smart_open` builds on boto3 and is very simple to work with.

``` python
import smart_open

# Stream the S3 object line by line instead of reading it all into memory.
with smart_open.smart_open('s3://tmp-bucket/myfile.txt') as f:
    for line in f:
        print(line)
```

`smart_open` is open source software, maintained on [GitHub](https://github.com/RaRe-Technologies/smart_open).