写一段代码,要求:1.读取/Users/rachel_lrq/Desktop/浙江分词.csv 2.去除停用词/Users/rachel_lrq/Desktop/实习/哈工大停用词表.txt 3.将结果保存在一个新的csv文件中
时间: 2023-08-12 16:05:50 浏览: 127
你可以使用Python的pandas和jieba库来完成这个任务。首先,你需要安装这两个库,可以使用以下命令:
```
pip install pandas jieba
```
接下来,你可以使用以下代码来实现你的需求:
```python
import pandas as pd
import jieba
# 读取CSV文件
df = pd.read_csv('/Users/rachel_lrq/Desktop/浙江分词.csv')
# 读取停用词表
stopwords = set()
with open('/Users/rachel_lrq/Desktop/实习/哈工大停用词表.txt', 'r', encoding='utf-8') as f:
for line in f:
stopwords.add(line.strip())
# 定义分词函数
def segment(sentence):
words = jieba.cut(sentence)
words = [word for word in words if word not in stopwords]
return ' '.join(words)
# 对文本进行分词处理
df['分词结果'] = df['文本内容'].apply(segment)
# 保存结果到新的CSV文件
df.to_csv('/Users/rachel_lrq/Desktop/分词结果.csv', index=False)
```
这段代码会读取指定路径下的CSV文件,然后使用jieba库进行分词,并去除停用词。最后,将处理后的结果保存到一个新的CSV文件中。请确保替换代码中的文件路径为你自己的实际路径。
阅读全文
相关推荐






<!DOCTYPE html>
<html lang="en">
<head>
<meta charset="UTF-8">
<title>03</title>
<script src="vue.global.js"></script>
</head>
<body>
<input type="text" v-model="search" placeholder="Search title.."/>
<label>Search title:</label>
posted by: {{ post.author }}
{{ post.title }}
<script>
class Post {
constructor(title, link, author, img) {
this.title = title;
this.link = link;
this.author = author;
this.img = img;
}
}
const {createApp} = Vue
createApp({
data() {
return {
search: '',
postList: [
new Post(
'Vue.js',
'https://vuejs.org/',
'Chris',
'https://vuejs.org//images/logo.png'
),
new Post(
'React.js',
'https://facebook.github.io/react/',
'Tim',
'https://daynin.github.io/clojurescript-presentation/img/react-logo.png'
),
new Post(
'Angular.js',
'https://angularjs.org/',
'Sam',
'https://angularjs.org/img/ng-logo.png'
),
new Post(
'Ember.js',
'http://emberjs.com/',
'Rachel',
'http://www.gravatar.com/avatar/0cf15665a9146ba852bf042b0652780a?s=200'
),
new Post(
'Meteor.js',
'https://www.meteor.com/',
'Chris',
'http://hacktivist.in/introduction-to-nodejs-mongodb-meteor/img/meteor.png'
),
new Post(
'Aurelia',
'http://aurelia.io/',
'Tim',
'https://cdn.auth0.com/blog/aurelia-logo.png'
),
new Post(
'Node.js',
'https://nodejs.org/en/',
'A. A. Ron',
'https://code-maven.com/img/node.png'
),
new Post(
'Pusher',
'https://pusher.com/',
'Alex',
'https://avatars1.githubusercontent.com/u/739550?v=3&s=400'
),
new Post(
'Feathers.js',
'http://feathersjs.com/',
'Chuck',
'https://cdn.worldvectorlogo.com/logos/feathersjs.svg'
),
]
}
},
computed: {
filteredList() {
/*TODO :在此完善代码*/
使用vue创建的一个项目即时筛选工具,在数据框输入字符后根据项目名称筛选匹配的内容
/*TODO:结束*/
}
}
}).mount('#app')
</script>
</body>
</html>










