当前位置：首页 > news >正文

Convolutional Neural Networks for Sentence Classification

news 2025/8/16 10:14:05

摘要

We report on a series of experiments with
convolutional neural networks (CNN)
trained on top of pre-trained word vectors for sentence-level classification tasks.
We show that a simple CNN with little hyperparameter tuning and static vectors achieves excellent results on multiple benchmarks. Learning task-specific
vectors through fine-tuning offers further
gains in performance. We additionally
propose a simple modification to the architecture to allow for the use of both
task-specific and static vectors. The CNN
models discussed herein improve upon the
state of the art on 4 out of 7 tasks, which
include sentiment analysis and question
classification

任务：句子级分类任务sentence-level classification tasks
a simple CNN with little hyperparameter tuniing and static vectors.

模型架构

在这里插入图片描述
$xi∈Rkx_i \in R^k$ the k-dimensional word vector
A sentence of length: $n$
$x1:n=x1⊗x2⊗⋯,⊗xnx_{1:n} = x_1\otimes x_2\otimes \cdots,\otimes x_n$

$⊗\otimes$ is the concatenation operator.

$x_{i:i+j}$ 代表单词的拼接
$\in R^{hk}$ : 卷积滤波器。

卷积操作

在这里插入图片描述

a max-over-time pooling operation

$c^=max⁡c\hat{c} = \max{c}$

倒数第二层加入dropout ,防止过拟合。

解决了句子长度可变问题。

the penultimate layer 倒数第二层

a fully connected softmax layer

数据集

MR

SST-1:

SST-2:

Subj

TREC

CR

• MPQA

更新算法

随机梯度下降法：Adadelta 算法
预训练词向量：the publicly available word2vec vectors

模型变体

CNN-rand
CNN-static
CNN-non-static
CNN-multichannel

http://www.lryc.cn/news/19172.html

相关文章：

基于SpringBoot的共享汽车管理系统

TCP三次握手

未来土地利用模拟FLUS模型

压力传感器MPX5700D/MPX5700GP/MPX5700AP产品概述、特征

taobao.trades.sold.query( 根据收件人信息查询交易单号 )

【JavaWeb】JSON、AJAX（305-317）

AI入场，搜索这个“营销枢纽”有新故事吗？

字节在职5年，一个测试工程师的坎坷之路

什么是web框架？

说一说关系数据库中的范式建模

Mysql是怎样运行的之Inno页介绍

【华为OD机试模拟题】用 C++ 实现 - 找字符（2023.Q1）

JAVA 8 新特性 Lamdba表达式

使用antlr实现一个简单的表达式解析

2月24日作业

SpringBoot可以同时处理多少请求？

代码随想录【Day23】| 669. 修剪二叉搜索树、108. 将有序数组转换为二叉搜索树、538. 把二叉搜索树转换为累加树

Wsl2 ubuntu 配置git 阿里云codeup

展会邀约 | 昂视与您相约BTF第12届上海锂电展

RK3568平台开发系列讲解（驱动基础篇）中断子系统框架

消费复苏迎“春”暖，服装行业如何开启“狂飙”模式？

Springboot 整合Flowable工作流框架搭建

ASE0510SH-ASEMI的MOS管ASE0510SH

Golang学习Day2

Android 中malloc_debug 原理详解

D. Triangle Coloring【组合数学，乘法逆元】

【读论文】AttentionFGAN

ClickHouse 配置文件使用说明

如果不是互联网人，谁会找到这些神器?