Newest Questions

0 votes
0 answers
5 views

I want to chunk conversations from chat streams, both public group chats and direct messages. The goal is to pick up messages on a specific topic, group them into a single conversation with the ...
Mikko Ohtamaa's user avatar
0 votes
0 answers
6 views

I'm trying to train a GAN to generate new images. I tried training a StyleGAN, but the results were not good. The FID was too high, and the new images didn't look realistic. The dataset contains ...
arash's user avatar
  • 1
0 votes
1 answer
5 views

I have complex data with a wide range and a skewed distribution, and I'm planning to use a regression tree. Do I need to normalize or standardize my dataset?
Ocean's user avatar
  • 195
0 votes
0 answers
11 views

Hi everyone! I'm doing research for the first time on how well LLMs and DL models can structure scattered data through NER and RE. We are using a review paper on a domain that has no ontologies or ...
Daniel Farinha Ribeiro's user avatar
0 votes
0 answers
9 views

I am building a model on data from multiple clients packed into a single dataset. The model predicts a success rate for every event row. However, there is a particular feature, call it ...
ind_1617's user avatar
0 votes
0 answers
4 views

Issue: In Logi Symphony 25.2.0.1001-w, when configuring a measure in a Metric Set, there is normally a "Define text formatting" option to control how a numeric value is displayed (e.g., ...
Stevoisiak's user avatar
0 votes
0 answers
11 views

I have a dataset in which each picture contains many objects. What preprocessing should I do before training a GAN (StyleGAN) so that the model can distinguish the objects in the picture? Right now the result is not ...
arash's user avatar
  • 1
3 votes
0 answers
26 views

I have fitted a zero-inflated Poisson (ZIP) model to my count data with an excess of zeros (figure 1). Since it didn't capture the overdispersion in the data, and overdispersion is indeed ...
Paw in Data's user avatar
2 votes
0 answers
32 views

I am trying to learn Kaggle TPU and I am migrating a Flower Classification notebook from an older TPU v3-8 environment to the new TPU v5e-8 (TPU VM) environment on Kaggle. I was trying to migrate this ...
Player Mathinson's user avatar
2 votes
0 answers
31 views

I have a fairly good understanding of the basic convolution operation and how padding & stride affect its output, but dilation is something new to me. So I am going to make some broad statements ...
Yazad Pardiwala's user avatar
3 votes
2 answers
47 views

I was working on a dataset that is available on Kaggle. At first, I split my data with a train-test ratio of 90:10. Then I fit 24 different models (6 different regressors with 4 different ...
ArshakParsa's user avatar
1 vote
0 answers
11 views

I am working on a project that requires machine learning analysis, but I'm new to the field and still learning about different models. I just wanted to ask the community about the best model or ...
bakingchef8's user avatar
5 votes
1 answer
48 views

I am working on a network security-related project, in which I have to build a deep learning model to detect a specific attack. It's about detecting whether a network system of an organisation is a ...
lony235's user avatar
  • 53
2 votes
0 answers
25 views

I’m working with a large language model that has been configured to behave conservatively in high-stakes contexts: it refuses unsafe or ambiguous user requests, prioritizes client welfare over ...
Rex H's user avatar
  • 21
2 votes
1 answer
16 views

I am training a VAE architecture on microscopy images. The dataset has 1000 training images and 253 testing images. Images are resized to 128x128 or 256x256 input from the original resolution, which is ...
MT0820's user avatar
  • 21
0 votes
0 answers
9 views

I am currently studying the GloVe paper on word embeddings. In Section 3, "The GloVe Model", the model is derived from several desiderata, one of which confuses me. It concerns Equation 3, which ...
Nourless's user avatar
  • 203
0 votes
0 answers
7 views

My model takes in an image of a handwritten equation and converts it into its LaTeX representation. In order to do this, it uses a ResNet50 pre-trained model for feature extraction and a transformer ...
alt_zancudo's user avatar
0 votes
0 answers
12 views

I performed Random Forest–based feature importance analysis on the MNIST dataset, focusing only on digits 0 and 1. When I visualize the importance map (see image below), it doesn’t resemble the ...
Manish Yadav's user avatar
0 votes
1 answer
16 views

I have a large set of document embeddings, and I would like to sample a subset where the median or average pairwise distance is maximized. The idea here is to get a more balanced sample set where long ...
Layman's user avatar
  • 173
0 votes
0 answers
19 views

Since the point-biserial correlation is just a special case of Pearson's correlation, can I use it for data that violates the assumptions of Pearson's correlation (outliers, non-normal distribution, etc.)? If not, ...
Rachel's user avatar
  • 1
0 votes
0 answers
10 views

I have a large set of phrases obtained via Azure Fast Transcription, and I need to group them into coherent semantic chunks (to use later in a RAG pipeline). Initially, I tried grouping phrases based ...
Daniel's user avatar
  • 1
0 votes
0 answers
17 views

I have a set of fingerprints as a dataset (provided by my college). I want to use these fingerprints to train a model to understand the different things. That is beside the point....
Sayan's user avatar
  • 1
1 vote
0 answers
27 views

I need to generate a word cloud from a free-text dataset. I have tried several options and find most of them to be of poor quality. Tools I tried: PowerBI https://www.freewordcloudgenerator.com/...
otk's user avatar
  • 141
1 vote
0 answers
46 views

I was given a project in which, using only Net Media Value and possibly audience, I have to estimate sales and the unit return of media investment. I was asked to try to apply a Monte ...
Ana Branco's user avatar
8 votes
1 answer
2k views

I'm a software engineer working with medical device AI models that predict diseases and other conditions. For the most part, I don't design the models but I help with getting FDA clearance for them. ...
raner's user avatar
  • 183
0 votes
0 answers
14 views

Problem statement: I need to model contact inactivation, defined as a contact having 12 consecutive months with no touchpoints. At any given scoring date, contacts in the base can have different ...
davide cortellino's user avatar
0 votes
0 answers
16 views

I'm using Optuna to optimize LightGBM hyperparameters, and I'm running into an issue with the variability of best_iteration across different random seeds. Current ...
invalid syntax's user avatar
0 votes
1 answer
28 views

I am tasked with giving an example of a dataset in which the presence of an outlier or a null value dramatically affects the performance of a decision tree. I've searched and searched the web and I ...
Arunabh's user avatar
1 vote
0 answers
13 views

I want to know if it’s possible to make one machine learning model that can: Find causal relationships, Stay fair to different groups, and Work well when the data changes over time or across domains. ...
Praveen Kumar's user avatar
0 votes
0 answers
7 views

Why am I getting java.lang.OutOfMemoryError: Java heap space even though I have plenty of memory? My simple code creates a dataframe from input data, so no ...
user453575457's user avatar
0 votes
0 answers
7 views

Hi, I am currently working on sales territory optimization. I am using k-means but cannot handle some constraints set by the business, such as planning territories subject to constraints such as max ...
Brandon Heng's user avatar
1 vote
1 answer
51 views

I am trying to replicate the results of Metric Learning for Kernel Regression, using the metric-learn Python package to perform the MLKR computation. I am completely failing to replicate one ...
Smerdjakov's user avatar
0 votes
0 answers
7 views

I have a population of customers (n=50,000) that I would like to analyze (in RStudio) to predict which product is the Next Most Likely (NML) to be bought, based on the current population's active ...
help_help_help's user avatar
0 votes
1 answer
57 views

I want to develop a machine learning model that predicts the correct medicine dosage required to keep a specific lab value within the target range of 5 to 7. I also have several other predictor ...
user2704338's user avatar
3 votes
1 answer
101 views

I have a dataset which I split into training, testing, and out-of-time sets. Then I feed my training set into K Fold CV. I understand that K Fold Cross Validation is used as a method to select the ...
user24758287's user avatar
2 votes
2 answers
125 views

This is a question asked in my homework assignment. The full question is "Is Logistic Regression actually used for regression (predicting a continuous value)? If not, state what task it really ...
astro's user avatar
  • 23
0 votes
1 answer
44 views

I'm working on a binary classification problem to identify struggling students at a university. I have some features that are correlated, such as high_school_grade_1, which represents 75% of ...
Youness Belhaj's user avatar
3 votes
1 answer
56 views

My dataset consists of board games data: each board game is rated with a categorical variable (low, medium, high). I've plotted the LDA projection to check whether classes are linearly separable. The ...
Giulio Lanza's user avatar
0 votes
0 answers
11 views

I have a model in which I incorporate additional input alongside language sequence data. I feed these two inputs into two different transformers and then combine them by addition. Simply, I combine them with a ...
cuneyttyler's user avatar
0 votes
1 answer
21 views

Say I have two different models with different hyperparameters and I want to compare the performance of both models on some dataset. One model is much simpler than the other and, therefore, if I were ...
Frederico Portela's user avatar
0 votes
1 answer
40 views

Hi, I have a question about NLP with transformers: is any preprocessing required for BERT models? If so, what type of preprocessing is required? Could anyone clarify ...
Ardala Gopi Krishna's user avatar
1 vote
0 answers
35 views

How can I visualise a hierarchical ontology of items in embedding space, combining text embeddings with the graphical structure? (Something similar to the example below) I have a hierarchical ...
baked goods's user avatar
1 vote
0 answers
34 views

I'd appreciate your thoughts on the following problem. I've created a heatmap plot (attached) showing the cluster membership ratio for each participant (in separate subplots) and condition (η). Now, I'...
maria mystakidou's user avatar
1 vote
1 answer
27 views

I am trying to understand the main differences between reinforcement learning and unsupervised learning when training ML models. I know the main difference is that reinforcement learning aims ...
Mach5's user avatar
  • 21
0 votes
0 answers
14 views

Suppose I am a free consultant for a single industry of 300 total companies, offering operational consultations. In 2024, I consulted for 100 companies. To expand my practice, I'd like to show that my ...
Santana Afton's user avatar
0 votes
0 answers
17 views

I'm working on a binary classification problem to identify struggling students. My dataset contains 10 features and 200 samples, and I use nested CV. The distribution of the target variable is 58%/...
Youness Belhaj's user avatar
3 votes
1 answer
41 views

What I'm trying to figure out: I'm working on a machine learning project and would love to hear your thoughts on two things: A. How to prioritize feature exploration B. Whether to fix hyperparameters (...
Ten's user avatar
  • 31
0 votes
0 answers
57 views

Given the basic elements of a neuron (as below) with a bias value: I learned that a bias value allows you to shift the activation function (say, the sigmoid function) to the left or right, which may be critical ...
overexchange's user avatar
1 vote
0 answers
13 views

When building a binary classification model using a neural network, you have two options for outputs: output a single number from 0 to 1 using sigmoid activation, or output a probability distribution ...
Mach5's user avatar
  • 21
2 votes
0 answers
12 views

I want to build models to identify over 30 different social and non-social behaviors in pairs and groups of co-housed mice, based on markerless motion capture of their movements in top-down video ...
Vietnamese IPhO Competitant - 's user avatar
